xDM Discovery Tables Reference
xDM Discovery stores in the Semarchy xDM repository the profiling metrics, using the tables described in this document. Use this reference information to customize the existing charts or build your own visualizations on top of this data.
PRF_PROFILING
This table contains information about the profiling status of the tables.
Column Name | Description | |
---|---|---|
|
text |
Name of the datasource. |
|
text |
Name of the table. |
|
text |
Profiling status of the table in the profiling process.
|
|
number |
Progress of a running profiling process (1..100) |
|
text |
Error message when the profiling process has failed. |
|
datetime |
Timestamp when the profiling process was queued. |
|
datetime |
Timestamp when the profiling process was started. |
|
datetime |
Timestamp when the profiling process was finished. |
PRF_TABLE
This table contains profiling metrics for the tables.
Column Name | Description | |
---|---|---|
|
text |
Name of the datasource. |
|
text |
Name of the table. |
|
number |
Number of rows in the tables |
|
datetime |
Timestamp when the profile was updated. |
PRF_COLUMN
This table contains profiling metrics for the columns.
These metrics are reported for the entire table. They are not limited by the table’s profiling option. |
Column Name | Description | |
---|---|---|
|
text |
Name of the datasource. |
|
text |
Name of the table. |
|
text |
Name of the profiled column. |
|
number |
Ordinal position of the column in the table. |
|
text |
Datatype of the column. |
|
number |
Number of rows in the table. |
|
text |
Lowest value found in the column for all records. |
|
text |
Highest value found in the column for all records. |
|
text |
Average value found for the column for all records. |
|
text |
Most frequent value found for the column for all records. |
|
number |
Minimum value length found for the column for all records. |
|
number |
Maximum value length found for the column for all records. |
|
number |
Average value length found for the column for all records. |
|
number |
Number of null value occurrences found in the column for all records. |
|
number |
Number of distinct values (different values) found in the column for all records. |
|
number |
Number of unique values (values that appear once) found in the column for all records. |
|
number |
Number of records with values unique for the column. |
|
number |
Number of non-unique values (values that appear two or more times) found in the column for all records. |
|
number |
Number of records with values non-unique for the column. |
|
text |
Not loaded yet. Reserved for future use. |
Note that not all metrics are available for all datatypes. The following table lists the metrics available depending on the column datatype.
Metric | Applicable datatype |
---|---|
|
All |
|
text, numeric, date & datetime |
|
numeric, date & datetime |
|
text, numeric, date & datetime, boolean |
|
text, longtext, binary |
|
text, numeric, date & datetime, boolean |
|
Not available yet. |
Value distribution |
text, numeric, date & datetime |
Pattern distribution |
text |
PRF_DIST_VALS
This table contains the list of distinct values in the profiled columns.
Only a subset of the column values are reported. The distinct values reported for each column are limited by the table’s Maximum distinct values stored profiling option. |
Column Name | Description | |
---|---|---|
|
text |
Name of the datasource. |
|
text |
Name of the table. |
|
text |
Name of the profiled column. |
|
text |
Column value. |
|
number |
Number of occurrences of the value found in the table. |
PRF_DIST_PATTERNS
This table contains the list of distinct patterns found in the values of the profiled columns.
Only a subset of the column value patterns are reported. The patterns reported for each column are limited by the table’s Maximum patterns stored profiling option. |
Column Name | Description | |
---|---|---|
|
text |
Name of the datasource. |
|
text |
Name of the table. |
|
text |
Name of the profiled column. |
|
text |
Pattern found in the column values. |
|
number |
Number of occurrences of the pattern in the column. |
|
text |
Highest value matching this pattern in the column. |
|
text |
Lowest value matching this pattern in the column. |