Field Dictionary

Overview:


The Field Dictionary in DvSum provides users with a comprehensive view of all data columns across different sources and datasets. It consolidates metadata, profiling details, and data quality insights into a structured interface, enabling users to explore and manage their data efficiently.

 

Accessing the Field Dictionary

On the left-side menu of the DvSum application, click on Field Dictionary to navigate to its detailed page.

This page displays all column-specific details, provided that the sources are connected and cataloged.

 

Field Dictionary Details

The Field Dictionary provides an organized view of column-related metadata, including:

  • Source
  • Table
  • Column Name 
  • Business Name
  • Column Description
  • Data Type
  • Profiling Info
  • Range Value
  • DQ Score
  • Column Sub-type

Column Detail Page

Clicking on a Column Name opens the Column Detail Page, which contains three tabs:

  1. Overview – Displays general metadata and descriptions.
  2. Profiling – Shows profiling statistics and insights.
  3. Data Quality – Highlights rule violations and data quality metrics.

Overview Tab:

The Overview tab in the Field Dictionary comprises several key sections that provide a structured summary of column attributes and configurations. These sections typically include:

  • Column Name & Description: Displays the name of the column along with a brief description of its purpose.
  • Data Type: Specifies the type of data the column holds (e.g., String, Integer, Boolean).
  • Source & Lineage: Identifies the origin of the column and traces its transformation across datasets.
  • Usage & Dependencies: Indicates where and how the column is utilized within the system.
  • Constraints & Validation: Lists any applied rules, such as mandatory columns, unique constraints, or default values.
  • Tags & Classification: Allows categorization of columns for better organization and governance.
  • Additional Info: Displays any extra details relevant to the column.
  • Similar Columns: Suggests columns with similar characteristics to aid in analysis and comparison.

Editing and Deleting a Column
On the Overview page, users have the option to either Edit or Delete a column.

  • Edit: Clicking the Edit button (represented by a blue button with a pencil icon) allows users to modify the column’s properties, such as its name, tags, or other metadata.

  • Delete: The Delete button (represented by a red trash bin icon) enables users to remove the column permanently. Users should exercise caution while deleting, as this action may not be reversible.

 

Profiling Tab:

The Profiling tab in the Field Dictionary contains the following sections:

  • Statistics: Displays key metrics such as record count, number of empty/null values, unique values, and completeness percentage.
  • Information: Provides details including the last profiled date, min-max values, whether the column is a primary key or nullable, technical data type, column position, size, and the number of fractional digits.
    Note: The Min-Max values depend on the column's data type. For string data types, the minimum and maximum values are determined based on lexicographical order.
  • Visualization: Includes graphical representations of data distributions, with options to view distribution and pattern insights.
  • Run Profiling: An option is available to initiate online profiling for updating column statistics and insights.

 

 

Data Quality Tab:

The Data Quality tab in the Field Dictionary contains the following sections:

  • Statistics: Displays key metrics such as DQ Score, total rules, rules with alerts, total exceptions, total records scanned, and last rule execution status.
  • Rules: Lists all data quality rules applied to the column, including details such as rule ID, description, run status, alert status, and exception count.
  • Add Rule: Provides an option to add new data quality rules for monitoring and validation. Only the following three rules can be applied:
    • Blanks: Checks for missing or empty values in the column.
    • Value Range: Ensures that data falls within a specified range.
    • Data Format: Validates the format of the data based on predefined patterns.

 

DQ Score History:

Clicking on the arrow icon history is displayed.


Learn more about the Rules section in Data Quality tab here

Field Reference Page:
The Field Reference page appears when a user clicks on a column name from the homepage.

It slides out from the right side of the screen, displaying relevant information about the selected column.

Have more questions? Submit a request

0 Comments

Please sign in to leave a comment.
Powered by Zendesk