Version: Angophra

Data Source Reports

Menu path: Observe → Deep Dive → Data Sources
URL: /en/insights/dashboard/identity/datasources

Purpose

The Data Source Reports page provides visibility into the directories and HR feeds connected to Apporetum - the systems from which account and person data is synchronised. It shows what types of directories are connected, how many accounts each contributes, their sync health, and how they are configured for provisioning.

This page is operational and governance-focused. It answers the question: "Are my connected data sources working, and are they configured correctly?" A data source that is not syncing correctly means the downstream reports in Observe are based on stale or incomplete data, which undermines the accuracy of every other insight in the module.


Charts and Reports

What is the source of the accounts?

Chart type: Grouped bar chart
What it shows: The count of accounts in each data source, broken down by account type within each source. The Y-axis shows account counts, and each cluster on the X-axis represents a data source (Modern IAM, Hospital, Modern 443, Government Users, etc.).

Why it matters: This chart gives you an immediate visual sense of the relative size and composition of each connected directory. Comparing a very large data source with many account types (like Modern IAM with over 9,000 accounts) against a small CSV-uploaded data source (like Hospital with 25 accounts) shows how your account population is distributed. Unexpected growth or shrinkage in a data source's account count between sync cycles can indicate data issues.

What to look for: Compare the current values against your expected population for each data source. Investigate large unexplained changes in account counts - a data source that suddenly shows significantly fewer accounts may have had accounts deleted, or a sync failure may have prevented new accounts from importing.
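
The comparison described above can be sketched in a few lines of Python. This is a minimal illustration only, assuming you have recorded the account count per data source for two consecutive sync cycles in plain dictionaries. The function name `flag_count_changes`, the 20% threshold, and the sample figures (other than the source names taken from the chart) are hypothetical and are not part of Apporetum.

```python
def flag_count_changes(previous, current, threshold=0.20):
    """Return data sources whose account count moved by more than `threshold`.

    `previous` and `current` map a data source name to its account count.
    The result maps each flagged name to its relative change, or to None
    when there is no earlier baseline to compare against.
    """
    flagged = {}
    for name in sorted(set(previous) | set(current)):
        before = previous.get(name, 0)
        after = current.get(name, 0)
        if before == 0:
            # New or previously empty source: no baseline, so flag it for review.
            if after > 0:
                flagged[name] = None
            continue
        change = (after - before) / before
        if abs(change) > threshold:
            flagged[name] = change
    return flagged


# Illustrative counts only (assumed values, not real report data).
previous = {"Modern IAM": 9100, "Hospital": 25, "Government Users": 400}
current = {"Modern IAM": 9150, "Hospital": 25, "Government Users": 250}

result = flag_count_changes(previous, current)
for name, change in result.items():
    print(name, "new source" if change is None else f"{change:+.1%}")
```

A relative threshold is used here so that a small CSV-uploaded source and a large directory are judged on the same scale; a suitable value depends on how much churn you normally expect in each source.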

Account Data Source Type

Chart type: Donut chart
What it shows: The distribution of all accounts across data source types - for example, how many accounts are in Entra ID tenants vs Active Directory vs CSV uploads.

Why it matters: Understanding the technology mix of your account landscape helps with governance planning. CSV uploads are typically manual processes with no real-time sync, which means they require more active maintenance. Entra ID tenants can be synced continuously. Active Directory domains sync on a schedule. Knowing this breakdown helps you understand where your governance automation is strongest and where manual oversight is still required.
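
To make the breakdown concrete, the sketch below computes the share of accounts per data source type, which is the figure the donut chart presents visually. It assumes a hypothetical list of account records with a `source_type` field; that field name, the function name `share_by_type`, and the sample data are illustrative assumptions rather than an Apporetum export format.

```python
from collections import Counter


def share_by_type(accounts):
    """Return each data source type's share of all accounts as a fraction."""
    counts = Counter(account["source_type"] for account in accounts)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {source_type: count / total for source_type, count in counts.items()}


# Illustrative records only (assumed shape and values).
accounts = (
    [{"source_type": "Entra ID"}] * 6
    + [{"source_type": "Active Directory"}] * 3
    + [{"source_type": "CSV File Upload"}] * 1
)

shares = share_by_type(accounts)
for source_type, share in shares.items():
    print(f"{source_type}: {share:.0%}")
```

In this example the CSV File Upload share is the portion of the account population that depends on manual maintenance.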


The Data Sources Table

The table lists all connected data sources with the following columns:

  • Data Source Name - The name configured in Apporetum
  • Type - The directory type (Entra ID / Azure AD, Active Directory, CSV File Upload)
  • Last Sync - When the last synchronisation was performed and how many accounts were synced
  • Sync Schedule - How often syncs are configured (or N/A if manual)
  • Status - Active or inactive
  • Has App Implementation - Whether an application-based provisioning integration is configured
  • Provisioning Config columns - Details of the automated provisioning configuration, including service principal IDs, job IDs, and authentication settings

Interpreting the table

What to look for in each column:

  • Last Sync - Any data source where the last sync is more than 24-48 hours ago (for production systems) should be investigated. Stale data means inaccurate reports.
  • Sync Schedule - "N/A" indicates a manual sync process - ensure there is a documented process and schedule for these.
  • Status - Any data source showing "Inactive" when it should be active needs investigation.
  • Has App Implementation - A red indicator here means no app-based provisioning is configured, so any provisioning is manual or via a different mechanism.
  • Provisioning Config - The presence of a Provisioning Config means automated provisioning (writing changes back to the directory) is set up. This should be reviewed to ensure the service principal and authentication settings are correct and the job is running.
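
The Last Sync and Status checks above can be expressed as a simple staleness test. The following is a sketch under stated assumptions: it treats each table row as a dictionary with hypothetical `name`, `status`, and `last_sync` keys, and uses 24 hours as the default limit (the lower end of the 24-48 hour guidance). None of these names come from an Apporetum API.

```python
from datetime import datetime, timedelta, timezone


def find_stale_sources(sources, now, max_age=timedelta(hours=24)):
    """Return names of active data sources whose last sync is older than `max_age`."""
    stale = []
    for source in sources:
        if source["status"] != "Active":
            # Inactive sources are a separate question (should they be active?).
            continue
        last_sync = source["last_sync"]
        # A source that has never synced is treated as stale.
        if last_sync is None or now - last_sync > max_age:
            stale.append(source["name"])
    return stale


# Illustrative rows only (assumed shape and values).
now = datetime(2024, 1, 10, 12, 0, tzinfo=timezone.utc)
sources = [
    {"name": "Modern IAM", "status": "Active", "last_sync": now - timedelta(hours=2)},
    {"name": "Hospital", "status": "Active", "last_sync": now - timedelta(days=5)},
    {"name": "Government Users", "status": "Inactive", "last_sync": None},
]

stale = find_stale_sources(sources, now)
print(stale)
```

Passing `now` explicitly keeps the check repeatable. For manually synced sources (Sync Schedule "N/A"), a longer `max_age` that matches the documented manual schedule is likely more appropriate than the production default.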

Data source health and its impact on reporting

When a data source shows issues (red status, failed last sync), the data from that directory is not current in Apporetum. This means:

  • Orphaned account counts may be inflated (accounts that were deleted from the directory still appear)
  • Active accounts that were recently created may not appear
  • Disabled accounts may still appear as enabled
  • The Daily Insights metrics on the Overview page may be inaccurate

Always check data source health first when investigating any anomalous metrics in the other Observe reports.