Data Tools Services Overview

Legacy System Notice: Data Tools is a legacy application targeted for migration into BERT (Base Environment for Re-tooled Technology) in 2026. This documentation captures the current state for reference during the migration planning process.

What is Data Tools?

Data Tools is a collection of automated services that handle data processing workflows at Path2Response. These services manage file ingestion, conversion, validation, and various operational tasks that support the core data cooperative business.

Architecture

Data Tools consists of three categories of services:

  1. Embedded Services - Run within the main app.js application
  2. Micro Services - Standalone services for specific processing tasks
  3. Deprecated Services - Legacy functionality no longer in active use

1. Embedded Services (app.js)

These services are embedded directly in the main Data Tools application:

| Service | Description |
| --- | --- |
| autoConvert | Picks up files where the map is enabled and the file tagging matches a map |
| autoPreprocess | Creates documents for custom processes where there is a single file tagged as preprocess and the preprocess is auto-enabled |
| autoEnableRoute | Re-enables routes that show up in SFTP Waiting when the route has been manually disabled |
| autoValidate | Automatically reviews documents where the map has AutoValidate enabled. Marks those that pass thresholds as “Review Complete” |
| oldAccount | Non-functional. When enabled, only reports on SmartFile users who last logged in over a year ago. The configuration document is disabled indefinitely |
| productMapImport | Finds and processes productMap files provided by Production Operations |
| browseStatus | Checks the titlekey profiles of second-party converts and posts their status to the #sitetagging Slack channel |
| sftpWaiting | Scans SmartFile for client data that has not been pulled down. Mainly used to monitor locked SmartFile paths (brokers that post with the wrong credentials) |
| updateTitlesInZendesk | Updates the list of titlekeys in Zendesk for organizational purposes |
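
The autoValidate description above amounts to a threshold check gated by a per-map flag. The following is a minimal sketch of that idea, not the actual implementation: the field names (`autoValidate`, `thresholds`, `stats`, `status`, `failures`) and the "metric must meet a minimum" rule are assumptions, since this overview does not describe the document or map schema.

```javascript
// Hypothetical document and map shapes; every field name here is an
// assumption made for illustration.
function autoValidate(doc, map) {
  // Maps without AutoValidate enabled are left for manual review.
  if (!map.autoValidate) {
    return { ...doc, reviewed: false };
  }

  // Collect every metric that is missing or below its minimum.
  const failures = Object.entries(map.thresholds)
    .filter(([metric, minimum]) => !(doc.stats[metric] >= minimum))
    .map(([metric]) => metric);

  if (failures.length === 0) {
    return { ...doc, reviewed: true, status: "Review Complete" };
  }
  // Failing documents keep their status and record why they failed.
  return { ...doc, reviewed: true, failures };
}
```

Returning a new object rather than mutating `doc` keeps the check easy to test in isolation; a real service would presumably persist the status change instead.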

2. Micro Services

Standalone services that handle specific processing workflows:

| Service | Description |
| --- | --- |
| responseAnalysis | Collects stats on fulfillment files, mail files, list of lists, and response files, and on how they overlap, to provide feedback on order performance |
| fileOperationsMaster | Scans all data rules/incoming routes for data to be pulled. Creates documents that fileOperationsChild picks up |
| fileOperationsChild | Processes documents created by fileOperationsMaster. Performs the initial extraction and cleaning of client-provided data into a plain-text format that convert can process |
| importRunner/convert | Process that picks up files designated to be processed by convert. Picks up “convertrun” documents and appends post-convert stats to them |
| preprocessRunner | Picks up new custom process requests and submits them to their respective custom process |
| cassRunner | Picks up files containing address information that have completed convert and been marked “Review Complete”. Attaches CASS data objects |
| missingImportRunner | Queries MongoDB for expected data and posts any data that is not in the anticipated state to #orders-data |
| taskManager | Creates tasks relating to global block mismatch status, Salesforce title sync status, and recency |
| salesForceSync | Pulls information from Salesforce (via Dashboards) to monitor titlekey matching and global block mismatches |
| dsrRunner | Looks for Data Subject Request tickets in Zendesk (usually originating from OneTrust) and performs the appropriate DSR action. Automatically closes Zendesk tickets and corresponding OneTrust requests when processing finishes |
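
The fileOperationsMaster and fileOperationsChild descriptions above suggest a producer/consumer handoff through stored documents. As a rough sketch of that pattern only: the queue here is a plain in-memory array standing in for the document store, and the names `masterScan`, `childProcess`, `waitingFiles`, and the `pending`/`done`/`failed` statuses are all hypothetical.

```javascript
// Master side: scan each route and create one work document per waiting file.
// Returns how many documents were created on this scan.
function masterScan(routes, queue) {
  let created = 0;
  for (const route of routes) {
    for (const file of route.waitingFiles) {
      queue.push({ route: route.name, file, status: "pending" });
      created += 1;
    }
  }
  return created;
}

// Child side: attempt every pending document and record the outcome.
// Returns how many documents were attempted (successful or not).
function childProcess(queue, handleFile) {
  let attempted = 0;
  for (const doc of queue) {
    if (doc.status !== "pending") continue;
    try {
      handleFile(doc); // e.g. extract and clean the file
      doc.status = "done";
    } catch (err) {
      doc.status = "failed";
      doc.error = err.message;
    }
    attempted += 1;
  }
  return attempted;
}
```

Keeping the scan and the processing in separate functions that communicate only through the queue mirrors the split between the two services; a failed file is recorded on its own document and does not stop the rest of the batch.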

3. Deprecated Services

Services no longer in active use:

| Service | Description |
| --- | --- |
| sameDayOmitRunner | Grabs HHids / IndIds after CASSing data and matching to households. Used for suppression purposes. |

Key Integrations

Data Tools integrates with several internal and external systems:

  • SmartFile - SFTP server for client file uploads
  • MongoDB - Document storage for processing state
  • Slack - Notifications (#sitetagging, #orders-data channels)
  • Zendesk - Ticket management and titlekey organization
  • Salesforce - Title synchronization via Dashboards
  • OneTrust - Data Subject Request origination
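
For migration planning it can help to see which services touch each integration. The lookup below is a sketch transcribed from the service descriptions in this overview; it lists only services whose description names the integration explicitly, so it is likely incomplete, and the helper name `integrationsFor` is hypothetical.

```javascript
// Integration -> services, taken from the service descriptions in this
// overview. Services whose descriptions do not name an integration are omitted.
const INTEGRATION_USERS = {
  SmartFile: ["oldAccount", "sftpWaiting"],
  MongoDB: ["missingImportRunner"],
  Slack: ["browseStatus", "missingImportRunner"],
  Zendesk: ["updateTitlesInZendesk", "dsrRunner"],
  Salesforce: ["salesForceSync"],
  OneTrust: ["dsrRunner"],
};

// Reverse lookup: which integrations does a given service depend on?
function integrationsFor(service) {
  return Object.keys(INTEGRATION_USERS).filter((name) =>
    INTEGRATION_USERS[name].includes(service)
  );
}

console.log(integrationsFor("dsrRunner")); // [ 'Zendesk', 'OneTrust' ]
```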

Data Flow

Client Data (SmartFile)
        ↓
fileOperationsMaster (scan routes)
        ↓
fileOperationsChild (extract/clean)
        ↓
importRunner/convert (process)
        ↓
autoValidate (review)
        ↓
cassRunner (CASS append)
        ↓
Ready for downstream processing
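
The flow above can be read as a fixed, ordered list of stages. A small sketch of that ordering follows; the stage names come from the diagram, while the `nextStage` helper and the use of `null` to mean "ready for downstream processing" are assumptions for illustration.

```javascript
// Stage order as drawn in the data flow above.
const PIPELINE = [
  "fileOperationsMaster",
  "fileOperationsChild",
  "importRunner/convert",
  "autoValidate",
  "cassRunner",
];

// Returns the stage that follows `current`, or null once the last stage
// has run and the data is ready for downstream processing.
function nextStage(current) {
  const index = PIPELINE.indexOf(current);
  if (index === -1) {
    throw new Error(`Unknown stage: ${current}`);
  }
  return index + 1 < PIPELINE.length ? PIPELINE[index + 1] : null;
}

console.log(nextStage("autoValidate")); // cassRunner
```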

Migration Considerations

When migrating to BERT, consider:

  1. Service consolidation - Many embedded services may be candidates for consolidation
  2. Non-functional services - oldAccount is disabled and may not need migration
  3. Deprecated services - sameDayOmitRunner should be evaluated for retirement
  4. Integration points - SmartFile, Zendesk, Salesforce integrations need BERT equivalents
  5. Notification patterns - Slack channel notifications should map to BERT alerting
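
As one way to apply the first three considerations, the sketch below tags an abbreviated service inventory with a planning note. The inventory entries come from the tables in this document; the `category` and `nonFunctional` fields, the function names, and the wording of each note are assumptions, and the notes are prompts for review rather than decisions.

```javascript
// Abbreviated inventory; extend with the remaining services as needed.
const SERVICES = [
  { name: "autoConvert", category: "embedded" },
  { name: "oldAccount", category: "embedded", nonFunctional: true },
  { name: "cassRunner", category: "micro" },
  { name: "sameDayOmitRunner", category: "deprecated" },
];

// Map one service to the consideration that applies to it, if any.
function migrationNote(service) {
  if (service.category === "deprecated") return "evaluate for retirement";
  if (service.nonFunctional) return "may not need migration";
  if (service.category === "embedded") return "possible consolidation candidate";
  return "no specific consideration listed";
}

// Group service names by their planning note.
function groupByNote(services) {
  const groups = {};
  for (const service of services) {
    const note = migrationNote(service);
    if (!groups[note]) groups[note] = [];
    groups[note].push(service.name);
  }
  return groups;
}

console.log(groupByNote(SERVICES));
```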

Source: Data Tools Services (Confluence)
Last updated from source: 2019-07-25
Documentation created: 2026-01-24