Product · 2016

Domain & Host Data-Enrichment Script

Overview

An early PHP utility that collects and processes domain and host data, enriches it via third-party APIs, and persists the results to an OrientDB graph database.

Why It Exists

Collecting and structuring data about domains and hosts at scale requires batch processing and a store that models relationships well. This script was an early-career attempt at building such a pipeline.

What We Built

The project consists of two batch processors, ProcessDomains.php and ProcessHosts.php, each of which iterates over input data and enriches it. The Composer-managed stack includes piwik/device-detector for user-agent parsing, mashape/unirest-php for outbound HTTP/API calls, and ostico/phporient for writing to OrientDB, a graph/document database well suited to domain–host relationships.
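The iterate-enrich-persist loop described above can be sketched roughly as follows. This is a minimal illustration, not the original ProcessDomains.php: the function name processDomains, the record shape, and the two closures are assumptions. In the real script the enrichment step would have been an HTTP call made through Unirest and the persistence step a write through PhpOrient; here both are replaced by in-memory stand-ins so the sketch runs without dependencies.

```php
<?php
declare(strict_types=1);

/**
 * Hypothetical batch processor: normalizes each domain, enriches it via an
 * injected callable, and hands the merged record to an injected writer.
 * Returns simple counters so a run can be summarized afterwards.
 */
function processDomains(iterable $domains, callable $enrich, callable $persist): array
{
    $stats = ['processed' => 0, 'failed' => 0];
    $seen = [];

    foreach ($domains as $raw) {
        $domain = strtolower(trim((string) $raw));
        if ($domain === '' || isset($seen[$domain])) {
            continue; // skip blank lines and duplicates
        }
        $seen[$domain] = true;

        try {
            // Array union keeps the 'domain' key even if the enricher returns one.
            $record = ['domain' => $domain] + $enrich($domain);
            $persist($record);
            $stats['processed']++;
        } catch (Throwable $e) {
            // One failed lookup should not abort the whole batch.
            $stats['failed']++;
        }
    }

    return $stats;
}

// Stand-ins for illustration only (assumptions, not the original code):
// $enrich plays the role of a third-party API call, $persist of a database write.
$stored = [];
$enrich = function (string $domain): array {
    if ($domain === 'bad.example') {
        throw new RuntimeException('lookup failed');
    }
    return ['hosts' => ["www.$domain"]];
};
$persist = function (array $record) use (&$stored): void {
    $stored[] = $record;
};

$stats = processDomains(['Example.com ', 'example.com', 'bad.example', ''], $enrich, $persist);
```

Passing the enricher and the writer in as callables keeps the loop itself free of network and database concerns, which makes a batch processor of this kind easy to exercise against fixtures.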

Technologies & Approach

PHP with Composer-managed dependencies kept the tooling simple. OrientDB was chosen to model the graph-like relationships between domains and hosts, while Unirest handled enrichment calls to external services.
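To make the graph modeling concrete, the sketch below turns enriched records into a list of vertices and a list of edges, the shape a graph database such as OrientDB stores. The class names Domain, Host, and HasHost, the function toGraph, and the record layout are all hypothetical; the original schema is not documented here, and the sketch builds plain PHP arrays rather than calling OrientDB.

```php
<?php
declare(strict_types=1);

/**
 * Turn enriched records into a vertex map and an edge list.
 * Class names 'Domain', 'Host', and 'HasHost' are assumptions for illustration.
 */
function toGraph(array $records): array
{
    $vertices = [];
    $edges = [];

    foreach ($records as $record) {
        $domainId = 'Domain:' . $record['domain'];
        $vertices[$domainId] = ['class' => 'Domain', 'name' => $record['domain']];

        foreach ($record['hosts'] as $host) {
            $hostId = 'Host:' . $host;
            // Keying by identifier means a host shared by two domains is stored once.
            $vertices[$hostId] = ['class' => 'Host', 'name' => $host];
            $edges[] = ['class' => 'HasHost', 'from' => $domainId, 'to' => $hostId];
        }
    }

    return ['vertices' => $vertices, 'edges' => $edges];
}

$graph = toGraph([
    ['domain' => 'example.com', 'hosts' => ['www.example.com', 'mail.example.net']],
    ['domain' => 'example.org', 'hosts' => ['mail.example.net']],
]);
```

In this example the shared mail host appears as a single vertex with two incoming edges, which is the kind of many-to-many relationship that is awkward in flat tables and natural in a graph store.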

Outcome / Impact

A working proof of concept for a domain/host enrichment pipeline, and early hands-on experience with batch data processing, external API enrichment, and graph-database storage.

Capabilities Demonstrated

  • Batch data-grabbing and enrichment pipelines
  • Graph/document database modeling (OrientDB)
  • Integrating third-party APIs for data enrichment