Domain & Host Data-Enrichment Script
Overview
An early PHP utility for grabbing and processing domain and host data, enriching it via third-party APIs and persisting results into an OrientDB graph database.
Why It Exists
Collecting and structuring data about domains and hosts at scale requires batch processing and a store that models relationships well. This script was an early-career build in building such a pipeline.
What We Built
Two batch processors, ProcessDomains.php and ProcessHosts.php, that iterate
over input data and enrich it. The stack (via Composer) includes piwik/device-detector
for parsing, mashape/unirest-php for outbound HTTP/API calls, and ostico/phporient
to write into OrientDB, a graph/document database well suited to domain–host
relationships.
Technologies & Approach
PHP with Composer-managed dependencies kept the tooling simple. OrientDB was chosen to model the graph-like relationships between domains and hosts, while Unirest handled enrichment calls to external services.
Outcome / Impact
A working proof of a domain/host enrichment pipeline, early hands-on experience with batch data processing, external API enrichment, and graph-database storage.
Capabilities Demonstrated
- Batch data-grabbing and enrichment pipelines
- Graph/document database modeling (OrientDB)
- Integrating third-party APIs for data enrichment