← All work
Engineering · 2023

News Topic-Modelling Pipeline, Evaluation

Overview

A clone of the open-source Newspaper-Topic-Modelling project, a Python NLP pipeline that ingests news articles, applies topic modelling, and surfaces results through a React frontend. Retained for evaluation; the repository carries upstream history (CZboop) with no studio-authored changes.

Why It Exists

To evaluate an end-to-end topic-modelling workflow over news text, data processing, model training, and a web visualization layer, including its testing and CI setup.

What We Built

No custom development; an upstream snapshot studied as reference. Observed: a Python data-processing pipeline with OS-agnostic path handling and unit tests, a React frontend, and a GitHub Actions workflow deploying both the Python and React jobs.

Technologies & Approach

Python for NLP/topic modelling with a React visualization frontend, wired into CI via GitHub Actions. Reviewed for its pipeline structure and test discipline.

Outcome / Impact

Provided the studio with a working reference for NLP topic-modelling pipelines and their visualization/CI patterns. Documented as evaluation/R&D.

Capabilities Demonstrated

  • Evaluating NLP topic-modelling pipelines
  • Studying data-processing + visualization + CI workflows
More work See all →