A Switzerland-based digital solutions provider aimed to automate PDF-based data extraction for faster inventory mapping. With a growing need for structured article pricing data, the client sought a robust backend module to streamline extraction, classification, and integration. The solution needed to support real-time processing through a web-based interface for GraphQL-connected systems.
The client approached Oodles for a system that extracts article numbers and prices from PDF files and fetches related results through a GraphQL API. The goal was to replace manual data collection with a scalable module. Key areas of work included PDF parsing, OCR integration, NER model execution, GraphQL connectivity, and structured export generation.
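To make the target output concrete, here is a minimal Python sketch of the two ends of such a pipeline: pulling article number and price pairs out of text already extracted from a PDF, and packaging the article numbers into a GraphQL request body. It is an illustration under stated assumptions, not the project's code. A regular expression stands in for the NER model, and the article-number format, the `ArticlePrice` type, the function names, and the `articles` query with its fields are all hypothetical.

```python
import json
import re
from dataclasses import dataclass

# Hypothetical line format: an article number such as "ART-12345", some
# description text, then a price such as "19.90" or "1'250.00".
# A regex is used here only as a stand-in for the NER step.
LINE_PATTERN = re.compile(
    r"(?P<article>[A-Z]{2,4}-\d{3,8})\s+.*?(?P<price>\d+(?:'\d{3})*\.\d{2})"
)


@dataclass(frozen=True)
class ArticlePrice:
    article_number: str
    price: float


def extract_article_prices(text: str) -> list[ArticlePrice]:
    """Return one ArticlePrice per line that contains both an article number and a price."""
    results = []
    for line in text.splitlines():
        match = LINE_PATTERN.search(line)
        if match:
            # Drop the apostrophe thousands separator before converting.
            price = float(match.group("price").replace("'", ""))
            results.append(ArticlePrice(match.group("article"), price))
    return results


def build_graphql_payload(article_numbers: list[str]) -> str:
    """Build a JSON request body for a hypothetical `articles` GraphQL query."""
    query = """
    query ArticlesByNumber($numbers: [String!]!) {
      articles(numbers: $numbers) { number name stock }
    }
    """
    return json.dumps({"query": query, "variables": {"numbers": article_numbers}})


sample = "ART-12345  Widget blue   19.90\nHeader line\nXY-001 Big crate 1'250.00 CHF"
items = extract_article_prices(sample)
payload = build_graphql_payload([item.article_number for item in items])
```

Keeping extraction and payload construction as separate functions means the regex stand-in could later be swapped for a model-based recognizer without touching the GraphQL side.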
To address the client’s need for automation, a Django-based REST API system was built to handle PDF-to-text conversion, entity recognition, and result generation. Here's how it worked: