aiShare Your Requirements
Home
Home
Our Industries
Project Details

Project Description

A Switzerland-based digital solutions provider aimed to automate PDF-based data extraction for faster inventory mapping. With a growing need for structured article pricing data, the client sought a robust backend module to streamline extraction, classification, and integration. The solution needed to support real-time processing through a web-based interface for GraphQL-connected systems.

Scope Of Work

The client sought a solution with Oodles for a system to extract article numbers and prices from PDF files and fetch related results via GraphQL API. The goal was to replace manual data collection with a scalable module. Key areas of work included PDF parsing, OCR integration, NER model execution, GraphQL connectivity, and structured export generation.

Our Solution

To address the client’s need for automation, a Django-based REST API system was built to handle PDF-to-text conversion, entity recognition, and result generation. Here's how it worked:

  • Image Conversion Module: PDF files were converted into JPGs using pdf2img for improved OCR accuracy.
  • Text Extraction: Google Tesseract OCR processed the images to extract raw textual content, saved in intermediate .txt files.
  • Entity Recognition: A spaCy-based Named Entity Recognition (NER) model was trained to extract specific fields from the text.
  • Data Mapping & Export: Extracted data was sent through a GraphQL API to fetch the updated article details. 

Related Projects

Wellsite AI

Wellsite™ serves oil and gas operators and service companies, streamlining oilfield operations through automation, application integration and collaboration ac

Technologies Involved:

Angular

+5 more

Area of Expertise:

Machine Learning

+2 more

Extricator

Extractor is a smart data extraction tool that enables us to process and extract data easily. Not just out of invoices but receipts and forms too. And we can downloa

Technologies Involved:

DevOps

+3 more

Area of Expertise:

Computer Vision

OCR for Liberate Health

Liberate Health, a health-tech platform dedicated to improving clinical communication, aimed to digitize handwritten prescriptions for quicker data access and improv

Technologies Involved:

DevOps

+2 more

Area of Expertise:

Computer Vision

+1 more

Viral Nation

Viral Nation, an influencer marketing agency, sought an advanced platform to streamline influencer campaigns and talent management. They needed a system to analyze d

Technologies Involved:

DevOps

+7 more

Area of Expertise:

Machine Learning

+8 more

Neuralwave

NeuralWave is an AI platform that analyzes documents, images, and videos in any format or language, understanding context, sentiments, and nuances while continuously

Technologies Involved:

Django

+3 more

Area of Expertise:

Computer Vision

+6 more

ESP Safety OCR

ESP designs and manufactures advanced detection systems, specializing in fixed toxic and combustible gas detectors, flame detectors, sand detectors, and pig detector

Technologies Involved:

My SQL

+2 more

Area of Expertise:

Computer Vision

DemoGPT - Text to Site Maker

DemoGPT transforms website creation by allowing users to build Streamlit sites through text prompts, making it accessible to all skill levels. The client sought Oodl

Technologies Involved:

DevOps

+3 more

Area of Expertise:

Computer Vision

+4 more

Snapworks

SnapWorks revolutionizes education by enhancing communication between classrooms, teachers, students, and parents, creating seamless collaboration. They sought to en

Technologies Involved:

Django

+2 more

Area of Expertise:

Machine Learning

+4 more

E-commerce OCR

The client specializes in capturing and analyzing user activity across diverse websites, including e-commerce, informational, and entertainment platforms. Their goal

Technologies Involved:

Chatgpt

+4 more

Area of Expertise:

E Commerce Development

+2 more

GCG Excel Tool

The client specializes in innovative solutions for data extraction and processing. They sought to enhance their capabilities by developing a system to upload PDFs an

Technologies Involved:

Python

Area of Expertise:

Computer Vision

+1 more

ErieTCG OCR

ErieTCG OCR is an AI-powered engine that extracts structured data from trading card images, specifically targeting Basic-EX and Mega-EX cards. The application uses a

Technologies Involved:

Python

Area of Expertise:

Computer Vision

Doctor AI Assistant

A healthcare-focused tech startup, Doctor AI Assistant committed to improving doctor-patient interactions sought an AI-powered assistant to enhance consultation effi

Technologies Involved:

Python

Area of Expertise:

Computer Vision

+1 more

Diabetic Prediction System

A health-focused tech startup committed to early disease detection aimed to make diabetes screening faster and more accessible through AI. The client sought an inter

Technologies Involved:

Python

Area of Expertise:

Computer Vision

Ceiling Measurement Tool

A leading industrial automation company specializing in warehouse operations approached Oodles for a smart alternative to manual ceiling height measurements. Their n

Technologies Involved:

Python

Area of Expertise:

Computer Vision

Agentic Doc Extractor

An emerging GenAI-focused enterprise set out to build visual AI applications without traditional model training. Their goal was to solve real-world problems using pr

Technologies Involved:

Python

Area of Expertise:

Computer Vision

SmartDocDiff

A research-focused client in the AI space, dedicated to improving factual integrity in NLP outputs, approached Oodles to build a lightweight tool for discrepancy det

Technologies Involved:

Python

Area of Expertise:

Computer Vision

Ai Rento Soft

An auto-rental company, Ai Rento, aiming to streamline vehicle check-in and checkout, approached Oodles to automate damage detection using AI. With a growing fleet a

Technologies Involved:

Python

Area of Expertise:

Computer Vision

HomeWise AI

A forward-looking real estate solutions provider, focused on improving property search and client engagement, approached Oodles to strengthen their AI-driven capabil

Technologies Involved:

Python

Area of Expertise:

Computer Vision

Hatch

Hatch is an AI-powered data extraction platform built to process multiple documents in parallel. Oodles developed the solution to allow users to upload files in vari

Technologies Involved:

Python

Area of Expertise:

Computer Vision

Opsian Technologies

Opsian, a hospitality technology provider focused on seamless guest experiences, partnered with Oodles to build a multi-property digital check-in system. The require

Technologies Involved:

AWS

+3 more

Area of Expertise:

Computer Vision

+3 more

Ready to Build With an AI-Powered Engineering Partner?

We get started in minutes. No commitment required.

500+

Projects Delivered

300+

Technologies

17+

Years of Trust

© Copyright 2009-2026 Oodles Technologies. All Rights Reserved.