Job Description: AI/ML Developer
Position Overview
We are seeking an experienced AI/ML Developer with strong data extraction and analysis skills to build a system that processes structured and unstructured financial documents, extracts key transactional data, and performs intelligent pricing analysis. The ideal candidate will have expertise in document AI, natural language processing, and data pipeline development.
Core Technical Requirements
1. AI/ML & Data Processing Skills
- Strong experience with document processing pipelines (OCR, text extraction, data normalization)
- Proficiency in natural language processing (NLP) for entity recognition and relationship extraction
- Experience with computer vision techniques for document layout analysis
- Knowledge of machine learning for tabular data extraction (invoice/bill parsing)
- Familiarity with large language models (LLMs) for document understanding
- Ability to implement custom rule engines for domain-specific logic
2. Programming & Backend Development
- Expert-level Python, with experience in:
  - Data science stack (Pandas, NumPy, scikit-learn)
  - NLP libraries (spaCy, NLTK, Hugging Face Transformers)
  - Deep learning frameworks (PyTorch, TensorFlow)
- Experience with API development (FastAPI, Flask, Django)
- Knowledge of distributed processing for large document volumes
- Understanding of data security and compliance in financial data processing
3. Data Infrastructure & Cloud
- Experience with document storage and retrieval systems
- Knowledge of vector databases for semantic search
- Familiarity with cloud AI services such as Amazon Textract, Azure AI Document Intelligence (formerly Form Recognizer), and Google Cloud Document AI
- Experience with data pipeline orchestration (Airflow, Luigi)
- Understanding of scalable system architecture
4. Additional Valuable Skills
- Experience with financial document schemas (invoices, receipts, statements)
- Knowledge of pricing analysis and discount calculation methodologies
- Familiarity with multilingual document processing
- Experience with active learning systems for continuous model improvement
- Understanding of human-in-the-loop validation systems
Qualifications
- 3+ years of professional experience in AI/ML development
- Strong background in applied machine learning (not just theoretical)
- Proven track record of delivering production-grade document processing systems
- Degree in Computer Science, Data Science, or related technical field (or equivalent experience)
Project Expectations
- Design and implement end-to-end document processing systems
- Develop custom models for domain-specific extraction tasks
- Build scalable data pipelines for batch and real-time processing
- Implement quality control mechanisms for data extraction
- Optimize performance and accuracy of AI components
Application Requirements
Please provide:
1. Current resume/CV highlighting relevant experience
2. Code samples or GitHub profile demonstrating:
   - Document processing implementations
   - Data extraction pipelines
   - Relevant ML/NLP projects
3. Brief description of your approach to:
   - Handling varied document formats
   - Ensuring extraction accuracy
   - Scaling document processing systems
This role offers the opportunity to work on challenging document AI problems with real-world financial applications. We're looking for developers who enjoy solving complex extraction and analysis problems at scale.