✦ Applied AI & Backend Engineer

Neerali Acharya

7 years shipping production backend systems. I build AI that works at scale — not just in demos.

Backend & AI

AI AgentsRAGLLM IntegrationMCP

PythonJavaTypeScriptAWSFastAPIElasticSearchVector DBVapiReactDocker

7+Years Production

5AI Systems

95%SQL Cost Reduction

71msWebhook ACK Median

View Projects ↓ GitHub ↗ LinkedIn ↗ Medium ↗

Writing

From the trenches

✦ Medium 6 min read

I Built a RAG Pipeline. Then Reality Hit. Here's Every Problem I Solved

The three production bugs no RAG tutorial warns you about — non-deterministic outputs, hallucination, and context loss in multi-turn chats — and exactly how I fixed each one.

RAGLLMHallucinationVector SearchPrompt Engineering

Read on Medium →

✍️

AI systems

01 / 05

AI Receptionist
Chat and Voice Agent

Autonomous AI receptionist for a Pilates studio — handles inbound voice calls, bookings, rescheduling, and pricing. Built with a 4-model LLM fallback chain for uninterrupted service under API failures.

PythonFastAPIVapiGroqGemini 2.0 FlashGoogle Calendar APIGoogle Sheets APISSEPydantic-SettingsGradioasynciongrok

4LLM Fallback Chain

10Function Tools

≤1.5sFiller SLA

GitHub ↗

⚙ Architecture

📞

Caller

Inbound

🎙️

Vapi

STT / TTS

⚡

FastAPI

asyncio · SSE

🔀

LLM Fallback

Qwen→Gemini→Llama

🛠️

Function Calls

10 tools

📅

Calendar API

Atomic booking

📊

Sheets CRM

Call logging

↳ Resilience: spoken fillers at each model switch, empty-response retry, rate-limit detection & human-agent escalation.

02 / 05

AI Doc Assist —
Knowledge Assistant

Production-grade RAG platform for internal documentation. Upload any PDF, query it conversationally — every answer is grounded with doc name + page-level source traceability. Built RAGAS evaluation harness measuring Faithfulness 0.88, Answer Relevancy 0.76, Context Precision 0.81.

PythonFastAPILangChainSentence TransformersCross-Encoder Re-rankingGroqQwen/LLaMAPineconeFAISSAWS S3/SQSDynamoDBReact 18OllamaRAGASpytest

500+Docs Indexed

<1svs 15–30 min

0.88RAGAS Faithfulness

⚙ Architecture

📄

PDF Upload

React 18

☁️

AWS S3

Object store

📬

SQS + DLQ

Async queue

⚙️

Worker

Chunk + Embed

🧮

FAISS / Pinecone

Vector index

🔎

Re-ranking

Cross-Encoder

🤖

Groq LLM

Grounded answer

↳ Lifecycle: DynamoDB tracks pending → processing → ready → failed; full per-query observability on latency & re-rank scores.

03 / 05

AI Webhook
Ingestion Agent

Real-time supply chain normaliser. Ingests raw webhook events from logistics providers, classifies them with LLMs, and writes typed schemas to PostgreSQL — all under 100ms vendor ACK.

PythonFastAPISQLAlchemy asyncPostgreSQLPydantic v2asyncio.QueueGroqGeminiAnthropic ClaudeDocker

71msMedian ACK

21Tests Shipped

3sDashboard Refresh

GitHub ↗

⚙ Architecture

🪝

Vendor Webhook

Maersk · GFP

⚡

FastAPI 202

71ms ACK

🧵

asyncio.Queue

Decoupled

🧠

LLM Normalize

Groq/Gemini/Claude

✅

Pydantic v2

Typed schema

🐘

PostgreSQL

SQLAlchemy

📊

Dashboard

Auto-refresh 3s

↳ Exactly-once: dual-layer dedup (vendor idempotency key + SHA-256 payload hash) — no application-level locking.

04 / 05

IT Onboarding Automator —
MCP Provisioning Agent

Spec-driven MCP provisioning agent built in Kiro. Exposes an MCP server with three agent-callable tools — lets an AI agent inspect access and retry failed events without direct DB access. Dual-delivery mode: HTTP API and MCP server share a single provisioner module with exactly-once state machine guarantees.

TypeScriptNode.jsExpressMCP SDKbetter-sqlite3JestOllamaKiroSpec-Driven DevMCP Inspector

3Agent Tools

13Jest Tests

0LLM Dependency

GitHub ↗

IT Onboarding Automator - hire new employee

⚙ Architecture

🤖

AI Agent

Kiro / Ollama

🔌

MCP Server

stdio/JSON-RPC

🛠️

3 Agent Tools

inspect · retry

⚙️

Provisioner

Shared module

🔄

State Machine

pending→done→failed

🗄️

SQLite WAL

Audit trail

🌐

HTTP API

Express · webhooks

↳ Spec-driven: full SPEC.md authored first; implemented in AI-directed loops with tagged Git checkpoints v1-webhook → v2-mcp → v3-final.

05 / 05

LLM-Powered
Resume Parser

Structured data extraction from PDF and DOCX resumes using a multi-provider LLM backend. Pre-processes documents with regex URL detection before LLM ingestion, implements self-repair retry on malformed JSON, and exposes results via FastAPI REST and a Streamlit UI — no external infrastructure required.

PythonFastAPIStreamlitPydantic v2PyMuPDFpython-docxGroqGeminiAnthropic ClaudeSQLitepytest

37Tests (no API keys)

5LLM Providers

PDF+DOCXFile Support

GitHub ↗

⚙ Architecture

📤

Upload

PDF / DOCX

📝

Text Extract

PyMuPDF · docx

🔗

URL Pre-scan

Regex detect

🧠

LLM Parse

Groq · Gemini · Claude

🔁

Self-repair

JSON retry

✅

Pydantic v2

Typed schema

🗄️

SQLite WAL

FastAPI · Streamlit

↳ Swappable LLMs: provider set via env var — Groq (Llama 3.3 70B default), Gemini, OpenAI, Anthropic, Ollama. 37 tests run fully mocked.

Tools

What I work with

Languages

PythonJavaJavaScriptTypeScriptPL/SQL

AI / ML

RAGLangChainVector SearchSemantic SearchPrompt EngineeringAgentic WorkflowsTool CallingLLM IntegrationMCP Tool DesignSpec-Driven DevelopmentRAGAS

LLMs & Frameworks

Groq APIAnthropic ClaudeGemini 2.0 FlashQwen / LLaMAVapiSentence TransformersCross-Encoder Re-rankingOllama

Backend & APIs

FastAPISpring BootSpring JPANode.jsExpressSSEWebhooksSQLAlchemy

Databases & Vector DB

PineconeFAISSElasticsearchPostgreSQLMySQLRedisMongoDBDynamoDBSQLite

Cloud & DevOps

AWS S3SQSDynamoDBDLQIAMLambdaEC2KinesisStep FunctionsCloudWatchDockerJenkins

Tools & Testing

GradioPydantic v2KiroJestpytestReact 18MCP InspectorUvicornViteGit

Recognition

Awards

🏆

Rising Star Award

QMetry India Pvt. Ltd.

Awarded within 3 months of joining for outstanding performance, fast adaptation, and measurable 30–40% API performance improvement across core endpoints.

Contact

Let's connect

Open to senior AI and backend engineering opportunities.

💼

LinkedInlinkedin.com/in/neeraliacharya

🐙

GitHubgithub.com/neeraliacharya

✍️

Mediummedium.com/@neeraliacharya

Neerali Acharya

From the trenches

I Built a RAG Pipeline. Then Reality Hit. Here's Every Problem I Solved

AI systems

AI ReceptionistChat and Voice Agent

AI Doc Assist —Knowledge Assistant

AI WebhookIngestion Agent

IT Onboarding Automator —MCP Provisioning Agent

LLM-PoweredResume Parser

What I work with

Awards

Let's connect

AI Receptionist
Chat and Voice Agent

AI Doc Assist —
Knowledge Assistant

AI Webhook
Ingestion Agent

IT Onboarding Automator —
MCP Provisioning Agent

LLM-Powered
Resume Parser