Applied AI & Backend Engineer

Neerali Acharya

7 years shipping production backend systems. I build AI that works at scale — not just in demos.

Backend & AI
AI AgentsRAGLLM IntegrationMCP
PythonJavaTypeScriptAWSFastAPIElasticSearchVector DBVapiReactDocker
7+Years Production
5AI Systems
95%SQL Cost Reduction
71msWebhook ACK Median

AI systems

01 / 05

AI Receptionist
Chat and Voice Agent

Autonomous AI receptionist for a Pilates studio — handles inbound voice calls, bookings, rescheduling, and pricing. Built with a 4-model LLM fallback chain for uninterrupted service under API failures.

PythonFastAPIVapiGroqGemini 2.0 FlashGoogle Calendar APIGoogle Sheets APISSEPydantic-SettingsGradioasynciongrok
4LLM Fallback Chain
10Function Tools
≤1.5sFiller SLA
⚙ Architecture
📞
Caller
Inbound
🎙️
Vapi
STT / TTS
FastAPI
asyncio · SSE
🔀
LLM Fallback
Qwen→Gemini→Llama
🛠️
Function Calls
10 tools
📅
Calendar API
Atomic booking
📊
Sheets CRM
Call logging
Resilience: spoken fillers at each model switch, empty-response retry, rate-limit detection & human-agent escalation.
02 / 05

AI Doc Assist —
Knowledge Assistant

Production-grade RAG platform for internal documentation. Upload any PDF, query it conversationally — every answer is grounded with doc name + page-level source traceability. Built RAGAS evaluation harness measuring Faithfulness 0.88, Answer Relevancy 0.76, Context Precision 0.81.

PythonFastAPILangChainSentence TransformersCross-Encoder Re-rankingGroqQwen/LLaMAPineconeFAISSAWS S3/SQSDynamoDBReact 18OllamaRAGASpytest
500+Docs Indexed
<1svs 15–30 min
0.88RAGAS Faithfulness
⚙ Architecture
📄
PDF Upload
React 18
☁️
AWS S3
Object store
📬
SQS + DLQ
Async queue
⚙️
Worker
Chunk + Embed
🧮
FAISS / Pinecone
Vector index
🔎
Re-ranking
Cross-Encoder
🤖
Groq LLM
Grounded answer
Lifecycle: DynamoDB tracks pending → processing → ready → failed; full per-query observability on latency & re-rank scores.
03 / 05

AI Webhook
Ingestion Agent

Real-time supply chain normaliser. Ingests raw webhook events from logistics providers, classifies them with LLMs, and writes typed schemas to PostgreSQL — all under 100ms vendor ACK.

PythonFastAPISQLAlchemy asyncPostgreSQLPydantic v2asyncio.QueueGroqGeminiAnthropic ClaudeDocker
71msMedian ACK
21Tests Shipped
3sDashboard Refresh
⚙ Architecture
🪝
Vendor Webhook
Maersk · GFP
FastAPI 202
71ms ACK
🧵
asyncio.Queue
Decoupled
🧠
LLM Normalize
Groq/Gemini/Claude
Pydantic v2
Typed schema
🐘
PostgreSQL
SQLAlchemy
📊
Dashboard
Auto-refresh 3s
Exactly-once: dual-layer dedup (vendor idempotency key + SHA-256 payload hash) — no application-level locking.
04 / 05

IT Onboarding Automator —
MCP Provisioning Agent

Spec-driven MCP provisioning agent built in Kiro. Exposes an MCP server with three agent-callable tools — lets an AI agent inspect access and retry failed events without direct DB access. Dual-delivery mode: HTTP API and MCP server share a single provisioner module with exactly-once state machine guarantees.

TypeScriptNode.jsExpressMCP SDKbetter-sqlite3JestOllamaKiroSpec-Driven DevMCP Inspector
3Agent Tools
13Jest Tests
0LLM Dependency
⚙ Architecture
🤖
AI Agent
Kiro / Ollama
🔌
MCP Server
stdio/JSON-RPC
🛠️
3 Agent Tools
inspect · retry
⚙️
Provisioner
Shared module
🔄
State Machine
pending→done→failed
🗄️
SQLite WAL
Audit trail
🌐
HTTP API
Express · webhooks
Spec-driven: full SPEC.md authored first; implemented in AI-directed loops with tagged Git checkpoints v1-webhook → v2-mcp → v3-final.
05 / 05

LLM-Powered
Resume Parser

Structured data extraction from PDF and DOCX resumes using a multi-provider LLM backend. Pre-processes documents with regex URL detection before LLM ingestion, implements self-repair retry on malformed JSON, and exposes results via FastAPI REST and a Streamlit UI — no external infrastructure required.

PythonFastAPIStreamlitPydantic v2PyMuPDFpython-docxGroqGeminiAnthropic ClaudeSQLitepytest
37Tests (no API keys)
5LLM Providers
PDF+DOCXFile Support
⚙ Architecture
📤
Upload
PDF / DOCX
📝
Text Extract
PyMuPDF · docx
🔗
URL Pre-scan
Regex detect
🧠
LLM Parse
Groq · Gemini · Claude
🔁
Self-repair
JSON retry
Pydantic v2
Typed schema
🗄️
SQLite WAL
FastAPI · Streamlit
Swappable LLMs: provider set via env var — Groq (Llama 3.3 70B default), Gemini, OpenAI, Anthropic, Ollama. 37 tests run fully mocked.

What I work with

Languages
PythonJavaJavaScriptTypeScriptPL/SQL
AI / ML
RAGLangChainVector SearchSemantic SearchPrompt EngineeringAgentic WorkflowsTool CallingLLM IntegrationMCP Tool DesignSpec-Driven DevelopmentRAGAS
LLMs & Frameworks
Groq APIAnthropic ClaudeGemini 2.0 FlashQwen / LLaMAVapiSentence TransformersCross-Encoder Re-rankingOllama
Backend & APIs
FastAPISpring BootSpring JPANode.jsExpressSSEWebhooksSQLAlchemy
Databases & Vector DB
PineconeFAISSElasticsearchPostgreSQLMySQLRedisMongoDBDynamoDBSQLite
Cloud & DevOps
AWS S3SQSDynamoDBDLQIAMLambdaEC2KinesisStep FunctionsCloudWatchDockerJenkins
Tools & Testing
GradioPydantic v2KiroJestpytestReact 18MCP InspectorUvicornViteGit

Awards

🏆
Rising Star Award
QMetry India Pvt. Ltd.
Awarded within 3 months of joining for outstanding performance, fast adaptation, and measurable 30–40% API performance improvement across core endpoints.

Let's connect

Open to senior AI and backend engineering opportunities.