Available for select product and platform conversations

Rahul Dhiman

Software Engineer | Backend, Search & AI Systems

I design high-throughput backend systems and AI-powered support platforms that make complex products feel fast, reliable, and commercially useful.

Software engineer with ~4 years of experience building enterprise search, low-latency APIs, and production-grade LLM workflows. My work sits at the intersection of distributed systems, information retrieval, and product execution.

GitHub LinkedIn hi@rahuldhiman.comFounder-friendly, recruiter-friendly, execution-first

Current focus

Search, retrieval, and AI systems that hold up in production.

50K+

Production users

10M+

Documents indexed

1K+ QPS

Request throughput

65%

Latency reduction

Quick actions

Open the action palette to print, share, copy contact details, or send the resume without hunting through the page.

What I do

The resume story in one read.

A concise overview for hiring managers, founders, and teams that need both systems depth and product judgment.

I build the backend systems that power search, support, and AI-driven customer experiences at scale. My strength is translating deep technical constraints into products that feel immediate and reliable for end users.

I am at my best on platform-heavy problems: search relevance, ingestion pipelines, API performance, AI retrieval systems, fault tolerance, and workflow automation that touches real business outcomes.

I am targeting senior software engineering roles where systems design, search, AI integration, and execution quality all matter. The differentiator I bring is practical depth with a product mindset.

Scalable backend architecture

Search relevance and indexing

Low-latency API design

LLM systems with grounding and fallback logic

Positioning

Technical depth with commercial intent.

The through-line across the work: systems that scale, interfaces that serve real teams, and AI that is measured by outcomes rather than novelty.

Enterprise search at scale

Shipped search and retrieval systems where performance, relevance, and uptime materially affect support efficiency.

AI that survives production

Focused on grounded LLM workflows with fallbacks, measurable outcomes, and operational confidence.

Operator mindset

I care about latency, observability, failure modes, and maintainability as much as shipping speed.

Voice and multimodal systems

Built SIP-based IVR voice agents, multimodal RAG, and image-answering pipelines with tight latency and reliability constraints.

Experience

Built for scale, measured by impact.

A structured timeline of the roles, systems, and outcomes that define the current body of work.

Software Engineer

Grazitti Interactive (SearchUnify)

2023 - Present

Hybrid / Remote

Own backend and AI-heavy initiatives for an enterprise search platform used across customer support and knowledge workflows.

Designed and scaled enterprise search systems serving 50K+ users across large support environments.
Built indexing pipelines for 10M+ documents across CRM, knowledge base, and internal content systems.
Reduced p95 latency from 400ms to 140ms through query optimization, caching strategy, and API tuning.
Developed backend services in Node.js and Python handling 1K+ QPS under production workloads.
Led Salesforce Service Cloud integration that improved knowledge recommendations and cut ticket resolution time by ~25%.
Built LLM-powered support automation with grounded retrieval and agentic workflows, reducing repetitive support queries by ~30%.
Delivered multimodal RAG workflows that combine text, document, and image context for grounded answer generation.
Built image question-answering flows that extract visual context from uploads and return natural-language answers.
Improved reliability with fallback logic, production monitoring, and alerting around search failure states.

Node.jsPythonSearchRAGMultimodal AISalesforceAWS

Software Engineer

Grazitti Interactive

2022 - 2023

On-site / Hybrid

Worked on search infrastructure foundations, ingestion pipelines, and production debugging across distributed services.

Built REST APIs and ingestion pipelines for search-oriented backend systems.
Designed batch and streaming workflows for document ingestion across multiple data sources.
Reduced ingestion failures by ~40% with retry orchestration, validation, and stronger failure recovery.
Debugged production issues in distributed services and improved service resilience.

Outcome

Designed ranking, deduplication, and multi-source indexing for 10M+ documents across enterprise environments.

Enterprise SearchSearch RelevanceScale

See capabilities

Expertise

Organized by capability, not buzzwords.

A signal-dense breakdown of the domains I ship in most often.

Systems and backend design

High-signal execution on APIs, throughput, reliability, and operational safeguards.

Search quality and data pipelines

Deep comfort with indexing, retrieval, relevance, and scale-sensitive search experiences.

Applied AI for production products

Multimodal RAG systems, image answering, voice agents, grounding, and pragmatic integration into real user workflows.

Backend & Distributed Systems

Node.jsPythonAPI DesignMicroservicesAsync ProcessingLow-latency Services

Search & Retrieval

ElasticsearchSolrIndexing PipelinesRankingQuery OptimizationDeduplication

AI & LLM Systems

Multimodal RAGImage QARAG PipelinesAgentic WorkflowsPrompt EngineeringLangChainEvaluation

Data & Persistence

SQLData ModelingChunking StrategiesKnowledge SystemsContext Retrieval

Cloud & Delivery

AWS EC2S3RDSCI/CDDockerProduction Monitoring

Product & Operations

Service CloudSupport AutomationCross-functional DeliveryReliability Engineering

Metrics

Numbers that make the story easy to scan.

The signal most recruiters and engineering leaders want to find quickly: scope, load, performance, and measurable change.

Years building production systems

0K+

Enterprise users served

0M+

Documents indexed across systems

p95 latency reduction on key flows

Support handling time improvement

Repetitive support queries reduced

Education

Compact, clear, and in service of the work.

Formal education is presented with restraint so the professional signal stays in focus.

Completed

Masters of Computer Application

Kurukshetra University

Contact

Built to convert curiosity into conversation.

If you are hiring for backend platforms, search, AI systems, or product infrastructure, this site should make the next step obvious.

hi@rahuldhiman.com

Direct response path for opportunities, portfolio requests, and architecture conversations.

India · IST (UTC+5:30)

Open to full-time, consulting, and high-impact platform work

GitHub LinkedIn Share profile Send resume