Available for select product and platform conversations

Rahul Dhiman

Software Engineer | Backend, Search & AI Systems

I design high-throughput backend systems and AI-powered support platforms that make complex products feel fast, reliable, and commercially useful.

Software engineer with ~4 years of experience building enterprise search, low-latency APIs, and production-grade LLM workflows. My work sits at the intersection of distributed systems, information retrieval, and product execution.

GitHubLinkedInhi@rahuldhiman.comFounder-friendly, recruiter-friendly, execution-first

Current focus

Search, retrieval, and AI systems that hold up in production.

Rahul Dhiman

50K+

Production users

10M+

Documents indexed

1K+ QPS

Request throughput

65%

Latency reduction

Quick actions

Open the action palette to print, share, copy contact details, or send the resume without hunting through the page.

What I do

The resume story in one read.

A concise overview for hiring managers, founders, and teams that need both systems depth and product judgment.

I build the backend systems that power search, support, and AI-driven customer experiences at scale. My strength is translating deep technical constraints into products that feel immediate and reliable for end users.

I am at my best on platform-heavy problems: search relevance, ingestion pipelines, API performance, AI retrieval systems, fault tolerance, and workflow automation that touches real business outcomes.

I am targeting senior software engineering roles where systems design, search, AI integration, and execution quality all matter. The differentiator I bring is practical depth with a product mindset.

Scalable backend architecture
Search relevance and indexing
Low-latency API design
LLM systems with grounding and fallback logic

Positioning

Technical depth with commercial intent.

The through-line across the work: systems that scale, interfaces that serve real teams, and AI that is measured by outcomes rather than novelty.

Enterprise search at scale

Shipped search and retrieval systems where performance, relevance, and uptime materially affect support efficiency.

AI that survives production

Focused on grounded LLM workflows with fallbacks, measurable outcomes, and operational confidence.

Operator mindset

I care about latency, observability, failure modes, and maintainability as much as shipping speed.

Voice and multimodal systems

Built real-time voice pipelines that orchestrate speech, context, and response generation with tight latency budgets.

Experience

Built for scale, measured by impact.

A structured timeline of the roles, systems, and outcomes that define the current body of work.

01

Software Engineer

Grazitti Interactive (SearchUnify)

2023 - Present

Hybrid / Remote

Own backend and AI-heavy initiatives for an enterprise search platform used across customer support and knowledge workflows.

  • Designed and scaled enterprise search systems serving 50K+ users across large support environments.
  • Built indexing pipelines for 10M+ documents across CRM, knowledge base, and internal content systems.
  • Reduced p95 latency from 400ms to 140ms through query optimization, caching strategy, and API tuning.
  • Developed backend services in Node.js and Python handling 1K+ QPS under production workloads.
  • Led Salesforce Service Cloud integration that improved knowledge recommendations and cut ticket resolution time by ~25%.
  • Built LLM-powered support automation with grounded retrieval and agentic workflows, reducing repetitive support queries by ~30%.
  • Improved reliability with fallback logic, production monitoring, and alerting around search failure states.
Node.jsPythonSearchRAGSalesforceAWS

02

Software Engineer

Grazitti Interactive

2022 - 2023

On-site / Hybrid

Worked on search infrastructure foundations, ingestion pipelines, and production debugging across distributed services.

  • Built REST APIs and ingestion pipelines for search-oriented backend systems.
  • Designed batch and streaming workflows for document ingestion across multiple data sources.
  • Reduced ingestion failures by ~40% with retry orchestration, validation, and stronger failure recovery.
  • Debugged production issues in distributed services and improved service resilience.
REST APIsIngestionDistributed SystemsValidationObservability

Projects

Case studies that prove taste, architecture, and execution.

Each project is framed around the problem, the system design decisions, and the business or operational impact.

AI Systems

Voice-to-Voice AI Support Agent

Details

A low-latency conversational support pipeline that converts speech to action-ready answers in real time.

Problem solved

Support teams needed faster first-response automation without sacrificing conversational quality or operational guardrails.

Outcome

Delivered sub-2 second multi-turn interaction latency with session continuity, fallback handling, and voice-first orchestration.

Voice AILatencyAutomation

Product Automation

Text-Based AI Support for Salesforce Case Management

Details

A context-aware AI assistant embedded into case workflows to speed up responses and reduce manual effort.

Problem solved

Case agents needed better suggestions grounded in enterprise context, not generic text generation.

Outcome

Integrated AI suggestions into agent workflows and reduced case handling time by roughly 25 to 30 percent.

SalesforceSupportLLM

Retrieval & Ranking

Production RAG System

Details

A grounded question-answering system built over enterprise knowledge with careful ranking, chunking, and answer controls.

Problem solved

Generic LLM outputs were not reliable enough for enterprise support use cases where correctness and traceability mattered.

Outcome

Improved answer accuracy by ~35% through retrieval design, ranking strategy, chunk quality, and grounding discipline.

RAGEvaluationEnterprise AI

Search Infrastructure

Multi-Source Enterprise Search

Details

A federated search experience spanning CRM, documentation, and internal knowledge systems.

Problem solved

Users were losing time switching across fragmented systems with inconsistent relevance and duplicate results.

Outcome

Designed ranking, deduplication, and multi-source indexing for 10M+ documents across enterprise environments.

Enterprise SearchSearch RelevanceScale

Expertise

Organized by capability, not buzzwords.

A signal-dense breakdown of the domains I ship in most often.

Systems and backend design

High-signal execution on APIs, throughput, reliability, and operational safeguards.

Search quality and data pipelines

Deep comfort with indexing, retrieval, relevance, and scale-sensitive search experiences.

Applied AI for production products

RAG systems, agentic flows, grounding, and pragmatic integration into real user workflows.

Backend & Distributed Systems

Node.jsPythonAPI DesignMicroservicesAsync ProcessingLow-latency Services

Search & Retrieval

ElasticsearchSolrIndexing PipelinesRankingQuery OptimizationDeduplication

AI & LLM Systems

RAG PipelinesAgentic WorkflowsPrompt EngineeringLangChainFallback LogicEvaluation

Data & Persistence

SQLData ModelingChunking StrategiesKnowledge SystemsContext Retrieval

Cloud & Delivery

AWS EC2S3RDSCI/CDDockerProduction Monitoring

Product & Operations

Service CloudSupport AutomationCross-functional DeliveryReliability Engineering

Metrics

Numbers that make the story easy to scan.

The signal most recruiters and engineering leaders want to find quickly: scope, load, performance, and measurable change.

0+

Years building production systems

0K+

Enterprise users served

0M+

Documents indexed across systems

0%

p95 latency reduction on key flows

0%

Support handling time improvement

0%

Repetitive support queries reduced

Education

Compact, clear, and in service of the work.

Formal education is presented with restraint so the professional signal stays in focus.

Completed

Masters of Computer Application

Kurukshetra University

Contact

Built to convert curiosity into conversation.

If you are hiring for backend platforms, search, AI systems, or product infrastructure, this site should make the next step obvious.

hi@rahuldhiman.com

Direct response path for opportunities, portfolio requests, and architecture conversations.

India · IST (UTC+5:30)

Open to full-time, consulting, and high-impact platform work