Tag: #LLM

6 articles tagged with LLM

Building AI Agents with LangGraph: State Machines for LLM Workflows

AI agents aren't just prompt chains. LangGraph brings state machines, conditional routing, and human-in-the-loop review to LLM workflows for production agents.

Rajath Kumar·Jun 27, 2026

ML / DL

LoRA Fine-Tuning for Edge Deployment: Shrink, Quantize, Ship

Full fine-tuning is expensive and wasteful when you only need domain adaptation. LoRA trains about 1% of the parameters; the adapted model can then be quantized to INT8 and run on edge hardware.

Rajath Kumar·Jun 27, 2026

MLOps / LLMOps

Deploying LLMs with vLLM and Docker: A Production LLMOps Guide

Serving LLMs in production isn't just loading a model in Flask. Learn vLLM for high-throughput inference, Docker for reproducible deploys, and the ops layer.

Rajath Kumar·Jun 27, 2026

Generative AI

Advanced RAG: Hybrid Search, Reranking, and Citation for Production

Basic RAG retrieves and generates. Production RAG uses hybrid search, cross-encoder reranking, and grounded citations. Here's how to build the latter.

Rajath Kumar·Jun 27, 2026

AI Tools

Running LLMs Locally: The Engineer's Practical Guide to Ollama

No API keys, no cloud costs, no data leaving your machine. Ollama makes running LLMs locally practical for engineers who want real AI integration.

Rajath Kumar·Jun 19, 2026

Generative AI

Building a RAG Pipeline for Engineers: From PDF to Answers in 50 Lines

RAG sounds complicated. It's not. Here's how to build a working pipeline that answers questions from your own documents in under 50 lines of Python.

Rajath Kumar·Jun 17, 2026