Tag: #LLM
6 articles tagged with LLM
Building AI Agents with LangGraph: State Machines for LLM Workflows
AI agents aren't just prompt chains. LangGraph brings state machines, conditional routing, and human-in-the-loop control to LLM workflows, so agents can hold up in production.
LoRA Fine-Tuning for Edge Deployment: Shrink, Quantize, Ship
Full fine-tuning is expensive and wasteful when you only need domain adaptation. LoRA trains roughly 1% of the parameters; the adapted model can then be quantized to INT8 and run on edge hardware.
Deploying LLMs with vLLM and Docker: A Production LLMOps Guide
Serving LLMs in production isn't just loading a model in Flask. Learn vLLM for high-throughput inference, Docker for reproducible deploys, and the ops layer that ties them together.
Advanced RAG: Hybrid Search, Reranking, and Citation for Production
Basic RAG retrieves and generates. Production RAG uses hybrid search, cross-encoder reranking, and grounded citations. Here's how to build the latter.
Running LLMs Locally: The Engineer's Practical Guide to Ollama
No API keys, no cloud costs, no data leaving your machine. Ollama makes running LLMs locally practical for engineers who want real AI integration.
Building a RAG Pipeline for Engineers: From PDF to Answers in 50 Lines
RAG sounds complicated. It's not. Here's how to build a working pipeline that answers questions from your own documents in under 50 lines of Python.