Tag: #LLM

6 articles tagged with LLM

Building AI Agents with LangGraph: State Machines for LLM Workflows
AI Agents

AI agents aren't just prompt chains. LangGraph brings state machines, conditional routing, and human-in-the-loop to LLM workflows for production agents.

Rajath Kumar · Jun 27, 2026

LoRA Fine-Tuning for Edge Deployment: Shrink, Quantize, Ship
ML / DL

Full fine-tuning is expensive and wasteful when you only need domain adaptation. LoRA trains about 1% of the parameters; the adapted model can then be quantized to INT8 and run on edge hardware.

Rajath Kumar · Jun 27, 2026

Deploying LLMs with vLLM and Docker: A Production LLMOps Guide
MLOps / LLMOps

Serving LLMs in production isn't just loading a model in Flask. Learn vLLM for high-throughput inference, Docker for reproducible deploys, and the ops layer.

Rajath Kumar · Jun 27, 2026

Advanced RAG: Hybrid Search, Reranking, and Citation for Production
Generative AI

Basic RAG retrieves and generates. Production RAG uses hybrid search, cross-encoder reranking, and grounded citations. Here's how to build the latter.

Rajath Kumar · Jun 27, 2026

Running LLMs Locally: The Engineer's Practical Guide to Ollama
AI Tools

No API keys, no cloud costs, no data leaving your machine. Ollama makes running LLMs locally practical for engineers who want real AI integration.

Rajath Kumar · Jun 19, 2026

Building a RAG Pipeline for Engineers: From PDF to Answers in 50 Lines
Generative AI

RAG sounds complicated. It's not. Here's how to build a working pipeline that answers questions from your own documents in under 50 lines of Python.

Rajath Kumar · Jun 17, 2026