2026
- Learn to fine-tune your own LLMs using Unsloth with LoRA/QLoRA. Covers data preparation, SFT, DPO/ORPO alignment, evaluation metrics, and a cost analysis of when self-hosting beats API providers.
- A comprehensive guide to deploying vLLM for high-throughput LLM inference in production. Covers server configuration, async client patterns, parallel processing, Kubernetes deployment, performance tuning, and monitoring.
- A head-to-head benchmark comparing pandas, Polars, and DuckDB. See why modern DataFrame libraries deliver 5-20x speedups through lazy evaluation, predicate pushdown, and automatic parallelization.
2025
- Building a retrieval-augmented generation pipeline using just the OpenAI API and NumPy: no vector databases, no frameworks, just the fundamentals.
- Build an AI agent with tool use, memory, and error recovery using OpenAI's function calling API. No frameworks, just the patterns that matter.