Index
2025 June
2025 April
2025 March
ReAct agent from scratch with Gemini 2.5 and LangGraph
March 31, 2025 — Agents, Gemini, LangGraph, ReAct
Pass@k vs Pass^k: Understanding Agent Reliability
March 24, 2025 — Agents, Metrics, Reliability, Production
Google Gemma 3 Function Calling Example
March 14, 2025 — Gemma, Function Calling, Google, Agents
Function Calling Guide: Google DeepMind Gemini 2.0 Flash
March 5, 2025 — Gemini, Function Calling, Google, Agents
2025 February
2025 January
Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial
January 30, 2025 — Deepseek, Reinforcement Learning, Reasoning, Hugging Face
How to align open LLMs in 2025 with DPO & and synthetic data
January 23, 2025 — HuggingFace, DPO, RL, LLMs
Bite: How Deepseek R1 was trained
January 17, 2025 — Bite, Deepseek, Reinforcement Learning, Reasoning