inference-scaling
Here are 6 public repositories matching this topic...
Diffusion-Sharpening: Fine-tuning Diffusion Models with Denoising Trajectory Sharpening
Updated May 18, 2025 - Python
[NeurIPS 2024] Fast Best-of-N Decoding via Speculative Rejection
Updated Oct 29, 2024 - Python
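The entry above refers to best-of-N decoding with speculative rejection: partial continuations are scored as they are generated, and unpromising ones are discarded early so compute concentrates on the strongest candidates. Below is a minimal sketch of that idea; `generate_step` and `partial_reward` are toy stand-ins for a real LLM and reward model, not the paper's or repository's code.

```python
# Minimal sketch of best-of-N decoding with early (speculative) rejection.
# `generate_step` and `partial_reward` are illustrative placeholders.
import random

def generate_step(prefix: str) -> str:
    """Placeholder 'LLM' that appends one random character per call."""
    return prefix + random.choice(["a", "b", "c", " "])

def partial_reward(text: str) -> float:
    """Placeholder reward model: prefers texts containing more 'a' characters."""
    return text.count("a") / max(len(text), 1)

def best_of_n_with_rejection(prompt: str, n: int = 16, steps: int = 32,
                             keep_fraction: float = 0.5) -> str:
    """Sample n continuations in parallel and periodically reject the
    lowest-scoring partial ones, so compute goes to promising candidates."""
    candidates = [prompt] * n
    for step in range(steps):
        candidates = [generate_step(c) for c in candidates]
        # Every 8 steps, rank partial continuations and drop the weakest half.
        if step % 8 == 7 and len(candidates) > 1:
            candidates.sort(key=partial_reward, reverse=True)
            candidates = candidates[:max(1, int(len(candidates) * keep_fraction))]
    return max(candidates, key=partial_reward)

if __name__ == "__main__":
    print(best_of_n_with_rejection("prompt: "))
```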
Stable Latent Reasoning: Enhancing Inference in Large Language Models through Iterative Latent Space Refinement
Updated Dec 22, 2024
Implemented a recurrent-depth LLM (PyTorch) based on arXiv:2502.05171. Demonstrated that scaling inference compute increased arithmetic reasoning accuracy from 8% to 100% without additional parameters.
Updated Nov 27, 2025 - Jupyter Notebook
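The recurrent-depth idea from arXiv:2502.05171 reuses one shared block for a variable number of iterations, so inference compute can be scaled by iterating longer rather than adding parameters. The PyTorch sketch below only illustrates that shape; the layer choices and sizes are illustrative assumptions, not the repository's architecture.

```python
# Sketch of a recurrent-depth language model: one shared block applied
# `depth_iters` times, so test-time compute scales with the iteration count
# while the parameter count stays fixed. Sizes here are illustrative.
import torch
import torch.nn as nn

class RecurrentDepthLM(nn.Module):
    def __init__(self, vocab_size: int = 256, d_model: int = 128):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, d_model)
        # A single block, reused at every depth step.
        self.block = nn.TransformerEncoderLayer(d_model, nhead=4, batch_first=True)
        self.head = nn.Linear(d_model, vocab_size)

    def forward(self, tokens: torch.Tensor, depth_iters: int = 4) -> torch.Tensor:
        h = self.embed(tokens)
        for _ in range(depth_iters):  # more iterations = more inference compute
            h = self.block(h)
        return self.head(h)

if __name__ == "__main__":
    model = RecurrentDepthLM()
    x = torch.randint(0, 256, (2, 16))
    # Same parameters, different amounts of test-time compute:
    logits_shallow = model(x, depth_iters=2)
    logits_deep = model(x, depth_iters=32)
    print(logits_shallow.shape, logits_deep.shape)
```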
Deep Research capability with reasoning models, CoT prompting, and inference-time scaling
Updated Feb 7, 2026 - Python
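Chain-of-thought (CoT) prompting, mentioned in the entry above, simply asks the model to reason step by step before committing to an answer. The sketch below shows one way such a prompt might be built; the wording is an illustrative assumption, not this repository's implementation.

```python
# Minimal sketch of building a chain-of-thought prompt. The exact phrasing is
# an assumption; any chat-completion API could consume the resulting string.
def build_cot_prompt(question: str) -> str:
    # Asking the model to reason step by step before answering is the core of CoT prompting.
    return (f"Question: {question}\n"
            "Think through the problem step by step, then give the final answer "
            "on a new line starting with 'Answer:'.")

if __name__ == "__main__":
    print(build_cot_prompt("A train travels 60 km in 45 minutes. What is its average speed in km/h?"))
```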
Production-ready test-time compute optimization framework for LLM inference. Implements Best-of-N, Sequential Revision, and Beam Search strategies. Validated with models up to 7B parameters.
Updated Jan 27, 2026 - Python
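Of the three strategies this entry lists, sequential revision is the simplest to sketch: draft an answer, repeatedly ask the model to revise its previous attempt, and keep the highest-scoring version according to a verifier. The Python below is a sketch under those assumptions; `llm` and `score` are hypothetical stand-ins, not the framework's API.

```python
# Sketch of a sequential-revision loop: draft, revise, keep only improvements.
# `llm` and `score` are hypothetical callables standing in for a model client
# and a verifier/reward model.
from typing import Callable

def sequential_revision(prompt: str,
                        llm: Callable[[str], str],
                        score: Callable[[str], float],
                        rounds: int = 3) -> str:
    best = llm(prompt)
    best_score = score(best)
    for _ in range(rounds):
        revision_prompt = (f"{prompt}\n\nPrevious attempt:\n{best}\n\n"
                           "Revise the attempt, fixing any mistakes.")
        candidate = llm(revision_prompt)
        cand_score = score(candidate)
        if cand_score > best_score:  # keep only revisions the verifier prefers
            best, best_score = candidate, cand_score
    return best

if __name__ == "__main__":
    # Toy stand-ins so the sketch runs: the "model" echoes the last line,
    # the "verifier" prefers longer text.
    toy_llm = lambda p: p.splitlines()[-1] + "!"
    toy_score = lambda s: float(len(s))
    print(sequential_revision("Explain test-time compute.", toy_llm, toy_score))
```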