jianzhnieFollow

jianzhnie

🎯

Focusing

Robin jianzhnie

🎯

Focusing

Machine learning， Reinforcement Learning， Transformers

125 followers ·224 following

Achievements

Achievement: Arctic Code Vault Contributor

Achievements

Highlights

Developer Program Member

jianzhnie/README.md

Hi there, I'm Robin 👋

Hi there 👋

Hey, I am jianzhnie, Thanks for stopping by!

I work as a full-time Machine Learning engineer and write tutorials on basic and advanced topics (NLP, Reinforcement Learning , Computer vision and code - lots of it).

I read and think a lot. And sometimes I put them in a form of a painting or a piece of music. And when I need to catch a breath I go for a run.

I’m currently working on 🔭

Reinforcement Learning

Code Repo	About
Deep-RL-Toolkit	Deep reinforcemnt Learning Toolkit For Single Agent (DQN, Reinbow, DDPG, PPO, SAC, TD3…)
Deep-MARL-Toolkit	Deep reinforcemnt Learning Toolkit For Multi Agent (VDN, Qmix, MADDPG, MAPPO, …)
RLZero	Monte Carlo Tree Search in General Sequential Decision Scenarios( AlphaZero， Muzero…)
ScaleRL	Handy and simple scaling of distributed reinforcement learning framework ( A3C, Ape-x, Impala, …)
CyberAttackSimulator	A Reinforcement Learning (RL) simulation environment built for training and evaluating autonomous cyber attack & defense models on simulated networks.

Large language Models

Code Repo	About
LLMToolkit	NLPToolkit is a toolkit for NLP(Natural Language Processing) and LLM(Large Language Models) using Pytorch.
LLamaTuner	Easy and Efficient Finetuning LLMs.
open-chatgpt	Developing the open source ChatGPT, Alpaca, Vicuna and RLHF Pipeline.
awesome-instruction-datasets	A collection of awesome-prompt-datasets, awesome-instruction-dataset, to train ChatLLM such as chatgpt.

Others

Developing Diffuser-toolkit: All kinds of diffusion models for image and audio generation in PyTorchdiffusion-toolkit
Developing AutoML tools for DeepLearning Project and MacheLearning ProjectAutoTimm |AutoTabular
Trying hard to reduce the Learning Machine Learning(LML) loss 😂
Coding everyday for better research engineering skill

I’m currently learning 🌱

LLM System and Artificial General Intelligence
Large Scale Distribute Reinforcemnet Learning System

How to reach me 📫

📫 Email:jianzhnie@gmail.com
📫 Homepage:https://jianzhnie.github.io
📫 Blog:https://jianzhnie.github.io/llmtech/
📖 ZhiHu:https://www.zhihu.com/column/fengnie
🤗 Huggingface Org:https://huggingface.co/GaussianTech
📫 Linkdin:https://www.linkedin.com/in/jianzheng-nie-2749b7156/
💬 Ask me about: Statistics and Machine Learning.
❤️Sponsor me on GitHub

Have an awesome day!

PinnedLoading

LLamaTunerLLamaTunerPublic
Easy and Efficient Finetuning LLMs. (Supported LLama, LLama2, LLama3, Qwen, Baichuan, GLM , Falcon) 大模型高效量化训练+部署.
Python 608 64
Open-R1Open-R1Public
The open source implementation of DeepSeek-R1. 开源复现 DeepSeek-R1
Python 261 50
microsoft/nnimicrosoft/nniPublic archive
An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.
Python 14.2k 1.8k
autogluon/autogluonautogluon/autogluonPublic
Fast and Accurate ML in 3 Lines of Code
Python 9.1k 1k
tatsu-lab/stanford_alpacatatsu-lab/stanford_alpacaPublic
Code and documentation to train Stanford's Alpaca models, and generate the data.
Python 30.1k 4k
deep-marl-toolkitdeep-marl-toolkitPublic
MARLToolkit: The Multi-Agent Rainforcement Learning Toolkit. Include implementation of MAPPO, MADDPG, QMIX, VDN, COMA, IPPO, QTRAN, MAT...
Python 140 19

Movatterモバイル変換

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Robin jianzhnie

Achievements

Achievements

Highlights

Block or report jianzhnie

Hi there, I'm Robin 👋

Hi there 👋

I’m currently working on 🔭

Reinforcement Learning

Large language Models

Others

I’m currently learning 🌱

How to reach me 📫

PinnedLoading

Uh oh!