Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
@jianzhnie
jianzhnie
Follow
View jianzhnie's full-sized avatar
🎯
Focusing

Robin jianzhnie

🎯
Focusing
Machine learning, Reinforcement Learning, Transformers

Block or report jianzhnie

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more aboutblocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more aboutreporting abuse.

Report abuse
jianzhnie/README.md

jianzhnie's GitHub Streak


Hi there 👋

Hey, I am jianzhnie, Thanks for stopping by!

I work as a full-time Machine Learning engineer and write tutorials on basic and advanced topics (NLP, Reinforcement Learning , Computer vision and code - lots of it).

I read and think a lot. And sometimes I put them in a form of a painting or a piece of music. And when I need to catch a breath I go for a run.

I’m currently working on 🔭

Reinforcement Learning

Code RepoAbout
Deep-RL-ToolkitDeep reinforcemnt Learning Toolkit For Single Agent (DQN, Reinbow, DDPG, PPO, SAC, TD3…)
Deep-MARL-ToolkitDeep reinforcemnt Learning Toolkit For Multi Agent (VDN, Qmix, MADDPG, MAPPO, …)
RLZeroMonte Carlo Tree Search in General Sequential Decision Scenarios( AlphaZero, Muzero…)
ScaleRLHandy and simple scaling of distributed reinforcement learning framework ( A3C, Ape-x, Impala, …)
CyberAttackSimulatorA Reinforcement Learning (RL) simulation environment built for training and evaluating autonomous cyber attack & defense models on simulated networks.

Large language Models

Code RepoAbout
LLMToolkitNLPToolkit is a toolkit for NLP(Natural Language Processing) and LLM(Large Language Models) using Pytorch.
LLamaTunerEasy and Efficient Finetuning LLMs.
open-chatgptDeveloping the open source ChatGPT, Alpaca, Vicuna and RLHF Pipeline.
awesome-instruction-datasetsA collection of awesome-prompt-datasets, awesome-instruction-dataset, to train ChatLLM such as chatgpt.

Others

  • Developing Diffuser-toolkit: All kinds of diffusion models for image and audio generation in PyTorchdiffusion-toolkit

  • Developing AutoML tools for DeepLearning Project and MacheLearning ProjectAutoTimm |AutoTabular

  • Trying hard to reduce the Learning Machine Learning(LML) loss 😂

  • Coding everyday for better research engineering skill

I’m currently learning 🌱

  • LLM System and Artificial General Intelligence
  • Large Scale Distribute Reinforcemnet Learning System

How to reach me 📫

Have an awesome day!

PinnedLoading

  1. LLamaTunerLLamaTunerPublic

    Easy and Efficient Finetuning LLMs. (Supported LLama, LLama2, LLama3, Qwen, Baichuan, GLM , Falcon) 大模型高效量化训练+部署.

    Python 596 63

  2. Open-R1Open-R1Public

    The open source implementation of DeepSeek-R1. 开源复现 DeepSeek-R1

    Python 245 47

  3. microsoft/nnimicrosoft/nniPublic archive

    An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.

    Python 14.1k 1.8k

  4. autogluon/autogluonautogluon/autogluonPublic

    Fast and Accurate ML in 3 Lines of Code

    Python 8.6k 976

  5. tatsu-lab/stanford_alpacatatsu-lab/stanford_alpacaPublic

    Code and documentation to train Stanford's Alpaca models, and generate the data.

    Python 29.9k 4.1k

  6. deep-marl-toolkitdeep-marl-toolkitPublic

    MARLToolkit: The Multi-Agent Rainforcement Learning Toolkit. Include implementation of MAPPO, MADDPG, QMIX, VDN, COMA, IPPO, QTRAN, MAT...

    Python 120 17


[8]ページ先頭

©2009-2025 Movatter.jp