- PCNLab
- Shenzhen
- https://jianzhnie.github.io/llmtech/
Hey, I am jianzhnie. Thanks for stopping by!
I work as a full-time machine learning engineer and write tutorials on basic and advanced topics (NLP, reinforcement learning, computer vision, and code, lots of it).
I read and think a lot, and sometimes I put those thoughts into the form of a painting or a piece of music. When I need to catch my breath, I go for a run.
Code Repo | About |
---|---|
Deep-RL-Toolkit | Deep Reinforcement Learning Toolkit for Single Agent (DQN, Rainbow, DDPG, PPO, SAC, TD3…) |
Deep-MARL-Toolkit | Deep Reinforcement Learning Toolkit for Multi Agent (VDN, QMIX, MADDPG, MAPPO, …) |
RLZero | Monte Carlo Tree Search in General Sequential Decision Scenarios (AlphaZero, MuZero…) |
ScaleRL | A handy, simple framework for scaling distributed reinforcement learning (A3C, Ape-X, IMPALA, …) |
CyberAttackSimulator | A Reinforcement Learning (RL) simulation environment built for training and evaluating autonomous cyber attack & defense models on simulated networks. |
Code Repo | About |
---|---|
LLMToolkit | NLPToolkit is a toolkit for NLP (Natural Language Processing) and LLMs (Large Language Models) using PyTorch. |
LLamaTuner | Easy and Efficient Finetuning LLMs. |
open-chatgpt | Developing the open source ChatGPT, Alpaca, Vicuna and RLHF Pipeline. |
awesome-instruction-datasets | A collection of prompt and instruction datasets for training chat LLMs such as ChatGPT. |
Developing Diffuser-toolkit: all kinds of diffusion models for image and audio generation in PyTorch (diffusion-toolkit)
Developing AutoML tools for deep learning and machine learning projects: AutoTimm | AutoTabular
Trying hard to reduce the Learning Machine Learning (LML) loss 😂
Coding every day for better research engineering skills
- LLM System and Artificial General Intelligence
- Large-Scale Distributed Reinforcement Learning Systems
- 📫 Email: jianzhnie@gmail.com
- 📫 Homepage: https://jianzhnie.github.io
- 📫 Blog: https://jianzhnie.github.io/llmtech/
- 📖 ZhiHu: https://www.zhihu.com/column/fengnie
- 🤗 Huggingface Org: https://huggingface.co/GaussianTech
- 📫 LinkedIn: https://www.linkedin.com/in/jianzheng-nie-2749b7156/
- 💬 Ask me about: Statistics and Machine Learning.
- ❤️ Sponsor me on GitHub
Have an awesome day!
Pinned
- LLamaTuner: Easy and Efficient Finetuning LLMs. (Supports LLama, LLama2, LLama3, Qwen, Baichuan, GLM, Falcon.) Efficient quantized training and deployment of large models.
- microsoft/nni (public archive): An open source AutoML toolkit for automating the machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.
- tatsu-lab/stanford_alpaca: Code and documentation to train Stanford's Alpaca models, and generate the data.
- deep-marl-toolkit: MARLToolkit, the Multi-Agent Reinforcement Learning Toolkit. Includes implementations of MAPPO, MADDPG, QMIX, VDN, COMA, IPPO, QTRAN, MAT...