- LLMTech
- Shen zhen
- https://jianzhnie.github.io/llmtech/
Highlights
Hey, I am jianzhnie, Thanks for stopping by!
I work as a full-time Machine Learning engineer and write tutorials on basic and advanced topics (NLP, Reinforcement Learning , Computer vision and code - lots of it).
I read and think a lot. And sometimes I put them in a form of a painting or a piece of music. And when I need to catch a breath I go for a run.
Code Repo | About |
---|---|
Deep-RL-Toolkit | Deep reinforcemnt Learning Toolkit For Single Agent (DQN, Reinbow, DDPG, PPO, SAC, TD3…) |
Deep-MARL-Toolkit | Deep reinforcemnt Learning Toolkit For Multi Agent (VDN, Qmix, MADDPG, MAPPO, …) |
RLZero | Monte Carlo Tree Search in General Sequential Decision Scenarios( AlphaZero, Muzero…) |
ScaleRL | Handy and simple scaling of distributed reinforcement learning framework ( A3C, Ape-x, Impala, …) |
CyberAttackSimulator | A Reinforcement Learning (RL) simulation environment built for training and evaluating autonomous cyber attack & defense models on simulated networks. |
Code Repo | About |
---|---|
LLMToolkit | NLPToolkit is a toolkit for NLP(Natural Language Processing) and LLM(Large Language Models) using Pytorch. |
LLamaTuner | Easy and Efficient Finetuning LLMs. |
open-chatgpt | Developing the open source ChatGPT, Alpaca, Vicuna and RLHF Pipeline. |
awesome-instruction-datasets | A collection of awesome-prompt-datasets, awesome-instruction-dataset, to train ChatLLM such as chatgpt. |
Developing Diffuser-toolkit: All kinds of diffusion models for image and audio generation in PyTorchdiffusion-toolkit
Developing AutoML tools for DeepLearning Project and MacheLearning ProjectAutoTimm |AutoTabular
Trying hard to reduce the Learning Machine Learning(LML) loss 😂
Coding everyday for better research engineering skill
- LLM System and Artificial General Intelligence
- Large Scale Distribute Reinforcemnet Learning System
- 📫 Email:jianzhnie@gmail.com
- 📫 Homepage:https://jianzhnie.github.io
- 📫 Blog:https://jianzhnie.github.io/llmtech/
- 📖 ZhiHu:https://www.zhihu.com/column/fengnie
- 🤗 Huggingface Org:https://huggingface.co/GaussianTech
- 📫 Linkdin:https://www.linkedin.com/in/jianzheng-nie-2749b7156/
- 💬 Ask me about: Statistics and Machine Learning.
- ❤️Sponsor me on GitHub
Have an awesome day!
PinnedLoading
- LLamaTuner
LLamaTuner PublicEasy and Efficient Finetuning LLMs. (Supported LLama, LLama2, LLama3, Qwen, Baichuan, GLM , Falcon) 大模型高效量化训练+部署.
- microsoft/nni
microsoft/nni Public archiveAn open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.
- tatsu-lab/stanford_alpaca
tatsu-lab/stanford_alpaca PublicCode and documentation to train Stanford's Alpaca models, and generate the data.
- deep-marl-toolkit
deep-marl-toolkit PublicMARLToolkit: The Multi-Agent Rainforcement Learning Toolkit. Include implementation of MAPPO, MADDPG, QMIX, VDN, COMA, IPPO, QTRAN, MAT...
If the problem persists, check theGitHub status page orcontact support.
Uh oh!
There was an error while loading.Please reload this page.