Haonan Zhang zchoi

🎯

Focusing

Ph.D. student. Research Interests: LLM-Agents, Vision-Language.

Achievements

zchoi/README.md

👻 I'm Haonan, a Ph.D. student of Center for Future Media at UESTC.

$\mathcal{Life\ isn't\ long\ enough\ for\ love\ and\ art. \ ——《The\ Moon\ and\ Sixpence》}$

Awesome-Embodied-Robotics-and-AgentAwesome-Embodied-Robotics-and-AgentPublic
This is a curated list of "Embodied AI or robot with Large Language Models" research. Watch this repository for the latest updates! 🔥
1.2k 70
RainBowLuoCS/MMEvolRainBowLuoCS/MMEvolPublic
🔥🔥🔥Code for "Empowering Multimodal Large Language Models with Evol-Instruct"
Jupyter Notebook 13
S2-TransformerS2-TransformerPublic
[IJCAI 2022] Official Pytorch code for paper “S2 Transformer for Image Captioning”
Python 82 4
RainBowLuoCS/OpenOmniRainBowLuoCS/OpenOmniPublic
OpenOmni: Official implementation of Advancing Open-Source Omnimodal Large Language Models with Progressive Multimodal Alignment and Real-Time Self-Aware Emotional Speech Synthesis
Python 40 2
GLSCLGLSCLPublic
Code for "Text-Video Retrieval with Global-Local Semantic Consistent Learning"
Python 13
PKOLPKOLPublic
[TIP 2022] Official code of paper “Video Question Answering with Prior Knowledge and Object-sensitive Learning”
Python 46