Haonan Zhang zchoi

🎯 Focusing
Ph.D. student. Research Interests: LLM-Agents, Vision-Language.

zchoi/README.md

👻 I'm Haonan, a Ph.D. student at the Center for Future Media, UESTC.

  • 🦾 Python / C++ / Jupyter / PyTorch
  • 🤔 LLM-based Agents / Vision&Language / Multimodal Learning
  • 🌱 Attending courses & doing research at UESTC
  • 🍙 Homepage: Link
  • 🙋‍♂️ CV: Link (last updated: 2024.2)

$\mathcal{Life\ isn't\ long\ enough\ for\ love\ and\ art. \ ——《The\ Moon\ and\ Sixpence》}$

Pinned

  1. Awesome-Embodied-Robotics-and-Agent (Public)

    This is a curated list of "Embodied AI or robot with Large Language Models" research. Watch this repository for the latest updates! 🔥

    1.2k 70

  2. RainBowLuoCS/MMEvol (Public)

    🔥🔥🔥Code for "Empowering Multimodal Large Language Models with Evol-Instruct"

    Jupyter Notebook 13

  3. S2-Transformer (Public)

    [IJCAI 2022] Official PyTorch code for the paper “S2 Transformer for Image Captioning”

    Python 82 4

  4. RainBowLuoCS/OpenOmni (Public)

    OpenOmni: Official implementation of Advancing Open-Source Omnimodal Large Language Models with Progressive Multimodal Alignment and Real-Time Self-Aware Emotional Speech Synthesis

    Python 40 2

  5. GLSCL (Public)

    Code for "Text-Video Retrieval with Global-Local Semantic Consistent Learning"

    Python 13

  6. PKOL (Public)

    [TIP 2022] Official code for the paper “Video Question Answering with Prior Knowledge and Object-sensitive Learning”

    Python 46
