Master's student at SUSTech, Shenzhen, China. My research focuses on computer vision, specifically the intersection of vision and language learning.
- Southern University of Science and Technology
- Shenzhen
Pinned
- ttengwang/Caption-Anything: Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences. https://huggingface.co/sp…
- LLMVA-GEBC: Winning solution to the Generic Event Boundary Captioning task in the LOVEU Challenge (CVPR 2023 workshop)
- Context-GEBC: Second-place solution to the Generic Event Boundary Captioning task in the LOVEU Challenge (CVPR 2022 workshop)
- Awesome-Multimodal-Chatbot: Awesome Multimodal Assistant is a curated list of multimodal chatbots/conversational assistants that utilize various modes of interaction, such as text, speech, images, and videos, to provide a sea…