THUDM


THUKEG

ChatGLM, GLM-4, CogVLM, CodeGeeX, CogView, ImageReward, CogVideoX | CogDL, GraphMAE, AMiner | Zhipu.ai (Z.ai) & Knowledge Engineering Group (KEG)

Pinned

  1. GLM Public

    GLM (General Language Model)

    Python 3.4k 336

  2. slime Public

    slime is an LLM post-training framework for RL Scaling.

    Python 2.9k 344

  3. P-tuning-v2 Public

    An optimized deep prompt tuning strategy comparable to fine-tuning across scales and tasks

    Python 2.1k 206

  4. ReST-MCTS Public

    ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)

    Python 685 50

  5. T1 Public

    RL Scaling and Test-Time Scaling (ICML'25)

    112 1

  6. AgentRL Public

    Scaling Agentic Reinforcement Learning with a Multi-Turn, Multi-Task Framework

    Python 155 8

Repositories

Showing 10 of 125 repositories
  • MobileRL Public
    Python 37 MIT 3 0 0 Updated Dec 18, 2025
  • slime Public

    slime is an LLM post-training framework for RL Scaling.

    Python 2,891 Apache-2.0 344 104 (6 issues need help) 45 Updated Dec 18, 2025
  • AgentRL Public

    Scaling Agentic Reinforcement Learning with a Multi-Turn, Multi-Task Framework

    Python 155 MIT 8 6 0 Updated Dec 16, 2025
  • AgentBench Public

    A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)

    Python 3,006 Apache-2.0 220 57 (38 issues need help) 7 Updated Nov 17, 2025
  • ComputerRL Public
    Python 9 Apache-2.0 4 3 0 Updated Nov 7, 2025
  • PETra Public
    Python 20 0 0 Updated Nov 5, 2025
  • AlignBench Public

    A multi-dimensional Chinese alignment benchmark for large language models (ACL 2024)

    Python 423 31 15 0 Updated Oct 25, 2025
  • LLM4CardGame
    Python 9 1 2 0 Updated Oct 15, 2025
  • DeepDive Public

    DeepDive: Advancing Deep Search Agents with Knowledge Graphs and Multi-Turn RL

    Python 216 19 2 0 Updated Oct 2, 2025
  • TDRM Public
    Python 9 Apache-2.0 1 0 0 Updated Sep 25, 2025
