[Question]: Can we set a custom topology for All-Reduce? #1892

New issue

Open

[Question]: Can we set a custom topology for All-Reduce?#1892

Labels

question

Description

fopdoodle8

opened

on Nov 1, 2025

Question

I want to achieve Tree All-Reduce across intra-node GPUs. However, even after settingNCCL_ALGO=allreduce:tree, the reduction still appears to happen sequentially. Can we force intra-node All-Reduce to use a tree-structured topology?

Example log:
[2] NCCL INFO Trees [0] 3/-1/-1->2->1 [1] 3/-1/-1->2->1
[1] NCCL INFO Trees [0] 2/-1/-1->1->0 [1] 2/-1/-1->1->0
[3] NCCL INFO Trees [0] -1/-1/-1->3->2 [1] -1/-1/-1->3->2
[0] NCCL INFO Trees [0] 1/-1/-1->0->-1 [1] 1/-1/-1->0->-1

Metadata

Assignees

No one assigned

Labels

question

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Movatterモバイル変換

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Question]: Can we set a custom topology for All-Reduce? #1892

Description

Question

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions