Movatterモバイル変換

Question 1

0votes

0answers

24views

AWS SageMaker PyTorch Model Deployment - is entry_point needed?

I'm trying to deploy a pre-trained PyTorch model to SageMaker using the Python SDK. I have a model.tar.gz file that is uploaded to S3, with the following structure:code/code/requirements.txtcode/...

RefresherM

1

asked18 hours ago

Question 2

Tooling

0votes

1replies

28views

Good packages for bounded Linear Quantile Regression?

I'm looking for a good package to train a linear quantile regression model, i.e. $\hat y = \sum_{i=1}^n w_i \cdot X_i$. With $x_i$ are the input features, and $w_i$ are the bounded trainable weights. ...

student13

13

asked18 hours ago

Question 3

0votes

0answers

19views

Attribution Error when using Huggingface transformers Trainer with FSDP

I am now trying to use FSDP in Huggingface transformers Trainer. The training script is something liketrain_dataset = Mydataset(...)args = TrainingArguments(...)model = LlamaForCausalLM....

MR_Xhao

11

askedyesterday

Question 4

0votes

0answers

41views

Optimization Challenge in Hugging Face: Effcienntly Serving Muliple, Differently Sized LLMs on a Single Gpu with PyTorch [closed]

I am currently working on a Python based Gen AI project that requires the efficient deployment and serving of multiple LLMs specifically models with different parameter counts ( Llama-2 7B and Mistral ...

Amira Yassin

1

askedyesterday

Question 5

2votes

1answer

72views

Having trouble with R's torch and tensor dimensions

I am trying to follow along with this webpage: https://jtr13.github.io/cc21fall2/tutorial-on-r-torch-package.htmlI am trying to understand R's implementation of PyTorch.I am having some trouble with ...

Huy Pham

173

asked2 days ago

Question 6

0votes

0answers

35views

How to force NCCL build to embed PTX for all kernels (prevent linker from stripping ncclDevKernel PTX)?

I am compiling NCCL 2.27.5-1 (I tried also 2.28.9-1) from source for a V100 GPU (sm_70). My goal is to have libnccl.so contain compute_70 PTX for every kernel.Despite passing explicit -gencode=arch=...

CiZ

10

asked2 days ago

Question 7

-1votes

1answer

39views

YOLOv8 custom training loop using v8DetectionLoss fails to converge on custom dataset (7 classes) [closed]

I am trying to implement a custom training loop for object detection using YOLOv8 (Ultralytics) and PyTorch. My goal is to fine-tune a pre-trained yolov8n.pt model on the Aquarium dataset, which ...

Quốc Tiến Trần

7

asked2 days ago

Question 8

1vote

0answers

55views

PyTorch installed via uv project shows CPU-only version on Windows with CUDA specification in pyproject.toml

I'm trying to set up a Python project using uv and pyproject.toml on Windows. I want to install the CUDA-enabled PyTorch, but after installing, when I check the version, it shows CPU-only.Here’s my ...

wonone11

11

askedNov 25 at 9:01

Question 9

Advice

0votes

0replies

29views

When using TensorDictPrioritizedReplayBuffer, should I apply the priority weight manually or not?

With Prioritized Experience Replay (PER), we use Beta parameter, so we can find weight that will be used to offset the bias introduced by PER. Now, with PyTorch's TensorDictPrioritizedReplayBuffer, I ...

Bejo

13

askedNov 25 at 6:43

Question 10

1vote

2answers

124views

pytorch Module B=A, A.to('cpu'), but the tensor in B is still in GPU, why?

After converting module A to CPU, the origin parameter tensor still stays on the GPU? When it is released? Is it wrong if I reuse the parameter?My code:import torch.nn as nnclass A(nn.Module): ...

jiwei zhang

11

askedNov 21 at 10:11

Question 11

2votes

0answers

21views

PyTorch .view() operation to manipulate tensor dimensions vis a vis using torch.unbind followed by torch.cat

In Torch, .view() reshapes the tensor. However, there are multiple ways to reshape a multi-dimensional tensor to a target shape. How does it decide between those different ways?For example, in Torch, ...

Sanchit

21

askedNov 20 at 21:47

Question 12

2votes

1answer

468views

PyTorch fails on Windows Server 2019: “Error loading c10.dll” (works fine on Windows 10)

I'm trying to deploy a Python project on Windows Server 2019, but PyTorch fails to import with a DLL loading error.On my local machine (Windows 10, same Python version), everything works perfectly....

Rael Clariana

21

askedNov 20 at 17:59

Question 13

1vote

1answer

59views

.so file built on same CPU but different EC2 instances lead to missing symbols

I am building a wheel of PyTorch from source, based on their https://github.com/pytorch/pytorch/blob/v2.6.0/.ci/manywheel/build_common.sh CI build script. I tested on a "local" instance of a ...

Corneau

93

askedNov 18 at 21:40

Question 14

Advice

0votes

2replies

45views

Fixing a UNET in pytorch that doesn't work in eval mode due to BatchNorm2d layers

I have a UNET model trained in pytorch (by someone else) that produces quite different results in eval mode to train mode (train mode results look good, eval mode they are rubbish). A bit of googling ...

user18504955

11

askedNov 17 at 11:26

Question 15

0votes

0answers

52views

Given groups=1, weight of size [64, 1024, 1, 1], expected input[1, 256, 1, 1] to have 1024 channels, but got 256 channels instead

I have encountered this issue and I searched on the forums but I couldnt solve it. How can I solve this problem ?I tried to add CBAM module in yolov12 for my custom dataset to improve accuracy. I ...

partizal

33

askedNov 17 at 11:22

Movatterモバイル変換

Collectives™ on Stack Overflow

AWS SageMaker PyTorch Model Deployment - is entry_point needed?

Good packages for bounded Linear Quantile Regression?

Attribution Error when using Huggingface transformers Trainer with FSDP

Optimization Challenge in Hugging Face: Effcienntly Serving Muliple, Differently Sized LLMs on a Single Gpu with PyTorch [closed]

Having trouble with R's torch and tensor dimensions

How to force NCCL build to embed PTX for all kernels (prevent linker from stripping ncclDevKernel PTX)?

YOLOv8 custom training loop using v8DetectionLoss fails to converge on custom dataset (7 classes) [closed]

PyTorch installed via uv project shows CPU-only version on Windows with CUDA specification in pyproject.toml

When using TensorDictPrioritizedReplayBuffer, should I apply the priority weight manually or not?

pytorch Module B=A, A.to('cpu'), but the tensor in B is still in GPU, why?

PyTorch .view() operation to manipulate tensor dimensions vis a vis using torch.unbind followed by torch.cat

PyTorch fails on Windows Server 2019: “Error loading c10.dll” (works fine on Windows 10)

.so file built on same CPU but different EC2 instances lead to missing symbols

Fixing a UNET in pytorch that doesn't work in eval mode due to BatchNorm2d layers

Given groups=1, weight of size [64, 1024, 1, 1], expected input[1, 256, 1, 1] to have 1024 channels, but got 256 channels instead

Hot Network Questions

Subscribe to RSS