Is it expensive to keep recreating a Flax network, such as:

class QNetwork(nn.Module):
    dim: int

    @nn.compact
    def __call__(self, x):
        x = nn.Dense(120)(x)
        x = nn.relu(x)
        ...
Duplicating my question here: https://github.com/google/flax/discussions/4825

I want to have a JAX or NNX jitted function that consumes and returns GPU-sharded tensors. However, inside the function, I ...
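A sketch of the jitted-function-with-sharded-IO setup, assuming a 1-D device mesh (on a multi-GPU host this shards across GPUs; on a single-device CPU host it degenerates to a 1-device mesh):

```python
from functools import partial

import numpy as np
import jax
import jax.numpy as jnp
from jax.sharding import Mesh, NamedSharding, PartitionSpec as P

# build a 1-D mesh over whatever devices are available
mesh = Mesh(np.array(jax.devices()), axis_names=("data",))
sharding = NamedSharding(mesh, P("data"))

# place the input according to the sharding before calling the function
x = jax.device_put(jnp.arange(8.0), sharding)

@partial(jax.jit, out_shardings=sharding)
def f(x):
    # inside jit you compute as usual; out_shardings constrains the
    # placement of the returned array
    return x * 2.0

y = f(x)
```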
In the following code, when I remove the vmap, I get the right randomized behavior. However, with vmap, I don't anymore. Isn't this supposed to be one of the features of nnx.vmap?

import jax
import ...
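The usual cause of this in plain JAX (and the idea behind splitting RNG state for `nnx.vmap`) is that every mapped lane must receive its own key; reusing one key makes all lanes produce identical samples. A plain-JAX sketch of the fix:

```python
import jax
import jax.numpy as jnp

key = jax.random.PRNGKey(0)

def sample(k):
    return jax.random.normal(k, ())

# splitting the key gives every vmapped lane its own RNG stream;
# passing the same key to every lane would make all samples identical
keys = jax.random.split(key, 4)
samples = jax.vmap(sample)(keys)
```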
I want to train an LLM on TPUv4-32 using JAX/Flax. The dataset is stored in a mounted Google Cloud Storage bucket. The dataset (Red-Pajama-v2) consists of 5000 shards, which are stored in .json.gz files: ~/...
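A stdlib-only sketch of streaming records from one such shard (the shard filename and record fields here are hypothetical, mimicking the line-delimited .json.gz layout described above):

```python
import gzip
import json
import os
import tempfile

# hypothetical shard: one JSON record per line, gzip-compressed
shard_dir = tempfile.mkdtemp()
shard_path = os.path.join(shard_dir, "shard-0000.json.gz")
with gzip.open(shard_path, "wt", encoding="utf-8") as f:
    for i in range(3):
        f.write(json.dumps({"id": i, "text": f"document {i}"}) + "\n")

def iter_shard(path):
    # stream records without decompressing the whole shard into memory
    with gzip.open(path, "rt", encoding="utf-8") as f:
        for line in f:
            yield json.loads(line)

records = list(iter_shard(shard_path))
```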
I'm doing some experiments with Flax NNX (not Linen!). What I'm trying to do is compute the weights of a network using another network: a hypernetwork receives some input parameters W and outputs a ...
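The core of the hypernetwork pattern, sketched in plain JAX (all sizes and the linear hypernetwork form are assumptions): the hypernetwork emits a flat weight vector, which is reshaped into the target network's parameters and applied functionally, so gradients flow back into the hypernetwork only.

```python
import jax
import jax.numpy as jnp

# target net: a single dense layer with 3 inputs, 2 outputs (assumed sizes)
t_in, t_out = 3, 2
n_flat = t_in * t_out + t_out  # kernel + bias, flattened

def hypernet(h_params, w):
    # linear hypernetwork: maps conditioning input w to the target's flat weights
    return h_params["A"] @ w + h_params["b"]

def target_apply(flat, x):
    kernel = flat[: t_in * t_out].reshape(t_in, t_out)
    bias = flat[t_in * t_out :]
    return x @ kernel + bias

key = jax.random.PRNGKey(0)
h_params = {
    "A": jax.random.normal(key, (n_flat, 4)) * 0.1,
    "b": jnp.zeros(n_flat),
}
w = jnp.ones(4)          # hypernetwork input
x = jnp.ones((5, t_in))  # target network input

y = target_apply(hypernet(h_params, w), x)

# gradients flow through both networks, so only h_params are trained
grads = jax.grad(lambda hp: target_apply(hypernet(hp, w), x).sum())(h_params)
```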
Description

I have a deterministic program that uses JAX and is heavy on linear algebra operations. I ran this code on CPU, using three different CPUs: two macOS systems (one on Sequoia (M1 Pro), ...
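One common source of cross-CPU differences (offered as a possible explanation, not a diagnosis of this specific program): IEEE-754 addition is not associative, so a different reduction order in the BLAS kernel or a different SIMD width changes the last bits of the result.

```python
# IEEE-754 addition is not associative: regrouping the same three
# terms produces results that differ in the last bits
a = (0.1 + 0.2) + 0.3
b = 0.1 + (0.2 + 0.3)
diff = a - b  # tiny but nonzero in double precision
```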
As of writing, this code does not pass the Pyright type checker:

import jax
import jax.numpy as jnp
import jax.typing as jt
import flax.linen as nn

class MLP(nn.Module):
    @nn.compact
    def ...
I am doing a project with RNNs using JAX and Flax, and I have noticed some behavior that I don't really understand. My code is basically an optimization loop where the user provides the initial ...
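For reference, a minimal pure-JAX RNN (assumed shapes, since the original loop is truncated): `lax.scan` threads the hidden state through the sequence, which is the usual building block such optimization loops are built on.

```python
import jax
import jax.numpy as jnp

hidden, feat, steps = 8, 3, 10
key = jax.random.PRNGKey(0)
k1, k2, k3 = jax.random.split(key, 3)
W_h = jax.random.normal(k1, (hidden, hidden)) * 0.1
W_x = jax.random.normal(k2, (hidden, feat)) * 0.1
xs = jax.random.normal(k3, (steps, feat))

def rnn_step(h, x):
    # carry is the hidden state; scan threads it through the sequence
    h = jnp.tanh(W_h @ h + W_x @ x)
    return h, h  # (new carry, per-step output)

h0 = jnp.zeros(hidden)
h_final, hs = jax.lax.scan(rnn_step, h0, xs)
```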
I have a neural network (an nnx.Module) written in Flax NNX. I want to train this network efficiently using lax.scan instead of a for loop. However, as scan doesn't allow in-place changes, how can I ...
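The usual approach is to make the state an explicit scan carry (with NNX this is typically done by splitting the module into graphdef and state and merging inside the scanned function). A plain-JAX analogue of the pattern, with an assumed toy linear-regression task:

```python
import jax
import jax.numpy as jnp

key = jax.random.PRNGKey(0)
w_true, b_true = jnp.array([[2.0], [-1.0]]), jnp.array([0.5])
xs = jax.random.normal(key, (10, 16, 2))  # 10 batches of 16 examples
ys = xs @ w_true + b_true

params0 = {"w": jnp.zeros((2, 1)), "b": jnp.zeros(1)}

def train_step(params, batch):
    # params are the scan carry: updated functionally, never in place
    x, y = batch
    def loss_fn(p):
        return jnp.mean((x @ p["w"] + p["b"] - y) ** 2)
    loss, grads = jax.value_and_grad(loss_fn)(params)
    params = jax.tree_util.tree_map(lambda p, g: p - 0.1 * g, params, grads)
    return params, loss  # (new carry, stacked per-step output)

params, losses = jax.lax.scan(train_step, params0, (xs, ys))
```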
I'm trying to work out how to do transfer learning with flax.nnx. Below is my attempt to freeze the kernel of my nnx.Linear instance and optimize the bias. I think maybe I'm not correctly setting up ...
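One way to express the freeze in plain JAX (a sketch of the masking idea, not the nnx.Optimizer setup from the truncated code): multiply each gradient leaf by a 0/1 mask before the update, so the kernel never moves while the bias trains.

```python
import jax
import jax.numpy as jnp

params = {
    "kernel": jnp.ones((3, 2)),
    "bias": jnp.zeros(2),
}
# 1.0 = trainable, 0.0 = frozen (here: freeze kernel, train bias)
mask = {"kernel": 0.0, "bias": 1.0}

x = jnp.ones((4, 3))
y = jnp.ones((4, 2))

def loss_fn(p):
    return jnp.mean((x @ p["kernel"] + p["bias"] - y) ** 2)

grads = jax.grad(loss_fn)(params)
# masked SGD step: frozen leaves get a zero update
new_params = jax.tree_util.tree_map(
    lambda p, g, m: p - 0.1 * m * g, params, grads, mask
)
```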
Why can't I use the VS Code debugger to debug JAX code, specifically pure functions? I understand that JAX provides its own debugging framework, but the VS Code debugger is quite comfortable. Is this ...
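The reason is that jitted functions run as compiled XLA programs, not Python bytecode, so a Python debugger never sees the inner lines. A common workaround is to run eagerly under `jax.disable_jit`:

```python
import jax
import jax.numpy as jnp

@jax.jit
def f(x):
    return jnp.sin(x) * 2.0

x = jnp.ones(3)
y_compiled = f(x)

# under disable_jit the function runs eagerly, line by line, so an
# ordinary debugger (e.g. VS Code's) can step into it and hit breakpoints
with jax.disable_jit():
    y_eager = f(x)
```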
I am trying to figure out how to use nnx.split_rngs. Can somebody give a version of the code below that uses nnx.split_rngs with jax.tree.map to produce an arbitrary number of Linear layers with ...
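The underlying idea that `nnx.split_rngs` automates, sketched in plain JAX (layer sizes assumed): split the key once per layer, then vmap the initializer over the keys so each layer gets independent parameters stacked along a leading axis.

```python
import jax
import jax.numpy as jnp

d_in, d_out, n_layers = 3, 4, 5

def init_layer(k):
    # one independent key per layer yields independent parameters
    return {
        "kernel": jax.random.normal(k, (d_in, d_out)) * 0.01,
        "bias": jnp.zeros(d_out),
    }

# vmap stacks each leaf along a new leading "layer" axis
keys = jax.random.split(jax.random.PRNGKey(0), n_layers)
stacked = jax.vmap(init_layer)(keys)
```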
I'm currently using Flax for neural network implementations. My model takes two inputs: x and θ. It first processes x through an LSTM, then concatenates the LSTM's output with θ — or more precisely, ...
I have set up a snippet on Colab here, with:

jax.__version__               # 0.4.33, 9 Feb 2025
orbax.checkpoint.__version__  # 0.6.4, 9 Feb 2025

It is quite difficult to follow the flax/orbax changes in the ...