Project

General

Profile

Ruby

All Projects

Ruby

Custom queries

Actions

Copy link

Feature #21852

open

New improved allocator function interface

Feature #21852:New improved allocator function interface

Added bybyroot (Jean Boussier)23 days ago. Updated21 days ago.

Status:

Open

Assignee:

Target version:

[ruby-core:124634]

Description

When implementing native types with theTypedData API, You have to define an allocator function.
That function receive the class to allocate and is supposed to return a new instance.

/** * This is  the type of  functions that ruby calls  when trying to  allocate an * object.  It is  sometimes necessary to allocate extra memory  regions for an * object.  When you define a class that uses ::RTypedData, it is typically the * case.  On  such situations  define a function  of this type  and pass  it to * rb_define_alloc_func(). * * @param[in]  klass  The class that this function is registered. * @return     A newly allocated instance of `klass`. */typedefVALUE(*rb_alloc_func_t)(VALUEklass);

Current API shortcomings¶

There are a few limitations with the current API.

Hard to disallow`.allocate` without breaking`#dup` and`#clone`.¶

First, it is frequent for extensions to want to disableClass#allocate for their native types viarb_undef_alloc_func, as very often allowing uninitialized object would lead to bugs.

The problem withrb_undef_alloc_func is that the alloc func is also used internally bydup andclone, so most types that undefine the allocator also prevent object copy without necessarily realizing it.

If you want to both disableClass#allocate yet still allow copying, you need to entirely implement the#dup and#clone methods, which is non-trivial and very few types do. One notable exception isBinding, which has to implement these two methods:https://github.com/ruby/ruby/blob/bea48adbcacc29cce9536977e15ceba0d65c8a02/proc.c#L301-L326

This works for Ruby code, however it doesn't work with C-levelrb_obj_dup(VALUE), as used by the Ractor logic to copy objects across ractors.
In the case ofBinding we probably wouldn't allow it anyway, but for other types it may be a problem.

Can't support objects of variable width¶

When duping or cloning an object of variable width, you need access to the original object to be able to allocate the right slot size.

An example of that isThread::Backtrace objects, as evidenced by [Bug#21818].

To support sending exception objects across ractors, we'd need to makerb_obj_dup() work forThread::Backtrace, but to correctly duplicate a backtrace, the allocator needs to know the size.

Proposed new API¶

I'd like to propose a new API for defining allocators:

typedefVALUE(*rb_copy_alloc_func_t)(VALUEklass,VALUEother);

In addition to the class to allocate, the function also receives the instance to copy.
When called byClass#allocate, theother argument is set toQundef. Example usage:

staticVALUEbacktrace_alloc(VALUEklass,VALUEother){rb_backtrace_t*bt;if(UNDEF_P(other)){// Regular allocreturnTypedData_Make_Struct(klass,rb_backtrace_t,&backtrace_data_type,bt);}else{// Copyrb_backtrace_t*other_bt;TypedData_Get_Struct(other,rb_backtrace_t,&backtrace_data_type,other_bt);VALUEself=backtrace_alloc_capa(other_bt->backtrace_size,&bt);bt->backtrace_size=other_bt->backtrace_size;MEMCPY(bt->backtrace,other_bt->backtrace,rb_backtrace_location_t,other_bt->backtrace_size);returnself;}}

Backward compatibility¶

Older-style allocator can keep being supported as long as we wish.

The one backward potential compatibility concern is third party code that callsrb_alloc_func_t rb_get_alloc_func(VALUE klass);.
As its documentation suggest, there's not much valid use case for it, but regardless we can keep supporting it by returning
a "jump function". Seecopy_allocator_adapter:https://github.com/ruby/ruby/pull/15795/changes#diff-884a5a8a369ef1b4c7597e00aa65974cec8c5f54f25f03ad5d24848f64892869R1640-R1653

Opportunity for more changes?¶

I was discussing this new interface with@ko1 (Koichi Sasada) and it appears that the current allocator interface may also be a limitation for Ractors and Ractor local GC. i.e. it might be useful to let the allocator function know that we're copying from one Ractor to another.

But I know to little about Ractor local GC to make a proposition here, so I will let@ko1 (Koichi Sasada) make suggestions.

Implementation¶

I implemented this idea inhttps://github.com/ruby/ruby/pull/15795, to solve [Bug#21818].
It could remain a purely private API, but I think it would make sense to expose it.

Updated byEregon (Benoit Daloze)21 days agoActions
Copy link
#1 [ruby-core:124649]

Areinitialize_copy/initialize_dup/initialize_clone still called when using acopy_allocator?
In thebacktrace_alloc example you seem to already do some copying in that function, which then makes it unclear where the copying should be done.

First, it is frequent for extensions to want to disableClass#allocate for their native types viarb_undef_alloc_func, as very often allowing uninitialized object would lead to bugs.

How about usingrb_undef_method(klass, "allocate"); for such cases?
I think that's a simple solution and requires no changes.

Fundamentally there is a difference between anallocate method and analloc function, new/dup/clone all require analloc function, but they should not require (and they don't IIRC) anallocate method.

Updated bybyroot (Jean Boussier)21 days agoActions
Copy link
#2 [ruby-core:124650]

Are initialize_copy/initialize_dup/initialize_clone still called when using a copy_allocator?

Yes, it is unchanged.

In the backtrace_alloc example you seem to already do some copying in that function, which then makes it unclear where the copying should be done.

Indeed. Technically it wouldn't be required, but I think it's more reliable to do it there than ininitialize_copy as the later could e redefined and cause corruption.

How about using rb_undef_method(klass, "allocate"); for such cases?

It's a corner case, but that allows redefining it later on.

Updated byEregon (Benoit Daloze)21 days agoActions
Copy link
#3 [ruby-core:124651]

byroot (Jean Boussier) wrote in#note-2:

Indeed. Technically it wouldn't be required, but I think it's more reliable to do it there than ininitialize_copy as the later could be redefined and cause corruption.

That could cause leaks if copying state involves extra allocations though, as a previously-existinginitialize_copy might allocate and just set the pointers, but not free that first copy done in thecopy_allocator.
It makes the contract unclear about what is supposed to copy what.

I think we need to trustinitialize_copy, or invent a new Ruby-level protocol for copying objects.

Inventing a new Ruby-level protocol for copying objects and for creation without uninitialized state would be great.
Some core classes already do this but then they typically don't support dup/clone.
It'd be great to have this in general, so one could write classes that never have to care about uninitialized state since there are no instances in that uninitialized state ever.
We'd have some method to do both allocation + initialization at once, and another method to create a copy + initialize it as one call.

How about using rb_undef_method(klass, "allocate"); for such cases?
It's a corner case, but that allows redefining it later on.

I don't think we need to worry about this corner case.
Such things are clearly violating internals of the class, and then they might as wellrb_define_alloc_func() and break it too.
But I suppose for core classes there might be a point to try to not segfault in that case, mmh.

Maybe classes should have a flag for "allow/disallow Class#allocate" and thenClass#allocate would check that?
There is already a check for singleton classes, so we could merge it with that check for free and just have singleton classes always set that flag to false.

Updated bybyroot (Jean Boussier)21 days agoActions
Copy link
#4 [ruby-core:124652]

Inventing a new Ruby-level protocol for copying objects and for creation without uninitialized state would be great.

Yes, this is somewhat what this new allocator API does. It also solves the problem that the Ractor API must be able to clone objects but can hardly trust user definedinitialize_copy methods.

Updated byEregon (Benoit Daloze)21 days agoActions
Copy link
#5 [ruby-core:124653]

It only does it for classes defined in C, and if they do all state copying incopy_allocator.
I think this would be valuable to have for any class, i.e. also for classes defined in Ruby and not in C.

Actions

Copy link

Also available in:PDFAtom

Movatterモバイル変換

Project

General

Profile

Ruby

Custom queries

Feature #21852

New improved allocator function interface

Current API shortcomings¶

Hard to disallow.allocate without breaking#dup and#clone.¶

Can't support objects of variable width¶

Proposed new API¶

Backward compatibility¶

Opportunity for more changes?¶

Implementation¶

Updated byEregon (Benoit Daloze)21 days agoActionsCopy link#1[ruby-core:124649]

Updated bybyroot (Jean Boussier)21 days agoActionsCopy link#2[ruby-core:124650]

Updated byEregon (Benoit Daloze)21 days agoActionsCopy link#3[ruby-core:124651]

Updated bybyroot (Jean Boussier)21 days agoActionsCopy link#4[ruby-core:124652]

Updated byEregon (Benoit Daloze)21 days agoActionsCopy link#5[ruby-core:124653]

Hard to disallow`.allocate` without breaking`#dup` and`#clone`.¶

Updated byEregon (Benoit Daloze)21 days agoActions
Copy link
#1 [ruby-core:124649]

Updated bybyroot (Jean Boussier)21 days agoActions
Copy link
#2 [ruby-core:124650]

Updated byEregon (Benoit Daloze)21 days agoActions
Copy link
#3 [ruby-core:124651]

Updated bybyroot (Jean Boussier)21 days agoActions
Copy link
#4 [ruby-core:124652]

Updated byEregon (Benoit Daloze)21 days agoActions
Copy link
#5 [ruby-core:124653]