A D implementation that conforms to the D ABI (Application Binary Interface) will be able to generate libraries, DLLs, etc., that can interoperate with D binaries built by other implementations.
The C ABI referred to in this specification means the C Application Binary Interface of the target system. C and D code should be freely linkable together, in particular, D code shall have access to the entire C ABI runtime library.
Theendianness (byte order) of the layout of the data will conform to the endianness of the target machine. The Intel x86 CPUs arelittle endian meaning that the value 0x0A0B0C0D is stored in memory as:0D 0C 0B 0A.
| bool | 8 bit byte with the values 0 for false and 1 for true |
| byte | 8 bit signed value |
| ubyte | 8 bit unsigned value |
| short | 16 bit signed value |
| ushort | 16 bit unsigned value |
| int | 32 bit signed value |
| uint | 32 bit unsigned value |
| long | 64 bit signed value |
| ulong | 64 bit unsigned value |
| cent | 128 bit signed value |
| ucent | 128 bit unsigned value |
| float | 32 bit IEEE 754 floating point value |
| double | 64 bit IEEE 754 floating point value |
| real | implementation defined floating point value, for x86 it is 80 bit IEEE 754 extended real |
| char | 8 bit unsigned value |
| wchar | 16 bit unsigned value |
| dchar | 32 bit unsigned value |
Delegates arefat pointers with two parts:
| offset | property | contents |
|---|---|---|
| 0 | .ptr | context pointer |
| ptrsize | .funcptr | pointer to function |
Thecontext pointer can be a classthis reference, a structthis pointer, a pointer to a closure (nested functions) or a pointer to an enclosing function's stack frame (nested functions).
These conform to the target's C ABI struct layout, except:
An object consists of:
| size | property | contents |
|---|---|---|
| ptrsize | .__vptr | pointer to vtable |
| ptrsize | .__monitor | monitor |
| ptrsize... | vptrs for any interfaces implemented by this class in left to right, most to least derived, order | |
| ... | ... | super's non-static fields and super's interface vptrs, from least to most derived |
| ... | named fields | non-static fields |
The vtable consists of:
| size | contents |
|---|---|
| ptrsize | pointer to instance of TypeInfo |
| ptrsize... | pointers to virtual member functions |
Casting a class object to an interface consists of adding the offset of the interface's corresponding vptr to the address of the base of the object. Casting an interface ptr back to the class type it came from involves getting the correct offset to subtract from it from the object.Interface entry at vtbl[0]. Adjustor thunks are created and pointers to them stored in the method entries in the vtbl[] in order to set the this pointer to the start of the object instance corresponding to the implementing method.
An adjustor thunk looks like:
ADD EAX,offset JMP method
The leftmost side of the inheritance graph of the interfaces all share their vptrs, this is the single inheritance model. Every time the inheritance graph forks (for multiple inheritance) a new vptr is created and stored in the class' instance. Every time a virtual method is overridden, a new vtbl[] must be created with the updated method pointers in it.
The class definition:
class XXXX{ ....};Generates the following:
An interface is a pointer to a pointer to a vtbl[]. The vtbl[0] entry is a pointer to the corresponding instance of the object.Interface class. The rest of thevtbl[1..$] entries are pointers to the virtual functions implemented by that interface, in the order that they were declared.
A COM interface differs from a regular interface in that there is no object.Interface entry invtbl[0]; the entriesvtbl[0..$] are all the virtual function pointers, in the order that they were declared. This matches the COM object layout used by Windows.
A C++ interface differs from a regular interface in that it matches the layout of a C++ class using single inheritance on the target machine.
A dynamic array consists of:
| offset | property | contents |
|---|---|---|
| 0 | .length | array dimension |
| size_t | .ptr | pointer to array data |
A dynamic array is declared as:
type[] array;whereas a static array is declared as:
type[dimension] array;
Thus, a static array always has the dimension statically available as part of the type, and so it is implemented like in C. Static arrays and Dynamic arrays can be easily converted back and forth to each other.
Associative arrays consist of a pointer to an opaque, implementation defined type.
The current implementation is contained in and defined byrt/aaA.d.
D has reference types, but they are implicit. For example, classes are always referred to by reference; this means that class instances can never reside on the stack or be passed as function parameters.
D accomplishes typesafe linking bymangling a D identifier to include scope and type information.
MangledName:_DQualifiedNameType_DQualifiedNameZ// Internal
TheType above is the type of a variable or the return type of a function. This is never aTypeFunction, as the latter can only be bound to a value via a pointer to a function or a delegate.
QualifiedName:SymbolFunctionNameSymbolFunctionNameQualifiedNameSymbolFunctionName:SymbolNameSymbolNameTypeFunctionNoReturnSymbolNameMTypeModifiersoptTypeFunctionNoReturn
TheM means that the symbol is a function that requires athis pointer. Class or struct fields are mangled withoutM. To disambiguateM from being aParameter with modifierscope, the following type needs to be checked for being aTypeFunction.
SymbolName:LNameTemplateInstanceNameIdentifierBackRef0// anonymous symbols
Template Instance Names have the types and values of its parameters encoded into it:
TemplateInstanceName:TemplateIDLNameTemplateArgsZTemplateID:__T__U// for symbols declared inside template constraintTemplateArgs:TemplateArgTemplateArgTemplateArgsTemplateArg:TemplateArgXHTemplateArgX
If a template argument matches a specialized template parameter, the argument is mangled with prefixH.
TemplateArgX:TTypeVTypeValueSQualifiedNameXNumberExternallyMangledName
ExternallyMangledName can be any series of characters allowed on the current platform, e.g. generated by functions with C++ linkage or annotated withpragma(mangle,...).
Values:ValueValueValuesValue:niNumberNNumbereHexFloatcHexFloatcHexFloatCharWidthNumber_HexDigitsANumberValuesSNumberValuesfMangledNameHexFloat:NANINFNINFNHexDigitsPExponentHexDigitsPExponentExponent:NNumberNumberHexDigits:HexDigitHexDigitHexDigitsHexDigit:DigitABCDEFCharWidth:awd
Name:NamestartNamestartNamecharsNamestart:_AlphaNamechar:NamestartDigitNamechars:NamecharNamecharNamechars
AName is a standard Didentifier.
LName:NumberNameNumber__SNumber// function-local parent symbolsNumber:DigitDigitNumberDigit:0123456789
AnLName is a name preceded by aNumber giving the number of characters in theName.
AnyLName or non-basicType (i.e. any type that does not encode as a fixed one or two character sequence) that has been emitted to the mangled symbol before will not be emitted again, but is referenced by a special sequence encoding the relative position of the original occurrence in the mangled symbol name.
Numbers in back references are encoded with base 26 by upper case lettersA -Z for higher digits but lower case lettersa -z for the last digit.
TypeBackRef:QNumberBackRefIdentifierBackRef:QNumberBackRefNumberBackRef:lower-case-letterupper-case-letterNumberBackRef
To distinguish between the type of the back reference a look-up of the back referenced character is necessary: An identifier back reference always points to a digit0 to9, while a type back reference always points to a letter.
Types are mangled using a simple linear scheme:
Type:TypeModifiersoptTypeXTypeBackRefTypeX:TypeArrayTypeStaticArrayTypeAssocArrayTypePointerTypeFunctionTypeIdentTypeClassTypeStructTypeEnumTypeTypedefTypeDelegateTypeVoidTypeByteTypeUbyteTypeShortTypeUshortTypeIntTypeUintTypeLongTypeUlongTypeCentTypeUcentTypeFloatTypeDoubleTypeRealTypeIfloatTypeIdoubleTypeIrealTypeCfloatTypeCdoubleTypeCrealTypeBoolTypeCharTypeWcharTypeDcharTypeNoreturnTypeNullTypeTupleTypeVectorTypeModifiers:ConstWildWildConstSharedSharedConstSharedWildSharedWildConstImmutableShared:OConst:xImmutable:yWild:NgTypeArray:ATypeTypeStaticArray:GNumberTypeTypeAssocArray:HTypeTypeTypePointer:PTypeTypeVector:NhTypeTypeFunction:TypeFunctionNoReturnTypeTypeFunctionNoReturn:CallConventionFuncAttrsoptParametersoptParamCloseCallConvention:F// DU// CW// WindowsR// C++Y// Objective-CFuncAttrs:FuncAttrFuncAttrFuncAttrsFuncAttr:FuncAttrPureFuncAttrNothrowFuncAttrRefFuncAttrPropertyFuncAttrNogcFuncAttrReturnFuncAttrScopeFuncAttrTrustedFuncAttrSafeFuncAttrLive
Function attributes are emitted in the order as listed above, with the exception ofreturn andscope.return comes beforescope whenthis is areturn scope parameter, and afterscope whenthis is ascope andreturn ref parameter.
FuncAttrPure:NaFuncAttrNogc:NiFuncAttrNothrow:NbFuncAttrProperty:NdFuncAttrRef:NcFuncAttrReturn:NjFuncAttrScope:NlFuncAttrTrusted:NeFuncAttrSafe:NfFuncAttrLive:NmParameters:ParameterParameterParametersParameter:Parameter2MParameter2// scopeNkParameter2// returnParameter2:TypeIType// inJType// outKType// refLType// lazyParamClose:X// variadic T t...) styleY// variadic T t,...) styleZ// not variadicTypeIdent:IQualifiedNameTypeClass:CQualifiedNameTypeStruct:SQualifiedNameTypeEnum:EQualifiedNameTypeTypedef:TQualifiedNameTypeDelegate:DTypeModifiersoptTypeFunctionTypeVoid:vTypeByte:gTypeUbyte:hTypeShort:sTypeUshort:tTypeInt:iTypeUint:kTypeLong:lTypeUlong:mTypeCent:ziTypeUcent:zkTypeFloat:fTypeDouble:dTypeReal:eTypeIfloat:oTypeIdouble:pTypeIreal:jTypeCfloat:qTypeCdouble:rTypeCreal:cTypeBool:bTypeChar:aTypeWchar:uTypeDchar:wTypeNoreturn:NnTypeNull:nTypeTuple:BParametersZ
Theextern (C) andextern (D) calling convention matches the C calling convention used by the supported C compiler on the host system. Except that the extern (D) calling convention for Windows x86 is described here.
The parameters to the non-variadic function:
foo(a1, a2, ..., an);are passed as follows:
| a1 |
| a2 |
| ... |
| an |
| hidden |
| this |
wherehidden is present if needed to return a struct value, andthis is present if needed as the this pointer for a member function or the context pointer for a nested function.
The last parameter is passed in EAX rather than being pushed on the stack if the following conditions are met:
Parameters are always pushed as multiples of 4 bytes, rounding upwards, so the stack is always aligned on 4 byte boundaries. They are pushed most significant first.out andref are passed as pointers. Static arrays are passed as pointers to their first element. On Windows, a real is pushed as a 10 byte quantity, a creal is pushed as a 20 byte quantity. On Linux, a real is pushed as a 12 byte quantity, a creal is pushed as two 12 byte quantities. The extra two bytes of pad occupy the ‘most significant’ position.
The callee cleans the stack.
The parameters to the variadic function:
void foo(int p1,int p2,int[] p3...)foo(a1, a2, ..., an);are passed as follows:
| p1 |
| p2 |
| a3 |
| hidden |
| this |
The variadic part is converted to a dynamic array and the rest is the same as for non-variadic functions.
The parameters to the variadic function:
void foo(int p1,int p2, ...)foo(a1, a2, a3, ..., an);are passed as follows:
| an |
| ... |
| a3 |
| a2 |
| a1 |
| _arguments |
| hidden |
| this |
The caller is expected to clean the stack._argptr is not passed, it is computed by the callee.
Conforms to the Microsoft Windows Structured Exception Handling conventions.
Conforms to the DWARF (debugging with attributed record formats) Exception Handling conventions.
Uses static address range/handler tables. It is not compatible with the MSVC x64 exception handling tables. The stack is walked assuming it uses the EBP/RBP stack frame convention. The EBP/RBP convention must be used for every function that has an associated EH (Exception Handler) table.
For each function that has exception handlers, an EH table entry is generated.
| field | description |
|---|---|
| void* | pointer to start of function |
| DHandlerTable* | pointer to corresponding EH data |
| uint | size in bytes of the function |
The EH table entries are placed into the following special segments, which are concatenated by the linker.
| Operating System | Segment Name |
|---|---|
| Win32 | FI |
| Win64 | ._deh$B |
| Linux | .deh_eh |
| FreeBSD | .deh_eh |
| OS X | __deh_eh,__DATA |
The rest of the EH data can be placed anywhere, it is immutable.
| field | description |
|---|---|
| void* | pointer to start of function |
| uint | offset of ESP/RSP from EBP/RBP |
| uint | offset from start of function to return code |
| uint | number of entries inDHandlerInfo[] |
| DHandlerInfo[] | array of handler information |
| field | description |
|---|---|
| uint | offset from function address to start of guarded section |
| uint | offset of end of guarded section |
| int | previous table index |
| uint | if != 0 offset to DCatchInfo data from start of table |
| void* | if not null, pointer to finally code to execute |
| field | description |
|---|---|
| uint | number of entries inDCatchBlock[] |
| DCatchBlock[] | array of catch information |
| field | description |
|---|---|
| ClassInfo | catch type |
| uint | offset from EBP/RBP to catch variable |
The interface to this is found in Druntime'score/gc/gcinterface.d.
An instance ofModuleInfo is generated by the compiler and inserted into the object file for every module.ModuleInfo contains information about the module that is useful to the D runtime library:
ModuleInfo is defined in Druntime'sobject.d, which must match the compiler's output in both the values of flags and layout of fields.
Modules compiled with-betterC do not have aModuleInfo instance generated, because such modules must work without the D runtime library. Similarly,ImportC modules do not generate aModuleInfo.
All the static constructors for a module are aggregated into a single function, and a pointer to that function is inserted into the ctor member of theModuleInfo instance for that module.
All the static destructors for a module are aggregated into a single function, and a pointer to that function is inserted into the dtor member of theModuleInfo instance for that module.
All the unit tests for a module are aggregated into a single function, and a pointer to that function is inserted into the unitTest member of theModuleInfo instance for that module.
These are found in Druntime'srt/.
D has types that are not represented in existing C or C++ debuggers. These are dynamic arrays, associative arrays, and delegates. Representing these types as structs causes problems because function calling conventions for structs are often different than that for these types, which causes C/C++ debuggers to misrepresent things. For these debuggers, they are represented as a C type which does match the calling conventions for the type.
| D type | C representation |
|---|---|
| dynamic array | unsigned long long |
| associative array | void* |
| delegate | long long |
| dchar | unsigned long |
For debuggers that can be modified to accept new types, the following extensions help them fully support the types.
The Ddchar type is represented by the special primitive type 0x78.
D makes use of the Codeview OEM generic type record indicated byLF_OEM (0x0015). The format is:
| field size | 2 | 2 | 2 | 2 | 2 | 2 |
| D Type | Leaf Index | OEM Identifier | recOEM | num indices | type index | type index |
|---|---|---|---|---|---|---|
| dynamic array | LF_OEM | OEM | 1 | 2 | @index | @element |
| associative array | LF_OEM | OEM | 2 | 2 | @key | @element |
| delegate | LF_OEM | OEM | 3 | 2 | @this | @function |
| OEM | 0x42 |
| index | type index of array index |
| key | type index of key |
| element | type index of array element |
| this | type index of context pointer |
| function | type index of function |
These extensions can be pretty-printed byobj2asm. TheDdbg debugger supports them.