3. Data model¶
3.1. Objects, values and types¶
Objects are Python’s abstraction for data. All data in a Python programis represented by objects or by relations between objects. (In a sense, and inconformance to Von Neumann’s model of a “stored program computer,” code is alsorepresented by objects.)
Every object has an identity, a type and a value. An object’sidentity neverchanges once it has been created; you may think of it as the object’s address inmemory. The ‘is’ operator compares the identity of two objects; theid() function returns an integer representing its identity.
CPython implementation detail: For CPython,id(x) is the memory address wherex is stored.
An object’s type determines the operations that the object supports (e.g., “doesit have a length?”) and also defines the possible values for objects of thattype. Thetype() function returns an object’s type (which is an objectitself). Like its identity, an object’stype is also unchangeable.[1]
Thevalue of some objects can change. Objects whose value canchange are said to bemutable; objects whose value is unchangeable once theyare created are calledimmutable. (The value of an immutable container objectthat contains a reference to a mutable object can change when the latter’s valueis changed; however the container is still considered immutable, because thecollection of objects it contains cannot be changed. So, immutability is notstrictly the same as having an unchangeable value, it is more subtle.) Anobject’s mutability is determined by its type; for instance, numbers, stringsand tuples are immutable, while dictionaries and lists are mutable.
Objects are never explicitly destroyed; however, when they become unreachablethey may be garbage-collected. An implementation is allowed to postpone garbagecollection or omit it altogether — it is a matter of implementation qualityhow garbage collection is implemented, as long as no objects are collected thatare still reachable.
CPython implementation detail: CPython currently uses a reference-counting scheme with (optional) delayeddetection of cyclically linked garbage, which collects most objects as soonas they become unreachable, but is not guaranteed to collect garbagecontaining circular references. See the documentation of thegcmodule for information on controlling the collection of cyclic garbage.Other implementations act differently and CPython may change.Do not depend on immediate finalization of objects when they becomeunreachable (so you should always close files explicitly).
Note that the use of the implementation’s tracing or debugging facilities maykeep objects alive that would normally be collectable. Also note that catchingan exception with a ‘try…except’ statement may keepobjects alive.
Some objects contain references to “external” resources such as open files orwindows. It is understood that these resources are freed when the object isgarbage-collected, but since garbage collection is not guaranteed to happen,such objects also provide an explicit way to release the external resource,usually aclose() method. Programs are strongly recommended to explicitlyclose such objects. The ‘try…finally’ statementand the ‘with’ statement provide convenient ways to do this.
Some objects contain references to other objects; these are calledcontainers.Examples of containers are tuples, lists and dictionaries. The references arepart of a container’s value. In most cases, when we talk about the value of acontainer, we imply the values, not the identities of the contained objects;however, when we talk about the mutability of a container, only the identitiesof the immediately contained objects are implied. So, if an immutable container(like a tuple) contains a reference to a mutable object, its value changes ifthat mutable object is changed.
Types affect almost all aspects of object behavior. Even the importance ofobject identity is affected in some sense: for immutable types, operations thatcompute new values may actually return a reference to any existing object withthe same type and value, while for mutable objects this is not allowed. E.g.,aftera=1;b=1,a andb may or may not refer to the same objectwith the value one, depending on the implementation, but afterc=[];d=[],c andd are guaranteed to refer to two different, unique, newlycreated empty lists. (Note thatc=d=[] assigns the same object to bothc andd.)
3.2. The standard type hierarchy¶
Below is a list of the types that are built into Python. Extension modules(written in C, Java, or other languages, depending on the implementation) candefine additional types. Future versions of Python may add types to the typehierarchy (e.g., rational numbers, efficiently stored arrays of integers, etc.),although such additions will often be provided via the standard library instead.
Some of the type descriptions below contain a paragraph listing ‘specialattributes.’ These are attributes that provide access to the implementation andare not intended for general use. Their definition may change in the future.
- None
This type has a single value. There is a single object with this value. Thisobject is accessed through the built-in name
None. It is used to signify theabsence of a value in many situations, e.g., it is returned from functions thatdon’t explicitly return anything. Its truth value is false.- NotImplemented
This type has a single value. There is a single object with this value. Thisobject is accessed through the built-in name
NotImplemented. Numeric methodsand rich comparison methods should return this value if they do not implement theoperation for the operands provided. (The interpreter will then try thereflected operation, or some other fallback, depending on the operator.) Itstruth value is true.SeeImplementing the arithmetic operationsfor more details.
- Ellipsis
This type has a single value. There is a single object with this value. Thisobject is accessed through the literal
...or the built-in nameEllipsis. Its truth value is true.numbers.NumberThese are created by numeric literals and returned as results by arithmeticoperators and arithmetic built-in functions. Numeric objects are immutable;once created their value never changes. Python numbers are of course stronglyrelated to mathematical numbers, but subject to the limitations of numericalrepresentation in computers.
Python distinguishes between integers, floating point numbers, and complexnumbers:
numbers.IntegralThese represent elements from the mathematical set of integers (positive andnegative).
There are two types of integers:
Integers (
int)These represent numbers in an unlimited range, subject to available (virtual)memory only. For the purpose of shift and mask operations, a binaryrepresentation is assumed, and negative numbers are represented in a variant of2’s complement which gives the illusion of an infinite string of sign bitsextending to the left.- Booleans (
bool) These represent the truth values False and True. The two objects representingthe values
FalseandTrueare the only Boolean objects. The Boolean type is asubtype of the integer type, and Boolean values behave like the values 0 and 1,respectively, in almost all contexts, the exception being that when converted toa string, the strings"False"or"True"are returned, respectively.
The rules for integer representation are intended to give the most meaningfulinterpretation of shift and mask operations involving negative integers.
- Booleans (
numbers.Real(float)These represent machine-level double precision floating point numbers. You areat the mercy of the underlying machine architecture (and C or Javaimplementation) for the accepted range and handling of overflow. Python does notsupport single-precision floating point numbers; the savings in processor andmemory usage that are usually the reason for using these are dwarfed by theoverhead of using objects in Python, so there is no reason to complicate thelanguage with two kinds of floating point numbers.
numbers.Complex(complex)These represent complex numbers as a pair of machine-level double precisionfloating point numbers. The same caveats apply as for floating point numbers.The real and imaginary parts of a complex number
zcan be retrieved throughthe read-only attributesz.realandz.imag.
- Sequences
These represent finite ordered sets indexed by non-negative numbers. Thebuilt-in function
len()returns the number of items of a sequence. Whenthe length of a sequence isn, the index set contains the numbers 0, 1,…,n-1. Itemi of sequencea is selected bya[i].Sequences also support slicing:
a[i:j]selects all items with indexk suchthati<=k<j. When used as an expression, a slice is asequence of the same type. This implies that the index set is renumbered sothat it starts at 0.Some sequences also support “extended slicing” with a third “step” parameter:
a[i:j:k]selects all items ofa with indexx wherex=i+n*k,n>=0andi<=x<j.Sequences are distinguished according to their mutability:
- Immutable sequences
An object of an immutable sequence type cannot change once it is created. (Ifthe object contains references to other objects, these other objects may bemutable and may be changed; however, the collection of objects directlyreferenced by an immutable object cannot change.)
The following types are immutable sequences:
- Strings
A string is a sequence of values that represent Unicode code points.All the code points in the range
U+0000-U+10FFFFcan berepresented in a string. Python doesn’t have achartype;instead, every code point in the string is represented as a stringobject with length1. The built-in functionord()converts a code point from its string form to an integer in therange0-10FFFF;chr()converts an integer in the range0-10FFFFto the corresponding length1string object.str.encode()can be used to convert astrtobytesusing the given text encoding, andbytes.decode()can be used to achieve the opposite.- Tuples
The items of a tuple are arbitrary Python objects. Tuples of two ormore items are formed by comma-separated lists of expressions. A tupleof one item (a ‘singleton’) can be formed by affixing a comma to anexpression (an expression by itself does not create a tuple, sinceparentheses must be usable for grouping of expressions). An emptytuple can be formed by an empty pair of parentheses.
- Bytes
A bytes object is an immutable array. The items are 8-bit bytes,represented by integers in the range 0 <= x < 256. Bytes literals(like
b'abc') and the built-in functionbytes()can be used toconstruct bytes objects. Also, bytes objects can be decoded to stringsvia thedecode()method.
- Mutable sequences
Mutable sequences can be changed after they are created. The subscription andslicing notations can be used as the target of assignment and
del(delete) statements.There are currently two intrinsic mutable sequence types:
- Lists
The items of a list are arbitrary Python objects. Lists are formed byplacing a comma-separated list of expressions in square brackets. (Notethat there are no special cases needed to form lists of length 0 or 1.)
- Byte Arrays
A bytearray object is a mutable array. They are created by the built-in
bytearray()constructor. Aside from being mutable (and henceunhashable), byte arrays otherwise provide the same interface andfunctionality as immutable bytes objects.
The extension module
arrayprovides an additional example of amutable sequence type, as does thecollectionsmodule.
- Set types
These represent unordered, finite sets of unique, immutable objects. As such,they cannot be indexed by any subscript. However, they can be iterated over, andthe built-in function
len()returns the number of items in a set. Commonuses for sets are fast membership testing, removing duplicates from a sequence,and computing mathematical operations such as intersection, union, difference,and symmetric difference.For set elements, the same immutability rules apply as for dictionary keys. Notethat numeric types obey the normal rules for numeric comparison: if two numberscompare equal (e.g.,
1and1.0), only one of them can be contained in aset.There are currently two intrinsic set types:
- Sets
These represent a mutable set. They are created by the built-in
set()constructor and can be modified afterwards by several methods, such asadd().- Frozen sets
These represent an immutable set. They are created by the built-in
frozenset()constructor. As a frozenset is immutable andhashable, it can be used again as an element of another set, or asa dictionary key.
- Mappings
These represent finite sets of objects indexed by arbitrary index sets. Thesubscript notation
a[k]selects the item indexed bykfrom the mappinga; this can be used in expressions and as the target of assignments ordelstatements. The built-in functionlen()returns the numberof items in a mapping.There is currently a single intrinsic mapping type:
- Dictionaries
These represent finite sets of objects indexed by nearly arbitrary values. Theonly types of values not acceptable as keys are values containing lists ordictionaries or other mutable types that are compared by value rather than byobject identity, the reason being that the efficient implementation ofdictionaries requires a key’s hash value to remain constant. Numeric types usedfor keys obey the normal rules for numeric comparison: if two numbers compareequal (e.g.,
1and1.0) then they can be used interchangeably to indexthe same dictionary entry.Dictionaries are mutable; they can be created by the
{...}notation (seesectionDictionary displays).The extension modules
dbm.ndbmanddbm.gnuprovideadditional examples of mapping types, as does thecollectionsmodule.
- Callable types
These are the types to which the function call operation (see sectionCalls) can be applied:
- User-defined functions
A user-defined function object is created by a function definition (seesectionFunction definitions). It should be called with an argument listcontaining the same number of items as the function’s formal parameterlist.
Special attributes:
Attribute Meaning __doc__The function’s documentationstring, or Noneifunavailable; not inherited bysubclassesWritable __name__The function’s name Writable __qualname__The function’squalified name
New in version 3.3.
Writable __module__The name of the module thefunction was defined in, or Noneif unavailable.Writable __defaults__A tuple containing defaultargument values for thosearguments that have defaults,or Noneif no argumentshave a default valueWritable __code__The code object representingthe compiled function body. Writable __globals__A reference to the dictionarythat holds the function’sglobal variables — theglobal namespace of themodule in which the functionwas defined. Read-only __dict__The namespace supportingarbitrary functionattributes. Writable __closure__Noneor a tuple of cellsthat contain bindings for thefunction’s free variables.Read-only __annotations__A dict containing annotationsof parameters. The keys ofthe dict are the parameternames, and 'return'forthe return annotation, ifprovided.Writable __kwdefaults__A dict containing defaultsfor keyword-only parameters. Writable Most of the attributes labelled “Writable” check the type of the assigned value.
Function objects also support getting and setting arbitrary attributes, whichcan be used, for example, to attach metadata to functions. Regular attributedot-notation is used to get and set such attributes.Note that the currentimplementation only supports function attributes on user-defined functions.Function attributes on built-in functions may be supported in the future.
Additional information about a function’s definition can be retrieved from itscode object; see the description of internal types below.
- Instance methods
An instance method object combines a class, a class instance and anycallable object (normally a user-defined function).
Special read-only attributes:
__self__is the class instance object,__func__is the function object;__doc__is the method’sdocumentation (same as__func__.__doc__);__name__is themethod name (same as__func__.__name__);__module__is thename of the module the method was defined in, orNoneif unavailable.Methods also support accessing (but not setting) the arbitrary functionattributes on the underlying function object.
User-defined method objects may be created when getting an attribute of aclass (perhaps via an instance of that class), if that attribute is auser-defined function object or a class method object.
When an instance method object is created by retrieving a user-definedfunction object from a class via one of its instances, its
__self__attribute is the instance, and the method object is saidto be bound. The new method’s__func__attribute is the originalfunction object.When a user-defined method object is created by retrieving another methodobject from a class or instance, the behaviour is the same as for afunction object, except that the
__func__attribute of the newinstance is not the original method object but its__func__attribute.When an instance method object is created by retrieving a class methodobject from a class or instance, its
__self__attribute is theclass itself, and its__func__attribute is the function objectunderlying the class method.When an instance method object is called, the underlying function(
__func__) is called, inserting the class instance(__self__) in front of the argument list. For instance, whenCis a class which contains a definition for a functionf(), andxis an instance ofC, callingx.f(1)isequivalent to callingC.f(x,1).When an instance method object is derived from a class method object, the“class instance” stored in
__self__will actually be the classitself, so that calling eitherx.f(1)orC.f(1)is equivalent tocallingf(C,1)wherefis the underlying function.Note that the transformation from function object to instance methodobject happens each time the attribute is retrieved from the instance. Insome cases, a fruitful optimization is to assign the attribute to a localvariable and call that local variable. Also notice that thistransformation only happens for user-defined functions; other callableobjects (and all non-callable objects) are retrieved withouttransformation. It is also important to note that user-defined functionswhich are attributes of a class instance are not converted to boundmethods; thisonly happens when the function is an attribute of theclass.
- Generator functions
A function or method which uses the
yieldstatement (see sectionThe yield statement) is called agenerator function. Such a function, whencalled, always returns an iterator object which can be used to execute thebody of the function: calling the iterator’siterator.__next__()method will cause the function to execute until it provides a valueusing theyieldstatement. When the function executes areturnstatement or falls off the end, aStopIterationexception is raised and the iterator will have reached the end of the set ofvalues to be returned.- Coroutine functions
A function or method which is defined using
asyncdefis calledacoroutine function. Such a function, when called, returns acoroutine object. It may containawaitexpressions,as well asasyncwithandasyncforstatements. Seealso theCoroutine Objects section.- Built-in functions
A built-in function object is a wrapper around a C function. Examples ofbuilt-in functions are
len()andmath.sin()(mathis astandard built-in module). The number and type of the arguments aredetermined by the C function. Special read-only attributes:__doc__is the function’s documentation string, orNoneifunavailable;__name__is the function’s name;__self__isset toNone(but see the next item);__module__is the name ofthe module the function was defined in orNoneif unavailable.- Built-in methods
This is really a different disguise of a built-in function, this time containingan object passed to the C function as an implicit extra argument. An example ofa built-in method is
alist.append(), assumingalist is a list object. Inthis case, the special read-only attribute__self__is set to the objectdenoted byalist.- Classes
- Classes are callable. These objects normally act as factories for newinstances of themselves, but variations are possible for class types thatoverride
__new__(). The arguments of the call are passed to__new__()and, in the typical case, to__init__()toinitialize the new instance. - Class Instances
- Instances of arbitrary classes can be made callable by defining a
__call__()method in their class.
- Modules
Modules are a basic organizational unit of Python code, and are created bytheimport system as invoked either by the
importstatement (seeimport), or by callingfunctions such asimportlib.import_module()and built-in__import__(). A module object has a namespace implemented by adictionary object (this is the dictionary referenced by the__globals__attribute of functions defined in the module). Attribute references aretranslated to lookups in this dictionary, e.g.,m.xis equivalent tom.__dict__["x"]. A module object does not contain the code object usedto initialize the module (since it isn’t needed once the initialization isdone).Attribute assignment updates the module’s namespace dictionary, e.g.,
m.x=1is equivalent tom.__dict__["x"]=1.Special read-only attribute:
__dict__is the module’s namespace as adictionary object.CPython implementation detail: Because of the way CPython clears module dictionaries, the moduledictionary will be cleared when the module falls out of scope even if thedictionary still has live references. To avoid this, copy the dictionaryor keep the module around while using its dictionary directly.
Predefined (writable) attributes:
__name__is the module’s name;__doc__is the module’s documentation string, orNoneifunavailable;__file__is the pathname of the file from which themodule was loaded, if it was loaded from a file. The__file__attribute may be missing for certain types of modules, such as C modulesthat are statically linked into the interpreter; for extension modulesloaded dynamically from a shared library, it is the pathname of the sharedlibrary file.- Custom classes
Custom class types are typically created by class definitions (see sectionClass definitions). A class has a namespace implemented by a dictionary object.Class attribute references are translated to lookups in this dictionary, e.g.,
C.xis translated toC.__dict__["x"](although there are a number ofhooks which allow for other means of locating attributes). When the attributename is not found there, the attribute search continues in the base classes.This search of the base classes uses the C3 method resolution order whichbehaves correctly even in the presence of ‘diamond’ inheritance structureswhere there are multiple inheritance paths leading back to a common ancestor.Additional details on the C3 MRO used by Python can be found in thedocumentation accompanying the 2.3 release athttps://www.python.org/download/releases/2.3/mro/.When a class attribute reference (for class
C, say) would yield aclass method object, it is transformed into an instance method object whose__self__attributes isC. When it would yield a staticmethod object, it is transformed into the object wrapped by the static methodobject. See sectionImplementing Descriptors for another way in which attributesretrieved from a class may differ from those actually contained in its__dict__.Class attribute assignments update the class’s dictionary, never the dictionaryof a base class.
A class object can be called (see above) to yield a class instance (see below).
Special attributes:
__name__is the class name;__module__isthe module name in which the class was defined;__dict__is thedictionary containing the class’s namespace;__bases__is atuple containing the base classes, in the order of their occurrence in thebase class list;__doc__is the class’s documentation string, orNoneif undefined.- Class instances
A class instance is created by calling a class object (see above). A classinstance has a namespace implemented as a dictionary which is the first placein which attribute references are searched. When an attribute is not foundthere, and the instance’s class has an attribute by that name, the searchcontinues with the class attributes. If a class attribute is found that is auser-defined function object, it is transformed into an instance methodobject whose
__self__attribute is the instance. Static method andclass method objects are also transformed; see above under “Classes”. SeesectionImplementing Descriptors for another way in which attributes of a classretrieved via its instances may differ from the objects actually stored inthe class’s__dict__. If no class attribute is found, and theobject’s class has a__getattr__()method, that is called to satisfythe lookup.Attribute assignments and deletions update the instance’s dictionary, never aclass’s dictionary. If the class has a
__setattr__()or__delattr__()method, this is called instead of updating the instancedictionary directly.Class instances can pretend to be numbers, sequences, or mappings if they havemethods with certain special names. See sectionSpecial method names.
Special attributes:
__dict__is the attribute dictionary;__class__is the instance’s class.- I/O objects (also known as file objects)
Afile object represents an open file. Various shortcuts areavailable to create file objects: the
open()built-in function, andalsoos.popen(),os.fdopen(), and themakefile()method of socket objects (and perhaps byother functions or methods provided by extension modules).The objects
sys.stdin,sys.stdoutandsys.stderrareinitialized to file objects corresponding to the interpreter’s standardinput, output and error streams; they are all open in text mode andtherefore follow the interface defined by theio.TextIOBaseabstract class.- Internal types
A few types used internally by the interpreter are exposed to the user. Theirdefinitions may change with future versions of the interpreter, but they arementioned here for completeness.
- Code objects
Code objects representbyte-compiled executable Python code, orbytecode.The difference between a code object and a function object is that the functionobject contains an explicit reference to the function’s globals (the module inwhich it was defined), while a code object contains no context; also the defaultargument values are stored in the function object, not in the code object(because they represent values calculated at run-time). Unlike functionobjects, code objects are immutable and contain no references (directly orindirectly) to mutable objects.
Special read-only attributes:
co_namegives the function name;co_argcountis the number of positional arguments (including argumentswith default values);co_nlocalsis the number of local variables usedby the function (including arguments);co_varnamesis a tuple containingthe names of the local variables (starting with the argument names);co_cellvarsis a tuple containing the names of local variables that arereferenced by nested functions;co_freevarsis a tuple containing thenames of free variables;co_codeis a string representing the sequenceof bytecode instructions;co_constsis a tuple containing the literalsused by the bytecode;co_namesis a tuple containing the names used bythe bytecode;co_filenameis the filename from which the code wascompiled;co_firstlinenois the first line number of the function;co_lnotabis a string encoding the mapping from bytecode offsets toline numbers (for details see the source code of the interpreter);co_stacksizeis the required stack size (including local variables);co_flagsis an integer encoding a number of flags for the interpreter.The following flag bits are defined for
co_flags: bit0x04is set ifthe function uses the*argumentssyntax to accept an arbitrary number ofpositional arguments; bit0x08is set if the function uses the**keywordssyntax to accept arbitrary keyword arguments; bit0x20is setif the function is a generator.Future feature declarations (
from__future__importdivision) also use bitsinco_flagsto indicate whether a code object was compiled with aparticular feature enabled: bit0x2000is set if the function was compiledwith future division enabled; bits0x10and0x1000were used in earlierversions of Python.Other bits in
co_flagsare reserved for internal use.If a code object represents a function, the first item in
co_constsisthe documentation string of the function, orNoneif undefined.
- Frame objects
Frame objects represent execution frames. They may occur in traceback objects(see below).
Special read-only attributes:
f_backis to the previous stack frame(towards the caller), orNoneif this is the bottom stack frame;f_codeis the code object being executed in this frame;f_localsis the dictionary used to look up local variables;f_globalsis used forglobal variables;f_builtinsis used for built-in (intrinsic) names;f_lastigives the precise instruction (this is an index into thebytecode string of the code object).Special writable attributes:
f_trace, if notNone, is a functioncalled at the start of each source code line (this is used by the debugger);f_linenois the current line number of the frame — writing to thisfrom within a trace function jumps to the given line (only for the bottom-mostframe). A debugger can implement a Jump command (aka Set Next Statement)by writing to f_lineno.Frame objects support one method:
frame.clear()¶This method clears all references to local variables held by theframe. Also, if the frame belonged to a generator, the generatoris finalized. This helps break reference cycles involving frameobjects (for example when catching an exception and storing itstraceback for later use).
RuntimeErroris raised if the frame is currently executing.New in version 3.4.
- Traceback objects
Traceback objects represent a stack trace of an exception. A traceback objectis created when an exception occurs. When the search for an exception handlerunwinds the execution stack, at each unwound level a traceback object isinserted in front of the current traceback. When an exception handler isentered, the stack trace is made available to the program. (See sectionThe try statement.) It is accessible as the third item of thetuple returned by
sys.exc_info(). When the program contains no suitablehandler, the stack trace is written (nicely formatted) to the standard errorstream; if the interpreter is interactive, it is also made available to the userassys.last_traceback.Special read-only attributes:
tb_nextis the next level in the stacktrace (towards the frame where the exception occurred), orNoneif there isno next level;tb_framepoints to the execution frame of the currentlevel;tb_linenogives the line number where the exception occurred;tb_lastiindicates the precise instruction. The line number and lastinstruction in the traceback may differ from the line number of its frame objectif the exception occurred in atrystatement with no matching exceptclause or with a finally clause.- Slice objects
Slice objects are used to represent slices for
__getitem__()methods. They are also created by the built-inslice()function.Special read-only attributes:
startis the lower bound;stopis the upper bound;stepis the stepvalue; each isNoneif omitted. These attributes can have any type.Slice objects support one method:
slice.indices(self,length)¶This method takes a single integer argumentlength and computesinformation about the slice that the slice object would describe ifapplied to a sequence oflength items. It returns a tuple of threeintegers; respectively these are thestart andstop indices and thestep or stride length of the slice. Missing or out-of-bounds indicesare handled in a manner consistent with regular slices.
- Static method objects
- Static method objects provide a way of defeating the transformation of functionobjects to method objects described above. A static method object is a wrapperaround any other object, usually a user-defined method object. When a staticmethod object is retrieved from a class or a class instance, the object actuallyreturned is the wrapped object, which is not subject to any furthertransformation. Static method objects are not themselves callable, although theobjects they wrap usually are. Static method objects are created by the built-in
staticmethod()constructor. - Class method objects
- A class method object, like a static method object, is a wrapper around anotherobject that alters the way in which that object is retrieved from classes andclass instances. The behaviour of class method objects upon such retrieval isdescribed above, under “User-defined methods”. Class method objects are createdby the built-in
classmethod()constructor.
3.3. Special method names¶
A class can implement certain operations that are invoked by special syntax(such as arithmetic operations or subscripting and slicing) by defining methodswith special names. This is Python’s approach tooperator overloading,allowing classes to define their own behavior with respect to languageoperators. For instance, if a class defines a method named__getitem__(),andx is an instance of this class, thenx[i] is roughly equivalenttotype(x).__getitem__(x,i). Except where mentioned, attempts to execute anoperation raise an exception when no appropriate method is defined (typicallyAttributeError orTypeError).
When implementing a class that emulates any built-in type, it is important thatthe emulation only be implemented to the degree that it makes sense for theobject being modelled. For example, some sequences may work well with retrievalof individual elements, but extracting a slice may not make sense. (One exampleof this is theNodeList interface in the W3C’s DocumentObject Model.)
3.3.1. Basic customization¶
object.__new__(cls[,...])¶Called to create a new instance of classcls.
__new__()is a staticmethod (special-cased so you need not declare it as such) that takes the classof which an instance was requested as its first argument. The remainingarguments are those passed to the object constructor expression (the call to theclass). The return value of__new__()should be the new object instance(usually an instance ofcls).Typical implementations create a new instance of the class by invoking thesuperclass’s
__new__()method usingsuper().__new__(cls[,...])with appropriate arguments and then modifying the newly-created instanceas necessary before returning it.If
__new__()returns an instance ofcls, then the new instance’s__init__()method will be invoked like__init__(self[,...]), whereself is the new instance and the remaining arguments are the same as werepassed to__new__().If
__new__()does not return an instance ofcls, then the new instance’s__init__()method will not be invoked.__new__()is intended mainly to allow subclasses of immutable types (likeint, str, or tuple) to customize instance creation. It is also commonlyoverridden in custom metaclasses in order to customize class creation.
object.__init__(self[,...])¶Called after the instance has been created (by
__new__()), but beforeit is returned to the caller. The arguments are those passed to theclass constructor expression. If a base class has an__init__()method, the derived class’s__init__()method, if any, must explicitlycall it to ensure proper initialization of the base class part of theinstance; for example:super().__init__([args...]).Because
__new__()and__init__()work together in constructingobjects (__new__()to create it, and__init__()to customize it),no non-Nonevalue may be returned by__init__(); doing so willcause aTypeErrorto be raised at runtime.
object.__del__(self)¶Called when the instance is about to be destroyed. This is also called adestructor. If a base class has a
__del__()method, the derived class’s__del__()method, if any, must explicitly call it to ensure properdeletion of the base class part of the instance. Note that it is possible(though not recommended!) for the__del__()method to postpone destructionof the instance by creating a new reference to it. It may then be called at alater time when this new reference is deleted. It is not guaranteed that__del__()methods are called for objects that still exist when theinterpreter exits.Note
delxdoesn’t directly callx.__del__()— the former decrementsthe reference count forxby one, and the latter is only called whenx’s reference count reaches zero. Some common situations that mayprevent the reference count of an object from going to zero include:circular references between objects (e.g., a doubly-linked list or a treedata structure with parent and child pointers); a reference to the objecton the stack frame of a function that caught an exception (the tracebackstored insys.exc_info()[2]keeps the stack frame alive); or areference to the object on the stack frame that raised an unhandledexception in interactive mode (the traceback stored insys.last_tracebackkeeps the stack frame alive). The first situationcan only be remedied by explicitly breaking the cycles; the second can beresolved by freeing the reference to the traceback object when it is nolonger useful, and the third can be resolved by storingNoneinsys.last_traceback.Circular references which are garbage are detected and cleaned up whenthe cyclic garbage collector is enabled (it’s on by default). Refer to thedocumentation for thegcmodule for more information about thistopic.Warning
Due to the precarious circumstances under which
__del__()methods areinvoked, exceptions that occur during their execution are ignored, and a warningis printed tosys.stderrinstead. Also, when__del__()is invoked inresponse to a module being deleted (e.g., when execution of the program isdone), other globals referenced by the__del__()method may already havebeen deleted or in the process of being torn down (e.g. the importmachinery shutting down). For this reason,__del__()methodsshould do the absoluteminimum needed to maintain external invariants. Starting with version 1.5,Python guarantees that globals whose name begins with a single underscore aredeleted from their module before other globals are deleted; if no otherreferences to such globals exist, this may help in assuring that importedmodules are still available at the time when the__del__()method iscalled.
object.__repr__(self)¶Called by the
repr()built-in function to compute the “official” stringrepresentation of an object. If at all possible, this should look like avalid Python expression that could be used to recreate an object with thesame value (given an appropriate environment). If this is not possible, astring of the form<...someusefuldescription...>should be returned.The return value must be a string object. If a class defines__repr__()but not__str__(), then__repr__()is also used when an“informal” string representation of instances of that class is required.This is typically used for debugging, so it is important that the representationis information-rich and unambiguous.
object.__str__(self)¶Called by
str(object)and the built-in functionsformat()andprint()to compute the “informal” or nicelyprintable string representation of an object. The return value must be astring object.This method differs from
object.__repr__()in that there is noexpectation that__str__()return a valid Python expression: a moreconvenient or concise representation can be used.The default implementation defined by the built-in type
objectcallsobject.__repr__().
object.__bytes__(self)¶Called by
bytes()to compute a byte-string representation of anobject. This should return abytesobject.
object.__format__(self,format_spec)¶Called by the
format()built-in function (and by extension, thestr.format()method of classstr) to produce a “formatted”string representation of an object. Theformat_specargument isa string that contains a description of the formatting options desired.The interpretation of theformat_specargument is up to the typeimplementing__format__(), however most classes will eitherdelegate formatting to one of the built-in types, or use a similarformatting option syntax.SeeFormat Specification Mini-Language for a description of the standard formatting syntax.
The return value must be a string object.
Changed in version 3.4:The __format__ method of
objectitself raises aTypeErrorif passed any non-empty string.
object.__lt__(self,other)¶object.__le__(self,other)¶object.__eq__(self,other)¶object.__ne__(self,other)¶object.__gt__(self,other)¶object.__ge__(self,other)¶These are the so-called “rich comparison” methods. The correspondence betweenoperator symbols and method names is as follows:
x<ycallsx.__lt__(y),x<=ycallsx.__le__(y),x==ycallsx.__eq__(y),x!=ycallsx.__ne__(y),x>ycallsx.__gt__(y), andx>=ycallsx.__ge__(y).A rich comparison method may return the singleton
NotImplementedif it doesnot implement the operation for a given pair of arguments. By convention,FalseandTrueare returned for a successful comparison. However, thesemethods can return any value, so if the comparison operator is used in a Booleancontext (e.g., in the condition of anifstatement), Python will callbool()on the value to determine if the result is true or false.By default,
__ne__()delegates to__eq__()andinverts the result unless it isNotImplemented. There are no otherimplied relationships among the comparison operators, for example,the truth of(x<yorx==y)does not implyx<=y.To automatically generate ordering operations from a single root operation,seefunctools.total_ordering().See the paragraph on
__hash__()forsome important notes on creatinghashable objects which supportcustom comparison operations and are usable as dictionary keys.There are no swapped-argument versions of these methods (to be used when theleft argument does not support the operation but the right argument does);rather,
__lt__()and__gt__()are each other’s reflection,__le__()and__ge__()are each other’s reflection, and__eq__()and__ne__()are their own reflection.If the operands are of different types, and right operand’s type isa direct or indirect subclass of the left operand’s type,the reflected method of the right operand has priority, otherwisethe left operand’s method has priority. Virtual subclassing isnot considered.
object.__hash__(self)¶Called by built-in function
hash()and for operations on members ofhashed collections includingset,frozenset, anddict.__hash__()should return an integer. The only requiredproperty is that objects which compare equal have the same hash value; it isadvised to mix together the hash values of the components of the object thatalso play a part in comparison of objects by packing them into a tuple andhashing the tuple. Example:def__hash__(self):returnhash((self.name,self.nick,self.color))
Note
hash()truncates the value returned from an object’s custom__hash__()method to the size of aPy_ssize_t. This istypically 8 bytes on 64-bit builds and 4 bytes on 32-bit builds. If anobject’s__hash__()must interoperate on builds of different bitsizes, be sure to check the width on all supported builds. An easy wayto do this is withpython-c"importsys;print(sys.hash_info.width)".If a class does not define an
__eq__()method it should not define a__hash__()operation either; if it defines__eq__()but not__hash__(), its instances will not be usable as items in hashablecollections. If a class defines mutable objects and implements an__eq__()method, it should not implement__hash__(), since theimplementation of hashable collections requires that a key’s hash value isimmutable (if the object’s hash value changes, it will be in the wrong hashbucket).User-defined classes have
__eq__()and__hash__()methodsby default; with them, all objects compare unequal (except with themselves)andx.__hash__()returns an appropriate value such thatx==yimplies both thatxisyandhash(x)==hash(y).A class that overrides
__eq__()and does not define__hash__()will have its__hash__()implicitly set toNone. When the__hash__()method of a class isNone, instances of the class willraise an appropriateTypeErrorwhen a program attempts to retrievetheir hash value, and will also be correctly identified as unhashable whencheckingisinstance(obj,collections.Hashable).If a class that overrides
__eq__()needs to retain the implementationof__hash__()from a parent class, the interpreter must be told thisexplicitly by setting__hash__=<ParentClass>.__hash__.If a class that does not override
__eq__()wishes to suppress hashsupport, it should include__hash__=Nonein the class definition.A class which defines its own__hash__()that explicitly raisesaTypeErrorwould be incorrectly identified as hashable byanisinstance(obj,collections.Hashable)call.Note
By default, the
__hash__()values of str, bytes and datetimeobjects are “salted” with an unpredictable random value. Although theyremain constant within an individual Python process, they are notpredictable between repeated invocations of Python.This is intended to provide protection against a denial-of-service causedby carefully-chosen inputs that exploit the worst case performance of adict insertion, O(n^2) complexity. Seehttp://www.ocert.org/advisories/ocert-2011-003.html for details.
Changing hash values affects the iteration order of dicts, sets andother mappings. Python has never made guarantees about this ordering(and it typically varies between 32-bit and 64-bit builds).
See also
PYTHONHASHSEED.Changed in version 3.3:Hash randomization is enabled by default.
object.__bool__(self)¶Called to implement truth value testing and the built-in operation
bool(); should returnFalseorTrue. When this method is notdefined,__len__()is called, if it is defined, and the object isconsidered true if its result is nonzero. If a class defines neither__len__()nor__bool__(), all its instances are consideredtrue.
3.3.2. Customizing attribute access¶
The following methods can be defined to customize the meaning of attributeaccess (use of, assignment to, or deletion ofx.name) for class instances.
object.__getattr__(self,name)¶Called when an attribute lookup has not found the attribute in the usual places(i.e. it is not an instance attribute nor is it found in the class tree for
self).nameis the attribute name. This method should return the(computed) attribute value or raise anAttributeErrorexception.Note that if the attribute is found through the normal mechanism,
__getattr__()is not called. (This is an intentional asymmetry between__getattr__()and__setattr__().) This is done both for efficiencyreasons and because otherwise__getattr__()would have no way to accessother attributes of the instance. Note that at least for instance variables,you can fake total control by not inserting any values in the instance attributedictionary (but instead inserting them in another object). See the__getattribute__()method below for a way to actually get total controlover attribute access.
object.__getattribute__(self,name)¶Called unconditionally to implement attribute accesses for instances of theclass. If the class also defines
__getattr__(), the latter will not becalled unless__getattribute__()either calls it explicitly or raises anAttributeError. This method should return the (computed) attribute valueor raise anAttributeErrorexception. In order to avoid infiniterecursion in this method, its implementation should always call the base classmethod with the same name to access any attributes it needs, for example,object.__getattribute__(self,name).Note
This method may still be bypassed when looking up special methods as theresult of implicit invocation via language syntax or built-in functions.SeeSpecial method lookup.
object.__setattr__(self,name,value)¶Called when an attribute assignment is attempted. This is called instead ofthe normal mechanism (i.e. store the value in the instance dictionary).name is the attribute name,value is the value to be assigned to it.
If
__setattr__()wants to assign to an instance attribute, it shouldcall the base class method with the same name, for example,object.__setattr__(self,name,value).
object.__delattr__(self,name)¶Like
__setattr__()but for attribute deletion instead of assignment. Thisshould only be implemented ifdelobj.nameis meaningful for the object.
object.__dir__(self)¶Called when
dir()is called on the object. A sequence must bereturned.dir()converts the returned sequence to a list and sorts it.
3.3.2.1. Implementing Descriptors¶
The following methods only apply when an instance of the class containing themethod (a so-calleddescriptor class) appears in anowner class (thedescriptor must be in either the owner’s class dictionary or in the classdictionary for one of its parents). In the examples below, “the attribute”refers to the attribute whose name is the key of the property in the ownerclass’__dict__.
object.__get__(self,instance,owner)¶Called to get the attribute of the owner class (class attribute access) or of aninstance of that class (instance attribute access).owner is always the ownerclass, whileinstance is the instance that the attribute was accessed through,or
Nonewhen the attribute is accessed through theowner. This methodshould return the (computed) attribute value or raise anAttributeErrorexception.
object.__set__(self,instance,value)¶Called to set the attribute on an instanceinstance of the owner class to anew value,value.
object.__delete__(self,instance)¶Called to delete the attribute on an instanceinstance of the owner class.
The attribute__objclass__ is interpreted by theinspect moduleas specifying the class where this object was defined (setting thisappropriately can assist in runtime introspection of dynamic class attributes).For callables, it may indicate that an instance of the given type (or asubclass) is expected or required as the first positional argument (for example,CPython sets this attribute for unbound methods that are implemented in C).
3.3.2.2. Invoking Descriptors¶
In general, a descriptor is an object attribute with “binding behavior”, onewhose attribute access has been overridden by methods in the descriptorprotocol:__get__(),__set__(), and__delete__(). If any ofthose methods are defined for an object, it is said to be a descriptor.
The default behavior for attribute access is to get, set, or delete theattribute from an object’s dictionary. For instance,a.x has a lookup chainstarting witha.__dict__['x'], thentype(a).__dict__['x'], andcontinuing through the base classes oftype(a) excluding metaclasses.
However, if the looked-up value is an object defining one of the descriptormethods, then Python may override the default behavior and invoke the descriptormethod instead. Where this occurs in the precedence chain depends on whichdescriptor methods were defined and how they were called.
The starting point for descriptor invocation is a binding,a.x. How thearguments are assembled depends ona:
- Direct Call
- The simplest and least common call is when user code directly invokes adescriptor method:
x.__get__(a). - Instance Binding
- If binding to an object instance,
a.xis transformed into the call:type(a).__dict__['x'].__get__(a,type(a)). - Class Binding
- If binding to a class,
A.xis transformed into the call:A.__dict__['x'].__get__(None,A). - Super Binding
- If
ais an instance ofsuper, then the bindingsuper(B,obj).m()searchesobj.__class__.__mro__for the base classAimmediately precedingBand then invokes the descriptor with the call:A.__dict__['m'].__get__(obj,obj.__class__).
For instance bindings, the precedence of descriptor invocation depends on thewhich descriptor methods are defined. A descriptor can define any combinationof__get__(),__set__() and__delete__(). If it does notdefine__get__(), then accessing the attribute will return the descriptorobject itself unless there is a value in the object’s instance dictionary. Ifthe descriptor defines__set__() and/or__delete__(), it is a datadescriptor; if it defines neither, it is a non-data descriptor. Normally, datadescriptors define both__get__() and__set__(), while non-datadescriptors have just the__get__() method. Data descriptors with__set__() and__get__() defined always override a redefinition in aninstance dictionary. In contrast, non-data descriptors can be overridden byinstances.
Python methods (includingstaticmethod() andclassmethod()) areimplemented as non-data descriptors. Accordingly, instances can redefine andoverride methods. This allows individual instances to acquire behaviors thatdiffer from other instances of the same class.
Theproperty() function is implemented as a data descriptor. Accordingly,instances cannot override the behavior of a property.
3.3.2.3. __slots__¶
By default, instances of classes have a dictionary for attribute storage. Thiswastes space for objects having very few instance variables. The spaceconsumption can become acute when creating large numbers of instances.
The default can be overridden by defining__slots__ in a class definition.The__slots__ declaration takes a sequence of instance variables and reservesjust enough space in each instance to hold a value for each variable. Space issaved because__dict__ is not created for each instance.
object.__slots__¶This class variable can be assigned a string, iterable, or sequence ofstrings with variable names used by instances.__slots__ reserves spacefor the declared variables and prevents the automatic creation of__dict__and__weakref__ for each instance.
3.3.2.3.1. Notes on using__slots__¶
- When inheriting from a class without__slots__, the__dict__ attribute ofthat class will always be accessible, so a__slots__ definition in thesubclass is meaningless.
- Without a__dict__ variable, instances cannot be assigned new variables notlisted in the__slots__ definition. Attempts to assign to an unlistedvariable name raises
AttributeError. If dynamic assignment of newvariables is desired, then add'__dict__'to the sequence of strings inthe__slots__ declaration. - Without a__weakref__ variable for each instance, classes defining__slots__ do not support weak references to its instances. If weak referencesupport is needed, then add
'__weakref__'to the sequence of strings in the__slots__ declaration. - __slots__ are implemented at the class level by creating descriptors(Implementing Descriptors) for each variable name. As a result, class attributescannot be used to set default values for instance variables defined by__slots__; otherwise, the class attribute would overwrite the descriptorassignment.
- The action of a__slots__ declaration is limited to the class where it isdefined. As a result, subclasses will have a__dict__ unless they also define__slots__ (which must only contain names of anyadditional slots).
- If a class defines a slot also defined in a base class, the instance variabledefined by the base class slot is inaccessible (except by retrieving itsdescriptor directly from the base class). This renders the meaning of theprogram undefined. In the future, a check may be added to prevent this.
- Nonempty__slots__ does not work for classes derived from “variable-length”built-in types such as
int,bytesandtuple. - Any non-string iterable may be assigned to__slots__. Mappings may also beused; however, in the future, special meaning may be assigned to the valuescorresponding to each key.
- __class__ assignment works only if both classes have the same__slots__.
3.3.3. Customizing class creation¶
By default, classes are constructed usingtype(). The class body isexecuted in a new namespace and the class name is bound locally to theresult oftype(name,bases,namespace).
The class creation process can be customized by passing themetaclasskeyword argument in the class definition line, or by inheriting from anexisting class that included such an argument. In the following example,bothMyClass andMySubclass are instances ofMeta:
classMeta(type):passclassMyClass(metaclass=Meta):passclassMySubclass(MyClass):pass
Any other keyword arguments that are specified in the class definition arepassed through to all metaclass operations described below.
When a class definition is executed, the following steps occur:
- the appropriate metaclass is determined
- the class namespace is prepared
- the class body is executed
- the class object is created
3.3.3.1. Determining the appropriate metaclass¶
The appropriate metaclass for a class definition is determined as follows:
- if no bases and no explicit metaclass are given, then
type()is used - if an explicit metaclass is given and it isnot an instance of
type(), then it is used directly as the metaclass - if an instance of
type()is given as the explicit metaclass, orbases are defined, then the most derived metaclass is used
The most derived metaclass is selected from the explicitly specifiedmetaclass (if any) and the metaclasses (i.e.type(cls)) of all specifiedbase classes. The most derived metaclass is one which is a subtype ofallof these candidate metaclasses. If none of the candidate metaclasses meetsthat criterion, then the class definition will fail withTypeError.
3.3.3.2. Preparing the class namespace¶
Once the appropriate metaclass has been identified, then the class namespaceis prepared. If the metaclass has a__prepare__ attribute, it is calledasnamespace=metaclass.__prepare__(name,bases,**kwds) (where theadditional keyword arguments, if any, come from the class definition).
If the metaclass has no__prepare__ attribute, then the class namespaceis initialised as an emptydict() instance.
See also
- PEP 3115 - Metaclasses in Python 3000
- Introduced the
__prepare__namespace hook
3.3.3.3. Executing the class body¶
The class body is executed (approximately) asexec(body,globals(),namespace). The key difference from a normalcall toexec() is that lexical scoping allows the class body (includingany methods) to reference names from the current and outer scopes when theclass definition occurs inside a function.
However, even when the class definition occurs inside the function, methodsdefined inside the class still cannot see names defined at the class scope.Class variables must be accessed through the first parameter of instance orclass methods, and cannot be accessed at all from static methods.
3.3.3.4. Creating the class object¶
Once the class namespace has been populated by executing the class body,the class object is created by callingmetaclass(name,bases,namespace,**kwds) (the additional keywordspassed here are the same as those passed to__prepare__).
This class object is the one that will be referenced by the zero-argumentform ofsuper().__class__ is an implicit closure referencecreated by the compiler if any methods in a class body refer to either__class__ orsuper. This allows the zero argument form ofsuper() to correctly identify the class being defined based onlexical scoping, while the class or instance that was used to make thecurrent call is identified based on the first argument passed to the method.
After the class object is created, it is passed to the class decoratorsincluded in the class definition (if any) and the resulting object is boundin the local namespace as the defined class.
When a new class is created bytype.__new__, the object provided as thenamespace parameter is copied to a standard Python dictionary and the originalobject is discarded. The new copy becomes the__dict__ attributeof the class object.
See also
- PEP 3135 - New super
- Describes the implicit
__class__closure reference
3.3.3.5. Metaclass example¶
The potential uses for metaclasses are boundless. Some ideas that have beenexplored include logging, interface checking, automatic delegation, automaticproperty creation, proxies, frameworks, and automatic resourcelocking/synchronization.
Here is an example of a metaclass that uses ancollections.OrderedDictto remember the order that class variables are defined:
classOrderedClass(type):@classmethoddef__prepare__(metacls,name,bases,**kwds):returncollections.OrderedDict()def__new__(cls,name,bases,namespace,**kwds):result=type.__new__(cls,name,bases,dict(namespace))result.members=tuple(namespace)returnresultclassA(metaclass=OrderedClass):defone(self):passdeftwo(self):passdefthree(self):passdeffour(self):pass>>>A.members('__module__','one','two','three','four')
When the class definition forA gets executed, the process begins withcalling the metaclass’s__prepare__() method which returns an emptycollections.OrderedDict. That mapping records the methods andattributes ofA as they are defined within the body of the class statement.Once those definitions are executed, the ordered dictionary is fully populatedand the metaclass’s__new__() method gets invoked. That method buildsthe new type and it saves the ordered dictionary keys in an attributecalledmembers.
3.3.4. Customizing instance and subclass checks¶
The following methods are used to override the default behavior of theisinstance() andissubclass() built-in functions.
In particular, the metaclassabc.ABCMeta implements these methods inorder to allow the addition of Abstract Base Classes (ABCs) as “virtual baseclasses” to any class or type (including built-in types), including otherABCs.
class.__instancecheck__(self,instance)¶Return true ifinstance should be considered a (direct or indirect)instance ofclass. If defined, called to implement
isinstance(instance,class).
class.__subclasscheck__(self,subclass)¶Return true ifsubclass should be considered a (direct or indirect)subclass ofclass. If defined, called to implement
issubclass(subclass,class).
Note that these methods are looked up on the type (metaclass) of a class. Theycannot be defined as class methods in the actual class. This is consistent withthe lookup of special methods that are called on instances, only in thiscase the instance is itself a class.
See also
- PEP 3119 - Introducing Abstract Base Classes
- Includes the specification for customizing
isinstance()andissubclass()behavior through__instancecheck__()and__subclasscheck__(), with motivation for this functionalityin the context of adding Abstract Base Classes (see theabcmodule) to the language.
3.3.5. Emulating callable objects¶
object.__call__(self[,args...])¶Called when the instance is “called” as a function; if this method is defined,
x(arg1,arg2,...)is a shorthand forx.__call__(arg1,arg2,...).
3.3.6. Emulating container types¶
The following methods can be defined to implement container objects. Containersusually are sequences (such as lists or tuples) or mappings (like dictionaries),but can represent other containers as well. The first set of methods is usedeither to emulate a sequence or to emulate a mapping; the difference is that fora sequence, the allowable keys should be the integersk for which0<=k<N whereN is the length of the sequence, or slice objects, which define arange of items. It is also recommended that mappings provide the methodskeys(),values(),items(),get(),clear(),setdefault(),pop(),popitem(),copy(), andupdate() behaving similar to those for Python’s standard dictionaryobjects. Thecollections module provides aMutableMappingabstract base class to help create those methods from a base set of__getitem__(),__setitem__(),__delitem__(), andkeys().Mutable sequences should provide methodsappend(),count(),index(),extend(),insert(),pop(),remove(),reverse() andsort(), like Python standard list objects. Finally,sequence types should implement addition (meaning concatenation) andmultiplication (meaning repetition) by defining the methods__add__(),__radd__(),__iadd__(),__mul__(),__rmul__() and__imul__() described below; they should not define other numericaloperators. It is recommended that both mappings and sequences implement the__contains__() method to allow efficient use of thein operator; formappings,in should search the mapping’s keys; for sequences, it shouldsearch through the values. It is further recommended that both mappings andsequences implement the__iter__() method to allow efficient iterationthrough the container; for mappings,__iter__() should be the same askeys(); for sequences, it should iterate through the values.
object.__len__(self)¶Called to implement the built-in function
len(). Should return the lengthof the object, an integer>=0. Also, an object that doesn’t define a__bool__()method and whose__len__()method returns zero isconsidered to be false in a Boolean context.CPython implementation detail: In CPython, the length is required to be at most
sys.maxsize.If the length is larger thansys.maxsizesome features (such aslen()) may raiseOverflowError. To prevent raisingOverflowErrorby truth value testing, an object must define a__bool__()method.
object.__length_hint__(self)¶Called to implement
operator.length_hint(). Should return an estimatedlength for the object (which may be greater or less than the actual length).The length must be an integer>=0. This method is purely anoptimization and is never required for correctness.New in version 3.4.
Note
Slicing is done exclusively with the following three methods. A call like
a[1:2]=b
is translated to
a[slice(1,2,None)]=b
and so forth. Missing slice items are always filled in withNone.
object.__getitem__(self,key)¶Called to implement evaluation of
self[key]. For sequence types, theaccepted keys should be integers and slice objects. Note that the specialinterpretation of negative indexes (if the class wishes to emulate a sequencetype) is up to the__getitem__()method. Ifkey is of an inappropriatetype,TypeErrormay be raised; if of a value outside the set of indexesfor the sequence (after any special interpretation of negative values),IndexErrorshould be raised. For mapping types, ifkey is missing (notin the container),KeyErrorshould be raised.Note
forloops expect that anIndexErrorwill be raised for illegalindexes to allow proper detection of the end of the sequence.
object.__missing__(self,key)¶Called by
dict.__getitem__()to implementself[key]for dict subclasseswhen key is not in the dictionary.
object.__setitem__(self,key,value)¶Called to implement assignment to
self[key]. Same note as for__getitem__(). This should only be implemented for mappings if theobjects support changes to the values for keys, or if new keys can be added, orfor sequences if elements can be replaced. The same exceptions should be raisedfor improperkey values as for the__getitem__()method.
object.__delitem__(self,key)¶Called to implement deletion of
self[key]. Same note as for__getitem__(). This should only be implemented for mappings if theobjects support removal of keys, or for sequences if elements can be removedfrom the sequence. The same exceptions should be raised for improperkeyvalues as for the__getitem__()method.
object.__iter__(self)¶This method is called when an iterator is required for a container. This methodshould return a new iterator object that can iterate over all the objects in thecontainer. For mappings, it should iterate over the keys of the container.
Iterator objects also need to implement this method; they are required to returnthemselves. For more information on iterator objects, seeIterator Types.
object.__reversed__(self)¶Called (if present) by the
reversed()built-in to implementreverse iteration. It should return a new iterator object that iteratesover all the objects in the container in reverse order.If the
__reversed__()method is not provided, thereversed()built-in will fall back to using the sequence protocol (__len__()and__getitem__()). Objects that support the sequence protocol shouldonly provide__reversed__()if they can provide an implementationthat is more efficient than the one provided byreversed().
The membership test operators (in andnotin) are normallyimplemented as an iteration through a sequence. However, container objects cansupply the following special method with a more efficient implementation, whichalso does not require the object be a sequence.
object.__contains__(self,item)¶Called to implement membership test operators. Should return true ifitemis inself, false otherwise. For mapping objects, this should consider thekeys of the mapping rather than the values or the key-item pairs.
For objects that don’t define
__contains__(), the membership test firsttries iteration via__iter__(), then the old sequence iterationprotocol via__getitem__(), seethis section in the languagereference.
3.3.7. Emulating numeric types¶
The following methods can be defined to emulate numeric objects. Methodscorresponding to operations that are not supported by the particular kind ofnumber implemented (e.g., bitwise operations for non-integral numbers) should beleft undefined.
object.__add__(self,other)¶object.__sub__(self,other)¶object.__mul__(self,other)¶object.__matmul__(self,other)¶object.__truediv__(self,other)¶object.__floordiv__(self,other)¶object.__mod__(self,other)¶object.__divmod__(self,other)¶object.__pow__(self,other[,modulo])¶object.__lshift__(self,other)¶object.__rshift__(self,other)¶object.__and__(self,other)¶object.__xor__(self,other)¶object.__or__(self,other)¶These methods are called to implement the binary arithmetic operations(
+,-,*,@,/,//,%,divmod(),pow(),**,<<,>>,&,^,|). For instance, toevaluate the expressionx+y, wherex is an instance of a class thathas an__add__()method,x.__add__(y)is called. The__divmod__()method should be the equivalent to using__floordiv__()and__mod__(); it should not be related to__truediv__(). Note that__pow__()should be defined to acceptan optional third argument if the ternary version of the built-inpow()function is to be supported.If one of those methods does not support the operation with the suppliedarguments, it should return
NotImplemented.
object.__radd__(self,other)¶object.__rsub__(self,other)¶object.__rmul__(self,other)¶object.__rmatmul__(self,other)¶object.__rtruediv__(self,other)¶object.__rfloordiv__(self,other)¶object.__rmod__(self,other)¶object.__rdivmod__(self,other)¶object.__rpow__(self,other)¶object.__rlshift__(self,other)¶object.__rrshift__(self,other)¶object.__rand__(self,other)¶object.__rxor__(self,other)¶object.__ror__(self,other)¶These methods are called to implement the binary arithmetic operations(
+,-,*,@,/,//,%,divmod(),pow(),**,<<,>>,&,^,|) with reflected(swapped) operands. These functions are only called if the left operand doesnot support the corresponding operation and the operands are of differenttypes.[2] For instance, to evaluate the expressionx-y, wherey isan instance of a class that has an__rsub__()method,y.__rsub__(x)is called ifx.__sub__(y)returnsNotImplemented.Note that ternary
pow()will not try calling__rpow__()(thecoercion rules would become too complicated).Note
If the right operand’s type is a subclass of the left operand’s type and thatsubclass provides the reflected method for the operation, this method will becalled before the left operand’s non-reflected method. This behavior allowssubclasses to override their ancestors’ operations.
object.__iadd__(self,other)¶object.__isub__(self,other)¶object.__imul__(self,other)¶object.__imatmul__(self,other)¶object.__itruediv__(self,other)¶object.__ifloordiv__(self,other)¶object.__imod__(self,other)¶object.__ipow__(self,other[,modulo])¶object.__ilshift__(self,other)¶object.__irshift__(self,other)¶object.__iand__(self,other)¶object.__ixor__(self,other)¶object.__ior__(self,other)¶These methods are called to implement the augmented arithmetic assignments(
+=,-=,*=,@=,/=,//=,%=,**=,<<=,>>=,&=,^=,|=). These methods should attempt to do theoperation in-place (modifyingself) and return the result (which could be,but does not have to be,self). If a specific method is not defined, theaugmented assignment falls back to the normal methods. For instance, ifxis an instance of a class with an__iadd__()method,x+=yisequivalent tox=x.__iadd__(y). Otherwise,x.__add__(y)andy.__radd__(x)are considered, as with the evaluation ofx+y. Incertain situations, augmented assignment can result in unexpected errors (seeWhy does a_tuple[i] += [‘item’] raise an exception when the addition works?), but this behavior is in factpart of the data model.
object.__neg__(self)¶object.__pos__(self)¶object.__abs__(self)¶object.__invert__(self)¶Called to implement the unary arithmetic operations (
-,+,abs()and~).
object.__complex__(self)¶object.__int__(self)¶object.__float__(self)¶object.__round__(self[,n])¶Called to implement the built-in functions
complex(),int(),float()andround(). Should return a valueof the appropriate type.
object.__index__(self)¶Called to implement
operator.index(), and whenever Python needs tolosslessly convert the numeric object to an integer object (such as inslicing, or in the built-inbin(),hex()andoct()functions). Presence of this method indicates that the numeric object isan integer type. Must return an integer.Note
In order to have a coherent integer type class, when
__index__()isdefined__int__()should also be defined, and both should returnthe same value.
3.3.8. With Statement Context Managers¶
Acontext manager is an object that defines the runtime context to beestablished when executing awith statement. The context managerhandles the entry into, and the exit from, the desired runtime context for theexecution of the block of code. Context managers are normally invoked using thewith statement (described in sectionThe with statement), but can also beused by directly invoking their methods.
Typical uses of context managers include saving and restoring various kinds ofglobal state, locking and unlocking resources, closing opened files, etc.
For more information on context managers, seeContext Manager Types.
object.__enter__(self)¶Enter the runtime context related to this object. The
withstatementwill bind this method’s return value to the target(s) specified in theasclause of the statement, if any.
object.__exit__(self,exc_type,exc_value,traceback)¶Exit the runtime context related to this object. The parameters describe theexception that caused the context to be exited. If the context was exitedwithout an exception, all three arguments will be
None.If an exception is supplied, and the method wishes to suppress the exception(i.e., prevent it from being propagated), it should return a true value.Otherwise, the exception will be processed normally upon exit from this method.
Note that
__exit__()methods should not reraise the passed-in exception;this is the caller’s responsibility.
3.3.9. Special method lookup¶
For custom classes, implicit invocations of special methods are only guaranteedto work correctly if defined on an object’s type, not in the object’s instancedictionary. That behaviour is the reason why the following code raises anexception:
>>>classC:...pass...>>>c=C()>>>c.__len__=lambda:5>>>len(c)Traceback (most recent call last): File"<stdin>", line1, in<module>TypeError:object of type 'C' has no len()
The rationale behind this behaviour lies with a number of special methods suchas__hash__() and__repr__() that are implemented by all objects,including type objects. If the implicit lookup of these methods used theconventional lookup process, they would fail when invoked on the type objectitself:
>>>1.__hash__()==hash(1)True>>>int.__hash__()==hash(int)Traceback (most recent call last): File"<stdin>", line1, in<module>TypeError:descriptor '__hash__' of 'int' object needs an argument
Incorrectly attempting to invoke an unbound method of a class in this way issometimes referred to as ‘metaclass confusion’, and is avoided by bypassingthe instance when looking up special methods:
>>>type(1).__hash__(1)==hash(1)True>>>type(int).__hash__(int)==hash(int)True
In addition to bypassing any instance attributes in the interest ofcorrectness, implicit special method lookup generally also bypasses the__getattribute__() method even of the object’s metaclass:
>>>classMeta(type):...def__getattribute__(*args):...print("Metaclass getattribute invoked")...returntype.__getattribute__(*args)...>>>classC(object,metaclass=Meta):...def__len__(self):...return10...def__getattribute__(*args):...print("Class getattribute invoked")...returnobject.__getattribute__(*args)...>>>c=C()>>>c.__len__()# Explicit lookup via instanceClass getattribute invoked10>>>type(c).__len__(c)# Explicit lookup via typeMetaclass getattribute invoked10>>>len(c)# Implicit lookup10
Bypassing the__getattribute__() machinery in this fashionprovides significant scope for speed optimisations within theinterpreter, at the cost of some flexibility in the handling ofspecial methods (the special methodmust be set on the classobject itself in order to be consistently invoked by the interpreter).
3.4. Coroutines¶
3.4.1. Awaitable Objects¶
Anawaitable object generally implements an__await__() method.Coroutine objects returned fromasyncdef functionsare awaitable.
Note
Thegenerator iterator objects returned from generatorsdecorated withtypes.coroutine() orasyncio.coroutine()are also awaitable, but they do not implement__await__().
object.__await__(self)¶Must return aniterator. Should be used to implementawaitable objects. For instance,
asyncio.Futureimplementsthis method to be compatible with theawaitexpression.
New in version 3.5.
See also
PEP 492 for additional information about awaitable objects.
3.4.2. Coroutine Objects¶
Coroutine objects areawaitable objects.A coroutine’s execution can be controlled by calling__await__() anditerating over the result. When the coroutine has finished executing andreturns, the iterator raisesStopIteration, and the exception’svalue attribute holds the return value. If thecoroutine raises an exception, it is propagated by the iterator. Coroutinesshould not directly raise unhandledStopIteration exceptions.
Coroutines also have the methods listed below, which are analogous tothose of generators (seeGenerator-iterator methods). However, unlikegenerators, coroutines do not directly support iteration.
Changed in version 3.5.2:It is aRuntimeError to await on a coroutine more than once.
coroutine.send(value)¶Starts or resumes execution of the coroutine. Ifvalue is
None,this is equivalent to advancing the iterator returned by__await__(). Ifvalue is notNone, this method delegatesto thesend()method of the iterator that causedthe coroutine to suspend. The result (return value,StopIteration, or other exception) is the same as wheniterating over the__await__()return value, described above.
coroutine.throw(type[,value[,traceback]])¶Raises the specified exception in the coroutine. This method delegatesto the
throw()method of the iterator that causedthe coroutine to suspend, if it has such a method. Otherwise,the exception is raised at the suspension point. The result(return value,StopIteration, or other exception) is the same aswhen iterating over the__await__()return value, describedabove. If the exception is not caught in the coroutine, it propagatesback to the caller.
coroutine.close()¶Causes the coroutine to clean itself up and exit. If the coroutineis suspended, this method first delegates to the
close()method of the iterator that caused the coroutine to suspend, if ithas such a method. Then it raisesGeneratorExitat thesuspension point, causing the coroutine to immediately clean itself up.Finally, the coroutine is marked as having finished executing, even ifit was never started.Coroutine objects are automatically closed using the above process whenthey are about to be destroyed.
3.4.3. Asynchronous Iterators¶
Anasynchronous iterable is able to call asynchronous code in its__aiter__ implementation, and anasynchronous iterator can callasynchronous code in its__anext__ method.
Asynchronous iterators can be used in anasyncfor statement.
object.__aiter__(self)¶Must return anasynchronous iterator object.
object.__anext__(self)¶Must return anawaitable resulting in a next value of the iterator. Shouldraise a
StopAsyncIterationerror when the iteration is over.
An example of an asynchronous iterable object:
classReader:asyncdefreadline(self):...def__aiter__(self):returnselfasyncdef__anext__(self):val=awaitself.readline()ifval==b'':raiseStopAsyncIterationreturnval
New in version 3.5.
Note
Changed in version 3.5.2:Starting with CPython 3.5.2,__aiter__ can directly returnasynchronous iterators. Returninganawaitable object will result in aPendingDeprecationWarning.
The recommended way of writing backwards compatible code inCPython 3.5.x is to continue returning awaitables from__aiter__. If you want to avoid the PendingDeprecationWarningand keep the code backwards compatible, the following decoratorcan be used:
importfunctoolsimportsysifsys.version_info<(3,5,2):defaiter_compat(func):@functools.wraps(func)asyncdefwrapper(self):returnfunc(self)returnwrapperelse:defaiter_compat(func):returnfunc
Example:
classAsyncIterator:@aiter_compatdef__aiter__(self):returnselfasyncdef__anext__(self):...
Starting with CPython 3.6, thePendingDeprecationWarningwill be replaced with theDeprecationWarning.In CPython 3.7, returning an awaitable from__aiter__ willresult in aRuntimeError.
3.4.4. Asynchronous Context Managers¶
Anasynchronous context manager is acontext manager that is able tosuspend execution in its__aenter__ and__aexit__ methods.
Asynchronous context managers can be used in anasyncwith statement.
object.__aenter__(self)¶This method is semantically similar to the
__enter__(), with onlydifference that it must return anawaitable.
object.__aexit__(self,exc_type,exc_value,traceback)¶This method is semantically similar to the
__exit__(), with onlydifference that it must return anawaitable.
An example of an asynchronous context manager class:
classAsyncContextManager:asyncdef__aenter__(self):awaitlog('entering context')asyncdef__aexit__(self,exc_type,exc,tb):awaitlog('exiting context')
New in version 3.5.
Footnotes
| [1] | Itis possible in some cases to change an object’s type, under certaincontrolled conditions. It generally isn’t a good idea though, since it canlead to some very strange behaviour if it is handled incorrectly. |
| [2] | For operands of the same type, it is assumed that if the non-reflected method(such as__add__()) fails the operation is not supported, which is why thereflected method is not called. |
