Initialization, Finalization, and Threads¶
See alsoPython Initialization Configuration.
Before Python Initialization¶
In an application embedding Python, thePy_Initialize()
function mustbe called before using any other Python/C API functions; with the exception ofa few functions and theglobal configuration variables.
The following functions can be safely called before Python is initialized:
Configuration functions:
Informative functions:
Utilities:
Memory allocators:
Note
The following functionsshould not be called beforePy_Initialize()
:Py_EncodeLocale()
,Py_GetPath()
,Py_GetPrefix()
,Py_GetExecPrefix()
,Py_GetProgramFullPath()
,Py_GetPythonHome()
,Py_GetProgramName()
andPyEval_InitThreads()
.
Global configuration variables¶
Python has variables for the global configuration to control different featuresand options. By default, these flags are controlled bycommand lineoptions.
When a flag is set by an option, the value of the flag is the number of timesthat the option was set. For example,-b
setsPy_BytesWarningFlag
to 1 and-bb
setsPy_BytesWarningFlag
to 2.
- int
Py_BytesWarningFlag
¶ Issue a warning when comparing
bytes
orbytearray
withstr
orbytes
withint
. Issue an error if greateror equal to2
.Set by the
-b
option.
- int
Py_DebugFlag
¶ Turn on parser debugging output (for expert only, depending on compilationoptions).
Set by the
-d
option and thePYTHONDEBUG
environmentvariable.
- int
Py_DontWriteBytecodeFlag
¶ If set to non-zero, Python won’t try to write
.pyc
files on theimport of source modules.Set by the
-B
option and thePYTHONDONTWRITEBYTECODE
environment variable.
- int
Py_FrozenFlag
¶ Suppress error messages when calculating the module search path in
Py_GetPath()
.Private flag used by
_freeze_importlib
andfrozenmain
programs.
- int
Py_HashRandomizationFlag
¶ Set to
1
if thePYTHONHASHSEED
environment variable is set toa non-empty string.If the flag is non-zero, read the
PYTHONHASHSEED
environmentvariable to initialize the secret hash seed.
- int
Py_IgnoreEnvironmentFlag
¶ Ignore all
PYTHON*
environment variables, e.g.PYTHONPATH
andPYTHONHOME
, that might be set.
- int
Py_InspectFlag
¶ When a script is passed as first argument or the
-c
option is used,enter interactive mode after executing the script or the command, even whensys.stdin
does not appear to be a terminal.Set by the
-i
option and thePYTHONINSPECT
environmentvariable.
- int
Py_IsolatedFlag
¶ Run Python in isolated mode. In isolated mode
sys.path
containsneither the script’s directory nor the user’s site-packages directory.Set by the
-I
option.New in version 3.4.
- int
Py_LegacyWindowsFSEncodingFlag
¶ If the flag is non-zero, use the
mbcs
encoding instead of the UTF-8encoding for the filesystem encoding.Set to
1
if thePYTHONLEGACYWINDOWSFSENCODING
environmentvariable is set to a non-empty string.SeePEP 529 for more details.
Availability: Windows.
- int
Py_LegacyWindowsStdioFlag
¶ If the flag is non-zero, use
io.FileIO
instead ofWindowsConsoleIO
forsys
standard streams.Set to
1
if thePYTHONLEGACYWINDOWSSTDIO
environmentvariable is set to a non-empty string.SeePEP 528 for more details.
Availability: Windows.
- int
Py_NoSiteFlag
¶ Disable the import of the module
site
and the site-dependentmanipulations ofsys.path
that it entails. Also disable thesemanipulations ifsite
is explicitly imported later (callsite.main()
if you want them to be triggered).Set by the
-S
option.
- int
Py_NoUserSiteDirectory
¶ Don’t add the
usersite-packagesdirectory
tosys.path
.Set by the
-s
and-I
options, and thePYTHONNOUSERSITE
environment variable.
- int
Py_OptimizeFlag
¶ Set by the
-O
option and thePYTHONOPTIMIZE
environmentvariable.
- int
Py_QuietFlag
¶ Don’t display the copyright and version messages even in interactive mode.
Set by the
-q
option.New in version 3.2.
- int
Py_UnbufferedStdioFlag
¶ Force the stdout and stderr streams to be unbuffered.
Set by the
-u
option and thePYTHONUNBUFFERED
environment variable.
- int
Py_VerboseFlag
¶ Print a message each time a module is initialized, showing the place(filename or built-in module) from which it is loaded. If greater or equalto
2
, print a message for each file that is checked for whensearching for a module. Also provides information on module cleanup at exit.Set by the
-v
option and thePYTHONVERBOSE
environmentvariable.
Initializing and finalizing the interpreter¶
- void
Py_Initialize
()¶ Initialize the Python interpreter. In an application embedding Python,this should be called before using any other Python/C API functions; seeBefore Python Initialization for the few exceptions.
This initializesthe table of loaded modules (
sys.modules
), and creates the fundamentalmodulesbuiltins
,__main__
andsys
. It also initializesthe module search path (sys.path
). It does not setsys.argv
; usePySys_SetArgvEx()
for that. This is a no-op when called for a second time(without callingPy_FinalizeEx()
first). There is no return value; it is afatal error if the initialization fails.Note
On Windows, changes the console mode from
O_TEXT
toO_BINARY
, which willalso affect non-Python uses of the console using the C Runtime.
- void
Py_InitializeEx
(int initsigs)¶ This function works like
Py_Initialize()
ifinitsigs is1
. Ifinitsigs is0
, it skips initialization registration of signal handlers, whichmight be useful when Python is embedded.
- int
Py_IsInitialized
()¶ Return true (nonzero) when the Python interpreter has been initialized, false(zero) if not. After
Py_FinalizeEx()
is called, this returns false untilPy_Initialize()
is called again.
- int
Py_FinalizeEx
()¶ Undo all initializations made by
Py_Initialize()
and subsequent use ofPython/C API functions, and destroy all sub-interpreters (seePy_NewInterpreter()
below) that were created and not yet destroyed sincethe last call toPy_Initialize()
. Ideally, this frees all memoryallocated by the Python interpreter. This is a no-op when called for a secondtime (without callingPy_Initialize()
again first). Normally thereturn value is0
. If there were errors during finalization(flushing buffered data),-1
is returned.This function is provided for a number of reasons. An embedding applicationmight want to restart Python without having to restart the application itself.An application that has loaded the Python interpreter from a dynamicallyloadable library (or DLL) might want to free all memory allocated by Pythonbefore unloading the DLL. During a hunt for memory leaks in an application adeveloper might want to free all memory allocated by Python before exiting fromthe application.
Bugs and caveats: The destruction of modules and objects in modules is donein random order; this may cause destructors (
__del__()
methods) to failwhen they depend on other objects (even functions) or modules. Dynamicallyloaded extension modules loaded by Python are not unloaded. Small amounts ofmemory allocated by the Python interpreter may not be freed (if you find a leak,please report it). Memory tied up in circular references between objects is notfreed. Some memory allocated by extension modules may not be freed. Someextensions may not work properly if their initialization routine is called morethan once; this can happen if an application callsPy_Initialize()
andPy_FinalizeEx()
more than once.Raises anauditing event
cpython._PySys_ClearAuditHooks
with no arguments.New in version 3.6.
- void
Py_Finalize
()¶ This is a backwards-compatible version of
Py_FinalizeEx()
thatdisregards the return value.
Process-wide parameters¶
- int
Py_SetStandardStreamEncoding
(const char *encoding, const char *errors)¶ This function should be called before
Py_Initialize()
, if it iscalled at all. It specifies which encoding and error handling to usewith standard IO, with the same meanings as instr.encode()
.It overrides
PYTHONIOENCODING
values, and allows embedding codeto control IO encoding when the environment variable does not work.encoding and/orerrors may be
NULL
to usePYTHONIOENCODING
and/or default values (depending on othersettings).Note that
sys.stderr
always uses the “backslashreplace” errorhandler, regardless of this (or any other) setting.If
Py_FinalizeEx()
is called, this function will need to be calledagain in order to affect subsequent calls toPy_Initialize()
.Returns
0
if successful, a nonzero value on error (e.g. calling after theinterpreter has already been initialized).New in version 3.4.
- void
Py_SetProgramName
(const wchar_t *name)¶ This function should be called before
Py_Initialize()
is called forthe first time, if it is called at all. It tells the interpreter the valueof theargv[0]
argument to themain()
function of the program(converted to wide characters).This is used byPy_GetPath()
and some other functions below to findthe Python run-time libraries relative to the interpreter executable. Thedefault value is'python'
. The argument should point to azero-terminated wide character string in static storage whose contents will notchange for the duration of the program’s execution. No code in the Pythoninterpreter will change the contents of this storage.Use
Py_DecodeLocale()
to decode a bytes string to get awchar_*
string.
- wchar*
Py_GetProgramName
()¶ Return the program name set with
Py_SetProgramName()
, or the default.The returned string points into static storage; the caller should not modify itsvalue.
- wchar_t*
Py_GetPrefix
()¶ Return theprefix for installed platform-independent files. This is derivedthrough a number of complicated rules from the program name set with
Py_SetProgramName()
and some environment variables; for example, if theprogram name is'/usr/local/bin/python'
, the prefix is'/usr/local'
. Thereturned string points into static storage; the caller should not modify itsvalue. This corresponds to theprefix variable in the top-levelMakefile
and the--prefix
argument to theconfigurescript at build time. The value is available to Python code assys.prefix
.It is only useful on Unix. See also the next function.
- wchar_t*
Py_GetExecPrefix
()¶ Return theexec-prefix for installed platform-dependent files. This isderived through a number of complicated rules from the program name set with
Py_SetProgramName()
and some environment variables; for example, if theprogram name is'/usr/local/bin/python'
, the exec-prefix is'/usr/local'
. The returned string points into static storage; the callershould not modify its value. This corresponds to theexec_prefixvariable in the top-levelMakefile
and the--exec-prefix
argument to theconfigure script at build time. The value isavailable to Python code assys.exec_prefix
. It is only useful on Unix.Background: The exec-prefix differs from the prefix when platform dependentfiles (such as executables and shared libraries) are installed in a differentdirectory tree. In a typical installation, platform dependent files may beinstalled in the
/usr/local/plat
subtree while platform independent maybe installed in/usr/local
.Generally speaking, a platform is a combination of hardware and softwarefamilies, e.g. Sparc machines running the Solaris 2.x operating system areconsidered the same platform, but Intel machines running Solaris 2.x are anotherplatform, and Intel machines running Linux are yet another platform. Differentmajor revisions of the same operating system generally also form differentplatforms. Non-Unix operating systems are a different story; the installationstrategies on those systems are so different that the prefix and exec-prefix aremeaningless, and set to the empty string. Note that compiled Python bytecodefiles are platform independent (but not independent from the Python version bywhich they were compiled!).
System administrators will know how to configure themount orautomount programs to share
/usr/local
between platformswhile having/usr/local/plat
be a different filesystem for eachplatform.
- wchar_t*
Py_GetProgramFullPath
()¶ Return the full program name of the Python executable; this is computed as aside-effect of deriving the default module search path from the program name(set by
Py_SetProgramName()
above). The returned string points intostatic storage; the caller should not modify its value. The value is availableto Python code assys.executable
.
- wchar_t*
Py_GetPath
()¶ Return the default module search path; this is computed from the program name(set by
Py_SetProgramName()
above) and some environment variables.The returned string consists of a series of directory names separated by aplatform dependent delimiter character. The delimiter character is':'
on Unix and Mac OS X,';'
on Windows. The returned string points intostatic storage; the caller should not modify its value. The listsys.path
is initialized with this value on interpreter startup; itcan be (and usually is) modified later to change the search path for loadingmodules.
- void
Py_SetPath
(const wchar_t *)¶ Set the default module search path. If this function is called before
Py_Initialize()
, thenPy_GetPath()
won’t attempt to compute adefault search path but uses the one provided instead. This is useful ifPython is embedded by an application that has full knowledge of the locationof all modules. The path components should be separated by the platformdependent delimiter character, which is':'
on Unix and Mac OS X,';'
on Windows.This also causes
sys.executable
to be set to the programfull path (seePy_GetProgramFullPath()
) and forsys.prefix
andsys.exec_prefix
to be empty. It is up to the caller to modify theseif required after callingPy_Initialize()
.Use
Py_DecodeLocale()
to decode a bytes string to get awchar_*
string.The path argument is copied internally, so the caller may free it after thecall completes.
Changed in version 3.8:The program full path is now used for
sys.executable
, insteadof the program name.
- const char*
Py_GetVersion
()¶ Return the version of this Python interpreter. This is a string that lookssomething like
"3.0a5+ (py3k:63103M, May 12 2008, 00:53:55)\n[GCC 4.2.3]"
The first word (up to the first space character) is the current Python version;the first three characters are the major and minor version separated by aperiod. The returned string points into static storage; the caller should notmodify its value. The value is available to Python code as
sys.version
.
- const char*
Py_GetPlatform
()¶ Return the platform identifier for the current platform. On Unix, this isformed from the “official” name of the operating system, converted to lowercase, followed by the major revision number; e.g., for Solaris 2.x, which isalso known as SunOS 5.x, the value is
'sunos5'
. On Mac OS X, it is'darwin'
. On Windows, it is'win'
. The returned string points intostatic storage; the caller should not modify its value. The value is availableto Python code assys.platform
.
- const char*
Py_GetCopyright
()¶ Return the official copyright string for the current Python version, for example
'Copyright1991-1995StichtingMathematischCentrum,Amsterdam'
The returned string points into static storage; the caller should not modify itsvalue. The value is available to Python code as
sys.copyright
.
- const char*
Py_GetCompiler
()¶ Return an indication of the compiler used to build the current Python version,in square brackets, for example:
"[GCC 2.7.2.2]"
The returned string points into static storage; the caller should not modify itsvalue. The value is available to Python code as part of the variable
sys.version
.
- const char*
Py_GetBuildInfo
()¶ Return information about the sequence number and build date and time of thecurrent Python interpreter instance, for example
"#67, Aug 1 1997, 22:34:28"
The returned string points into static storage; the caller should not modify itsvalue. The value is available to Python code as part of the variable
sys.version
.
- void
PySys_SetArgvEx
(int argc, wchar_t **argv, int updatepath)¶ Set
sys.argv
based onargc andargv. These parameters aresimilar to those passed to the program’smain()
function with thedifference that the first entry should refer to the script file to beexecuted rather than the executable hosting the Python interpreter. If thereisn’t a script that will be run, the first entry inargv can be an emptystring. If this function fails to initializesys.argv
, a fatalcondition is signalled usingPy_FatalError()
.Ifupdatepath is zero, this is all the function does. Ifupdatepathis non-zero, the function also modifies
sys.path
according to thefollowing algorithm:If the name of an existing script is passed in
argv[0]
, the absolutepath of the directory where the script is located is prepended tosys.path
.Otherwise (that is, ifargc is
0
orargv[0]
doesn’t pointto an existing file name), an empty string is prepended tosys.path
, which is the same as prepending the current workingdirectory ("."
).
Use
Py_DecodeLocale()
to decode a bytes string to get awchar_*
string.Note
It is recommended that applications embedding the Python interpreterfor purposes other than executing a single script pass
0
asupdatepath,and updatesys.path
themselves if desired.SeeCVE-2008-5983.On versions before 3.1.3, you can achieve the same effect by manuallypopping the first
sys.path
element after having calledPySys_SetArgv()
, for example using:PyRun_SimpleString("import sys; sys.path.pop(0)\n");
New in version 3.1.3.
- void
PySys_SetArgv
(int argc, wchar_t **argv)¶ This function works like
PySys_SetArgvEx()
withupdatepath setto1
unless thepython interpreter was started with the-I
.Use
Py_DecodeLocale()
to decode a bytes string to get awchar_*
string.Changed in version 3.4:Theupdatepath value depends on
-I
.
- void
Py_SetPythonHome
(const wchar_t *home)¶ Set the default “home” directory, that is, the location of the standardPython libraries. See
PYTHONHOME
for the meaning of theargument string.The argument should point to a zero-terminated character string in staticstorage whose contents will not change for the duration of the program’sexecution. No code in the Python interpreter will change the contents ofthis storage.
Use
Py_DecodeLocale()
to decode a bytes string to get awchar_*
string.
- w_char*
Py_GetPythonHome
()¶ Return the default “home”, that is, the value set by a previous call to
Py_SetPythonHome()
, or the value of thePYTHONHOME
environment variable if it is set.
Thread State and the Global Interpreter Lock¶
The Python interpreter is not fully thread-safe. In order to supportmulti-threaded Python programs, there’s a global lock, called theglobalinterpreter lock orGIL, that must be held by the current thread beforeit can safely access Python objects. Without the lock, even the simplestoperations could cause problems in a multi-threaded program: for example, whentwo threads simultaneously increment the reference count of the same object, thereference count could end up being incremented only once instead of twice.
Therefore, the rule exists that only the thread that has acquired theGIL may operate on Python objects or call Python/C API functions.In order to emulate concurrency of execution, the interpreter regularlytries to switch threads (seesys.setswitchinterval()
). The lock is alsoreleased around potentially blocking I/O operations like reading or writinga file, so that other Python threads can run in the meantime.
The Python interpreter keeps some thread-specific bookkeeping informationinside a data structure calledPyThreadState
. There’s also oneglobal variable pointing to the currentPyThreadState
: it canbe retrieved usingPyThreadState_Get()
.
Releasing the GIL from extension code¶
Most extension code manipulating theGIL has the following simplestructure:
Savethethreadstateinalocalvariable.Releasetheglobalinterpreterlock....DosomeblockingI/Ooperation...Reacquiretheglobalinterpreterlock.Restorethethreadstatefromthelocalvariable.
This is so common that a pair of macros exists to simplify it:
Py_BEGIN_ALLOW_THREADS...DosomeblockingI/Ooperation...Py_END_ALLOW_THREADS
ThePy_BEGIN_ALLOW_THREADS
macro opens a new block and declares ahidden local variable; thePy_END_ALLOW_THREADS
macro closes theblock.
The block above expands to the following code:
PyThreadState*_save;_save=PyEval_SaveThread();...DosomeblockingI/Ooperation...PyEval_RestoreThread(_save);
Here is how these functions work: the global interpreter lock is used to protect the pointer to thecurrent thread state. When releasing the lock and saving the thread state,the current thread state pointer must be retrieved before the lock is released(since another thread could immediately acquire the lock and store its own threadstate in the global variable). Conversely, when acquiring the lock and restoringthe thread state, the lock must be acquired before storing the thread statepointer.
Note
Calling system I/O functions is the most common use case for releasingthe GIL, but it can also be useful before calling long-running computationswhich don’t need access to Python objects, such as compression orcryptographic functions operating over memory buffers. For example, thestandardzlib
andhashlib
modules release the GIL whencompressing or hashing data.
Non-Python created threads¶
When threads are created using the dedicated Python APIs (such as thethreading
module), a thread state is automatically associated to themand the code showed above is therefore correct. However, when threads arecreated from C (for example by a third-party library with its own threadmanagement), they don’t hold the GIL, nor is there a thread state structurefor them.
If you need to call Python code from these threads (often this will be partof a callback API provided by the aforementioned third-party library),you must first register these threads with the interpreter bycreating a thread state data structure, then acquiring the GIL, and finallystoring their thread state pointer, before you can start using the Python/CAPI. When you are done, you should reset the thread state pointer, releasethe GIL, and finally free the thread state data structure.
ThePyGILState_Ensure()
andPyGILState_Release()
functions doall of the above automatically. The typical idiom for calling into Pythonfrom a C thread is:
PyGILState_STATEgstate;gstate=PyGILState_Ensure();/* Perform Python actions here. */result=CallSomeFunction();/* evaluate result or handle exception *//* Release the thread. No Python API allowed beyond this point. */PyGILState_Release(gstate);
Note that thePyGILState_*()
functions assume there is only one globalinterpreter (created automatically byPy_Initialize()
). Pythonsupports the creation of additional interpreters (usingPy_NewInterpreter()
), but mixing multiple interpreters and thePyGILState_*()
API is unsupported.
Cautions about fork()¶
Another important thing to note about threads is their behaviour in the faceof the Cfork()
call. On most systems withfork()
, after aprocess forks only the thread that issued the fork will exist. This has aconcrete impact both on how locks must be handled and on all stored statein CPython’s runtime.
The fact that only the “current” thread remainsmeans any locks held by other threads will never be released. Python solvesthis foros.fork()
by acquiring the locks it uses internally beforethe fork, and releasing them afterwards. In addition, it resets anyLock Objects in the child. When extending or embedding Python, thereis no way to inform Python of additional (non-Python) locks that need to beacquired before or reset after a fork. OS facilities such aspthread_atfork()
would need to be used to accomplish the same thing.Additionally, when extending or embedding Python, callingfork()
directly rather than throughos.fork()
(and returning to or callinginto Python) may result in a deadlock by one of Python’s internal locksbeing held by a thread that is defunct after the fork.PyOS_AfterFork_Child()
tries to reset the necessary locks, but is notalways able to.
The fact that all other threads go away also means that CPython’sruntime state there must be cleaned up properly, whichos.fork()
does. This means finalizing all otherPyThreadState
objectsbelonging to the current interpreter and all otherPyInterpreterState
objects. Due to this and the specialnature of the“main” interpreter,fork()
should only be called in that interpreter’s “main”thread, where the CPython global runtime was originally initialized.The only exception is ifexec()
will be called immediatelyafter.
High-level API¶
These are the most commonly used types and functions when writing C extensioncode, or when embedding the Python interpreter:
PyInterpreterState
¶This data structure represents the state shared by a number of cooperatingthreads. Threads belonging to the same interpreter share their moduleadministration and a few other internal items. There are no public members inthis structure.
Threads belonging to different interpreters initially share nothing, exceptprocess state like available memory, open file descriptors and such. The globalinterpreter lock is also shared by all threads, regardless of to whichinterpreter they belong.
PyThreadState
¶This data structure represents the state of a single thread. The only publicdata member is
interp
(PyInterpreterState*
), which points tothis thread’s interpreter state.
- void
PyEval_InitThreads
()¶ Initialize and acquire the global interpreter lock. It should be called in themain thread before creating a second thread or engaging in any other threadoperations such as
PyEval_ReleaseThread(tstate)
. It is not needed beforecallingPyEval_SaveThread()
orPyEval_RestoreThread()
.This is a no-op when called for a second time.
Changed in version 3.7:This function is now called by
Py_Initialize()
, so you don’thave to call it yourself anymore.Changed in version 3.2:This function cannot be called before
Py_Initialize()
anymore.
- int
PyEval_ThreadsInitialized
()¶ Returns a non-zero value if
PyEval_InitThreads()
has been called. Thisfunction can be called without holding the GIL, and therefore can be used toavoid calls to the locking API when running single-threaded.Changed in version 3.7:TheGIL is now initialized by
Py_Initialize()
.
- PyThreadState*
PyEval_SaveThread
()¶ Release the global interpreter lock (if it has been created) and reset thethread state to
NULL
, returning the previous thread state (which is notNULL
). If the lock has been created, the current thread must haveacquired it.
- void
PyEval_RestoreThread
(PyThreadState *tstate)¶ Acquire the global interpreter lock (if it has been created) and set thethread state totstate, which must not be
NULL
. If the lock has beencreated, the current thread must not have acquired it, otherwise deadlockensues.Note
Calling this function from a thread when the runtime is finalizingwill terminate the thread, even if the thread was not created by Python.You can use
_Py_IsFinalizing()
orsys.is_finalizing()
tocheck if the interpreter is in process of being finalized before callingthis function to avoid unwanted termination.
- PyThreadState*
PyThreadState_Get
()¶ Return the current thread state. The global interpreter lock must be held.When the current thread state is
NULL
, this issues a fatal error (so thatthe caller needn’t check forNULL
).
- PyThreadState*
PyThreadState_Swap
(PyThreadState *tstate)¶ Swap the current thread state with the thread state given by the argumenttstate, which may be
NULL
. The global interpreter lock must be heldand is not released.
The following functions use thread-local storage, and are not compatiblewith sub-interpreters:
- PyGILState_STATE
PyGILState_Ensure
()¶ Ensure that the current thread is ready to call the Python C API regardlessof the current state of Python, or of the global interpreter lock. This maybe called as many times as desired by a thread as long as each call ismatched with a call to
PyGILState_Release()
. In general, otherthread-related APIs may be used betweenPyGILState_Ensure()
andPyGILState_Release()
calls as long as the thread state is restored toits previous state before the Release(). For example, normal usage of thePy_BEGIN_ALLOW_THREADS
andPy_END_ALLOW_THREADS
macros isacceptable.The return value is an opaque “handle” to the thread state when
PyGILState_Ensure()
was called, and must be passed toPyGILState_Release()
to ensure Python is left in the same state. Eventhough recursive calls are allowed, these handlescannot be shared - eachunique call toPyGILState_Ensure()
must save the handle for its calltoPyGILState_Release()
.When the function returns, the current thread will hold the GIL and be ableto call arbitrary Python code. Failure is a fatal error.
Note
Calling this function from a thread when the runtime is finalizingwill terminate the thread, even if the thread was not created by Python.You can use
_Py_IsFinalizing()
orsys.is_finalizing()
tocheck if the interpreter is in process of being finalized before callingthis function to avoid unwanted termination.
- void
PyGILState_Release
(PyGILState_STATE)¶ Release any resources previously acquired. After this call, Python’s state willbe the same as it was prior to the corresponding
PyGILState_Ensure()
call(but generally this state will be unknown to the caller, hence the use of theGILState API).Every call to
PyGILState_Ensure()
must be matched by a call toPyGILState_Release()
on the same thread.
- PyThreadState*
PyGILState_GetThisThreadState
()¶ Get the current thread state for this thread. May return
NULL
if noGILState API has been used on the current thread. Note that the main threadalways has such a thread-state, even if no auto-thread-state call has beenmade on the main thread. This is mainly a helper/diagnostic function.
- int
PyGILState_Check
()¶ Return
1
if the current thread is holding the GIL and0
otherwise.This function can be called from any thread at any time.Only if it has had its Python thread state initialized and currently isholding the GIL will it return1
.This is mainly a helper/diagnostic function. It can be usefulfor example in callback contexts or memory allocation functions whenknowing that the GIL is locked can allow the caller to perform sensitiveactions or otherwise behave differently.New in version 3.4.
The following macros are normally used without a trailing semicolon; look forexample usage in the Python source distribution.
Py_BEGIN_ALLOW_THREADS
¶This macro expands to
{PyThreadState*_save;_save=PyEval_SaveThread();
.Note that it contains an opening brace; it must be matched with a followingPy_END_ALLOW_THREADS
macro. See above for further discussion of thismacro.
Py_END_ALLOW_THREADS
¶This macro expands to
PyEval_RestoreThread(_save);}
. Note that it containsa closing brace; it must be matched with an earlierPy_BEGIN_ALLOW_THREADS
macro. See above for further discussion ofthis macro.
Py_BLOCK_THREADS
¶This macro expands to
PyEval_RestoreThread(_save);
: it is equivalent toPy_END_ALLOW_THREADS
without the closing brace.
Py_UNBLOCK_THREADS
¶This macro expands to
_save=PyEval_SaveThread();
: it is equivalent toPy_BEGIN_ALLOW_THREADS
without the opening brace and variabledeclaration.
Low-level API¶
All of the following functions must be called afterPy_Initialize()
.
Changed in version 3.7:Py_Initialize()
now initializes theGIL.
- PyInterpreterState*
PyInterpreterState_New
()¶ Create a new interpreter state object. The global interpreter lock need notbe held, but may be held if it is necessary to serialize calls to thisfunction.
Raises anauditing event
cpython.PyInterpreterState_New
with no arguments.
- void
PyInterpreterState_Clear
(PyInterpreterState *interp)¶ Reset all information in an interpreter state object. The global interpreterlock must be held.
Raises anauditing event
cpython.PyInterpreterState_Clear
with no arguments.
- void
PyInterpreterState_Delete
(PyInterpreterState *interp)¶ Destroy an interpreter state object. The global interpreter lock need not beheld. The interpreter state must have been reset with a previous call to
PyInterpreterState_Clear()
.
- PyThreadState*
PyThreadState_New
(PyInterpreterState *interp)¶ Create a new thread state object belonging to the given interpreter object.The global interpreter lock need not be held, but may be held if it isnecessary to serialize calls to this function.
- void
PyThreadState_Clear
(PyThreadState *tstate)¶ Reset all information in a thread state object. The global interpreter lockmust be held.
- void
PyThreadState_Delete
(PyThreadState *tstate)¶ Destroy a thread state object. The global interpreter lock need not be held.The thread state must have been reset with a previous call to
PyThreadState_Clear()
.
- PY_INT64_T
PyInterpreterState_GetID
(PyInterpreterState *interp)¶ Return the interpreter’s unique ID. If there was any error in doingso then
-1
is returned and an error is set.New in version 3.7.
- PyObject*
PyInterpreterState_GetDict
(PyInterpreterState *interp)¶ Return a dictionary in which interpreter-specific data may be stored.If this function returns
NULL
then no exception has been raised andthe caller should assume no interpreter-specific dict is available.This is not a replacement for
PyModule_GetState()
, whichextensions should use to store interpreter-specific state information.New in version 3.8.
- PyObject*
PyThreadState_GetDict
()¶ - Return value: Borrowed reference.
Return a dictionary in which extensions can store thread-specific stateinformation. Each extension should use a unique key to use to store state inthe dictionary. It is okay to call this function when no current thread stateis available. If this function returns
NULL
, no exception has been raised andthe caller should assume no current thread state is available.
- int
PyThreadState_SetAsyncExc
(unsigned long id,PyObject *exc)¶ Asynchronously raise an exception in a thread. Theid argument is the threadid of the target thread;exc is the exception object to be raised. Thisfunction does not steal any references toexc. To prevent naive misuse, youmust write your own C extension to call this. Must be called with the GIL held.Returns the number of thread states modified; this is normally one, but will bezero if the thread id isn’t found. Ifexc is
NULL
, the pendingexception (if any) for the thread is cleared. This raises no exceptions.Changed in version 3.7:The type of theid parameter changed from
long
tounsignedlong
.
- void
PyEval_AcquireThread
(PyThreadState *tstate)¶ Acquire the global interpreter lock and set the current thread state totstate, which should not be
NULL
. The lock must have been created earlier.If this thread already has the lock, deadlock ensues.Note
Calling this function from a thread when the runtime is finalizingwill terminate the thread, even if the thread was not created by Python.You can use
_Py_IsFinalizing()
orsys.is_finalizing()
tocheck if the interpreter is in process of being finalized before callingthis function to avoid unwanted termination.Changed in version 3.8:Updated to be consistent with
PyEval_RestoreThread()
,Py_END_ALLOW_THREADS()
, andPyGILState_Ensure()
,and terminate the current thread if called while the interpreter is finalizing.PyEval_RestoreThread()
is a higher-level function which is alwaysavailable (even when threads have not been initialized).
- void
PyEval_ReleaseThread
(PyThreadState *tstate)¶ Reset the current thread state to
NULL
and release the global interpreterlock. The lock must have been created earlier and must be held by the currentthread. Thetstate argument, which must not beNULL
, is only used to checkthat it represents the current thread state — if it isn’t, a fatal error isreported.PyEval_SaveThread()
is a higher-level function which is alwaysavailable (even when threads have not been initialized).
- void
PyEval_AcquireLock
()¶ Acquire the global interpreter lock. The lock must have been created earlier.If this thread already has the lock, a deadlock ensues.
Deprecated since version 3.2:This function does not update the current thread state. Please use
PyEval_RestoreThread()
orPyEval_AcquireThread()
instead.Note
Calling this function from a thread when the runtime is finalizingwill terminate the thread, even if the thread was not created by Python.You can use
_Py_IsFinalizing()
orsys.is_finalizing()
tocheck if the interpreter is in process of being finalized before callingthis function to avoid unwanted termination.Changed in version 3.8:Updated to be consistent with
PyEval_RestoreThread()
,Py_END_ALLOW_THREADS()
, andPyGILState_Ensure()
,and terminate the current thread if called while the interpreter is finalizing.
- void
PyEval_ReleaseLock
()¶ Release the global interpreter lock. The lock must have been created earlier.
Deprecated since version 3.2:This function does not update the current thread state. Please use
PyEval_SaveThread()
orPyEval_ReleaseThread()
instead.
Sub-interpreter support¶
While in most uses, you will only embed a single Python interpreter, thereare cases where you need to create several independent interpreters in thesame process and perhaps even in the same thread. Sub-interpreters allowyou to do that.
The “main” interpreter is the first one created when the runtime initializes.It is usually the only Python interpreter in a process. Unlike sub-interpreters,the main interpreter has unique process-global responsibilities like signalhandling. It is also responsible for execution during runtime initialization andis usually the active interpreter during runtime finalization. ThePyInterpreterState_Main()
function returns a pointer to its state.
You can switch between sub-interpreters using thePyThreadState_Swap()
function. You can create and destroy them using the following functions:
- PyThreadState*
Py_NewInterpreter
()¶ Create a new sub-interpreter. This is an (almost) totally separate environmentfor the execution of Python code. In particular, the new interpreter hasseparate, independent versions of all imported modules, including thefundamental modules
builtins
,__main__
andsys
. Thetable of loaded modules (sys.modules
) and the module search path(sys.path
) are also separate. The new environment has nosys.argv
variable. It has new standard I/O stream file objectssys.stdin
,sys.stdout
andsys.stderr
(however these refer to the same underlyingfile descriptors).The return value points to the first thread state created in the newsub-interpreter. This thread state is made in the current thread state.Note that no actual thread is created; see the discussion of thread statesbelow. If creation of the new interpreter is unsuccessful,
NULL
isreturned; no exception is set since the exception state is stored in thecurrent thread state and there may not be a current thread state. (Like allother Python/C API functions, the global interpreter lock must be held beforecalling this function and is still held when it returns; however, unlike mostother Python/C API functions, there needn’t be a current thread state onentry.)Extension modules are shared between (sub-)interpreters as follows:
For modules using multi-phase initialization,e.g.
PyModule_FromDefAndSpec()
, a separate module object iscreated and initialized for each interpreter.Only C-level static and global variables are shared between thesemodule objects.For modules using single-phase initialization,e.g.
PyModule_Create()
, the first time a particular extensionis imported, it is initialized normally, and a (shallow) copy of itsmodule’s dictionary is squirreled away.When the same extension is imported by another (sub-)interpreter, a newmodule is initialized and filled with the contents of this copy; theextension’sinit
function is not called.Objects in the module’s dictionary thus end up shared across(sub-)interpreters, which might cause unwanted behavior (seeBugs and caveats below).Note that this is different from what happens when an extension isimported after the interpreter has been completely re-initialized bycalling
Py_FinalizeEx()
andPy_Initialize()
; in thatcase, the extension’sinitmodule
functionis called again.As with multi-phase initialization, this means that only C-level staticand global variables are shared between these modules.
- void
Py_EndInterpreter
(PyThreadState *tstate)¶ Destroy the (sub-)interpreter represented by the given thread state. The giventhread state must be the current thread state. See the discussion of threadstates below. When the call returns, the current thread state is
NULL
. Allthread states associated with this interpreter are destroyed. (The globalinterpreter lock must be held before calling this function and is still heldwhen it returns.)Py_FinalizeEx()
will destroy all sub-interpreters thathaven’t been explicitly destroyed at that point.
Bugs and caveats¶
Because sub-interpreters (and the main interpreter) are part of the sameprocess, the insulation between them isn’t perfect — for example, usinglow-level file operations likeos.close()
they can(accidentally or maliciously) affect each other’s open files. Because of theway extensions are shared between (sub-)interpreters, some extensions may notwork properly; this is especially likely when using single-phase initializationor (static) global variables.It is possible to insert objects created in one sub-interpreter intoa namespace of another (sub-)interpreter; this should be avoided if possible.
Special care should be taken to avoid sharing user-defined functions,methods, instances or classes between sub-interpreters, since importoperations executed by such objects may affect the wrong (sub-)interpreter’sdictionary of loaded modules. It is equally important to avoid sharingobjects from which the above are reachable.
Also note that combining this functionality withPyGILState_*()
APIsis delicate, because these APIs assume a bijection between Python thread statesand OS-level threads, an assumption broken by the presence of sub-interpreters.It is highly recommended that you don’t switch sub-interpreters between a pairof matchingPyGILState_Ensure()
andPyGILState_Release()
calls.Furthermore, extensions (such asctypes
) using these APIs to allow callingof Python code from non-Python created threads will probably be broken when usingsub-interpreters.
Asynchronous Notifications¶
A mechanism is provided to make asynchronous notifications to the maininterpreter thread. These notifications take the form of a functionpointer and a void pointer argument.
- int
Py_AddPendingCall
(int (*func)(void *), void *arg)¶ Schedule a function to be called from the main interpreter thread. Onsuccess,
0
is returned andfunc is queued for being called in themain thread. On failure,-1
is returned without setting any exception.When successfully queued,func will beeventually called from themain interpreter thread with the argumentarg. It will be calledasynchronously with respect to normally running Python code, but withboth these conditions met:
on abytecode boundary;
with the main thread holding theglobal interpreter lock(func can therefore use the full C API).
func must return
0
on success, or-1
on failure with an exceptionset.func won’t be interrupted to perform another asynchronousnotification recursively, but it can still be interrupted to switchthreads if the global interpreter lock is released.This function doesn’t need a current thread state to run, and it doesn’tneed the global interpreter lock.
Warning
This is a low-level function, only useful for very special cases.There is no guarantee thatfunc will be called as quick aspossible. If the main thread is busy executing a system call,func won’t be called before the system call returns. Thisfunction is generallynot suitable for calling Python code fromarbitrary C threads. Instead, use thePyGILState API.
New in version 3.1.
Profiling and Tracing¶
The Python interpreter provides some low-level support for attaching profilingand execution tracing facilities. These are used for profiling, debugging, andcoverage analysis tools.
This C interface allows the profiling or tracing code to avoid the overhead ofcalling through Python-level callable objects, making a direct C function callinstead. The essential attributes of the facility have not changed; theinterface allows trace functions to be installed per-thread, and the basicevents reported to the trace function are the same as had been reported to thePython-level trace functions in previous versions.
- int
(*Py_tracefunc)
(PyObject *obj,PyFrameObject *frame, int what,PyObject *arg)¶ The type of the trace function registered using
PyEval_SetProfile()
andPyEval_SetTrace()
. The first parameter is the object passed to theregistration function asobj,frame is the frame object to which the eventpertains,what is one of the constantsPyTrace_CALL
,PyTrace_EXCEPTION
,PyTrace_LINE
,PyTrace_RETURN
,PyTrace_C_CALL
,PyTrace_C_EXCEPTION
,PyTrace_C_RETURN
,orPyTrace_OPCODE
, andarg depends on the value ofwhat:Value ofwhat
Meaning ofarg
PyTrace_CALL
Always
Py_None
.PyTrace_EXCEPTION
Exception information as returned by
sys.exc_info()
.PyTrace_LINE
Always
Py_None
.PyTrace_RETURN
Value being returned to the caller,or
NULL
if caused by an exception.PyTrace_C_CALL
Function object being called.
PyTrace_C_EXCEPTION
Function object being called.
PyTrace_C_RETURN
Function object being called.
PyTrace_OPCODE
Always
Py_None
.
- int
PyTrace_CALL
¶ The value of thewhat parameter to a
Py_tracefunc
function when a newcall to a function or method is being reported, or a new entry into a generator.Note that the creation of the iterator for a generator function is not reportedas there is no control transfer to the Python bytecode in the correspondingframe.
- int
PyTrace_EXCEPTION
¶ The value of thewhat parameter to a
Py_tracefunc
function when anexception has been raised. The callback function is called with this value forwhat when after any bytecode is processed after which the exception becomesset within the frame being executed. The effect of this is that as exceptionpropagation causes the Python stack to unwind, the callback is called uponreturn to each frame as the exception propagates. Only trace functions receivesthese events; they are not needed by the profiler.
- int
PyTrace_LINE
¶ The value passed as thewhat parameter to a
Py_tracefunc
function(but not a profiling function) when a line-number event is being reported.It may be disabled for a frame by settingf_trace_lines
to0 on that frame.
- int
PyTrace_RETURN
¶ The value for thewhat parameter to
Py_tracefunc
functions when acall is about to return.
- int
PyTrace_C_CALL
¶ The value for thewhat parameter to
Py_tracefunc
functions when a Cfunction is about to be called.
- int
PyTrace_C_EXCEPTION
¶ The value for thewhat parameter to
Py_tracefunc
functions when a Cfunction has raised an exception.
- int
PyTrace_C_RETURN
¶ The value for thewhat parameter to
Py_tracefunc
functions when a Cfunction has returned.
- int
PyTrace_OPCODE
¶ The value for thewhat parameter to
Py_tracefunc
functions (but notprofiling functions) when a new opcode is about to be executed. This event isnot emitted by default: it must be explicitly requested by settingf_trace_opcodes
to1 on the frame.
- void
PyEval_SetProfile
(Py_tracefunc func,PyObject *obj)¶ Set the profiler function tofunc. Theobj parameter is passed to thefunction as its first parameter, and may be any Python object, or
NULL
. Ifthe profile function needs to maintain state, using a different value forobjfor each thread provides a convenient and thread-safe place to store it. Theprofile function is called for all monitored events exceptPyTrace_LINE
PyTrace_OPCODE
andPyTrace_EXCEPTION
.
- void
PyEval_SetTrace
(Py_tracefunc func,PyObject *obj)¶ Set the tracing function tofunc. This is similar to
PyEval_SetProfile()
, except the tracing function does receive line-numberevents and per-opcode events, but does not receive any event related to C functionobjects being called. Any trace function registered usingPyEval_SetTrace()
will not receivePyTrace_C_CALL
,PyTrace_C_EXCEPTION
orPyTrace_C_RETURN
as a value for thewhat parameter.
Advanced Debugger Support¶
These functions are only intended to be used by advanced debugging tools.
- PyInterpreterState*
PyInterpreterState_Head
()¶ Return the interpreter state object at the head of the list of all such objects.
- PyInterpreterState*
PyInterpreterState_Main
()¶ Return the main interpreter state object.
- PyInterpreterState*
PyInterpreterState_Next
(PyInterpreterState *interp)¶ Return the next interpreter state object afterinterp from the list of allsuch objects.
- PyThreadState *
PyInterpreterState_ThreadHead
(PyInterpreterState *interp)¶ Return the pointer to the first
PyThreadState
object in the list ofthreads associated with the interpreterinterp.
- PyThreadState*
PyThreadState_Next
(PyThreadState *tstate)¶ Return the next thread state object aftertstate from the list of all suchobjects belonging to the same
PyInterpreterState
object.
Thread Local Storage Support¶
The Python interpreter provides low-level support for thread-local storage(TLS) which wraps the underlying native TLS implementation to support thePython-level thread local storage API (threading.local
). TheCPython C level APIs are similar to those offered by pthreads and Windows:use a thread key and functions to associate avoid*
value perthread.
The GIL doesnot need to be held when calling these functions; they supplytheir own locking.
Note thatPython.h
does not include the declaration of the TLS APIs,you need to includepythread.h
to use thread-local storage.
Note
None of these API functions handle memory management on behalf of thevoid*
values. You need to allocate and deallocate them yourself.If thevoid*
values happen to bePyObject*
, thesefunctions don’t do refcount operations on them either.
Thread Specific Storage (TSS) API¶
TSS API is introduced to supersede the use of the existing TLS API within theCPython interpreter. This API uses a new typePy_tss_t
instead ofint
to represent thread keys.
New in version 3.7.
See also
“A New C-API for Thread-Local Storage in CPython” (PEP 539)
Py_tss_t
¶This data structure represents the state of a thread key, the definition ofwhich may depend on the underlying TLS implementation, and it has aninternal field representing the key’s initialization state. There are nopublic members in this structure.
WhenPy_LIMITED_API is not defined, static allocation ofthis type by
Py_tss_NEEDS_INIT
is allowed.
Py_tss_NEEDS_INIT
¶This macro expands to the initializer for
Py_tss_t
variables.Note that this macro won’t be defined withPy_LIMITED_API.
Dynamic Allocation¶
Dynamic allocation of thePy_tss_t
, required in extension modulesbuilt withPy_LIMITED_API, where static allocation of this typeis not possible due to its implementation being opaque at build time.
- Py_tss_t*
PyThread_tss_alloc
()¶ Return a value which is the same state as a value initialized with
Py_tss_NEEDS_INIT
, orNULL
in the case of dynamic allocationfailure.
- void
PyThread_tss_free
(Py_tss_t *key)¶ Free the givenkey allocated by
PyThread_tss_alloc()
, afterfirst callingPyThread_tss_delete()
to ensure any associatedthread locals have been unassigned. This is a no-op if thekeyargument isNULL.Note
A freed key becomes a dangling pointer, you should reset the key toNULL.
Methods¶
The parameterkey of these functions must not beNULL
. Moreover, thebehaviors ofPyThread_tss_set()
andPyThread_tss_get()
areundefined if the givenPy_tss_t
has not been initialized byPyThread_tss_create()
.
- int
PyThread_tss_is_created
(Py_tss_t *key)¶ Return a non-zero value if the given
Py_tss_t
has been initializedbyPyThread_tss_create()
.
- int
PyThread_tss_create
(Py_tss_t *key)¶ Return a zero value on successful initialization of a TSS key. The behavioris undefined if the value pointed to by thekey argument is notinitialized by
Py_tss_NEEDS_INIT
. This function can be calledrepeatedly on the same key – calling it on an already initialized key is ano-op and immediately returns success.
- void
PyThread_tss_delete
(Py_tss_t *key)¶ Destroy a TSS key to forget the values associated with the key across allthreads, and change the key’s initialization state to uninitialized. Adestroyed key is able to be initialized again by
PyThread_tss_create()
. This function can be called repeatedly onthe same key – calling it on an already destroyed key is a no-op.
Thread Local Storage (TLS) API¶
Deprecated since version 3.7:This API is superseded byThread Specific Storage (TSS) API.
Note
This version of the API does not support platforms where the native TLS keyis defined in a way that cannot be safely cast toint
. On such platforms,PyThread_create_key()
will return immediately with a failure status,and the other TLS functions will all be no-ops on such platforms.
Due to the compatibility problem noted above, this version of the API should notbe used in new code.
- int
PyThread_create_key
()¶
- void
PyThread_delete_key
(int key)¶
- int
PyThread_set_key_value
(int key, void *value)¶
- void*
PyThread_get_key_value
(int key)¶
- void
PyThread_delete_key_value
(int key)¶
- void
PyThread_ReInitTLS
()¶