String formatting: % vs. .format vs. f-string literal

Question 1

There are various string formatting methods:

Python <2.6: "Hello %s" % name
Python 2.6+: "Hello {}".format(name) (uses str.format)
Python 3.6+: f"{name}" (uses f-strings)

Which is better, and for what situations?

The following methods have the same outcome, so what is the difference?

name = "Alice"

"Hello %s" % name
"Hello {0}".format(name)
f"Hello {name}"

# Using named arguments:
"Hello %(kwarg)s" % {'kwarg': name}
"Hello {kwarg}".format(kwarg=name)
f"Hello {name}"

When does string formatting run, and how do I avoid a runtime performance penalty?

If you are trying to close a duplicate question that is just looking for a way to format a string, please use "How do I put a variable's value inside a string?".

Question 2

Similar to stackoverflow.com/questions/3691975/…

Question 3

For beginners: Here is a very nice tutorial that teaches both styles. I personally use the older % style more often, because if you do not need the improved capabilities of the format() style, the % style is often a lot more convenient.

Question 4

For reference: Python 3 documentation for the newer format() formatting style and the older %-based formatting style.

Question 5

See also: Python's many ways of string formatting

Question 6

To answer your second question, since 3.2 you can use {} format if you use a custom formatter (see docs.python.org/3/library/logging.html#logging.Formatter)
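To make that comment concrete, here is a minimal sketch of `logging.Formatter` with `style='{'`. As far as I can tell, the style only affects the formatter's layout string; arguments passed to `log.debug()` are still merged with %-style placeholders. The logger name and the in-memory stream are assumptions made for the example.

```python
import io
import logging

# Capture output in memory so the example is self-contained.
stream = io.StringIO()
handler = logging.StreamHandler(stream)
# style='{' makes the *layout* string use str.format-style fields.
handler.setFormatter(logging.Formatter("{levelname}:{name}:{message}", style="{"))

log = logging.getLogger("demo_brace_style")  # hypothetical logger name
log.setLevel(logging.DEBUG)
log.addHandler(handler)
log.propagate = False

# The message arguments themselves are still merged with %-style placeholders.
log.debug("some debug info: %s", "roflcopters are active")

output = stream.getvalue()
print(output)
```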

Question 7

To answer your first question... .format just seems more sophisticated in many ways. An annoying thing about % is also how it can either take a variable or a tuple. You'd think the following would always work:

"Hello %s" % name

yet, if name happens to be (1, 2, 3), it will throw a TypeError. To guarantee that it always prints, you'd need to do

"Hello %s" % (name,)   # supply the single argument as a single-item tuple

which is just ugly. .format doesn't have those issues. Also in the second example you gave, the .format example is much cleaner looking.
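A small sketch of the tuple pitfall described above, with the two workarounds side by side. The function names are made up for illustration.

```python
def greet_unsafe(name):
    # Breaks when `name` is a tuple: % unpacks it as the argument list.
    return "Hello %s" % name


def greet_safe(name):
    # Wrapping in a one-item tuple makes `name` a single argument.
    return "Hello %s" % (name,)


try:
    greet_unsafe((1, 2, 3))
    unsafe_error = None
except TypeError as exc:
    unsafe_error = exc

safe_result = greet_safe((1, 2, 3))
format_result = "Hello {}".format((1, 2, 3))

print(repr(unsafe_error))
print(safe_result)
print(format_result)
```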

Only use % for backwards compatibility with Python 2.5.

To answer your second question, string formatting happens at the same time as any other operation: when the string formatting expression is evaluated. And Python, not being a lazy language, evaluates expressions before calling functions, so the expression log.debug("some debug info: %s" % some_info) will first evaluate the string to, e.g. "some debug info: roflcopters are active", then that string will be passed to log.debug().

Question 8

What about "%(a)s, %(a)s" % {'a':'test'}?

Question 9

Note that you will waste time for log.debug("something: %s" % x) but not for log.debug("something: %s", x). The string formatting will be handled in the method and you won't get the performance hit if it won't be logged. As always, Python anticipates your needs =)

Question 10

ted: that's a worse-looking hack to do the same as '{0}, {0}'.format('test').

Question 11

The point is: The one recurring argument that the new syntax allows reordering of items is a moot point: You can do the same with the old syntax. Most people do not know that this is actually already defined in the Ansi C99 Std! Check out a recent copy of man sprintf and learn about the $ notation inside % placeholders.

Question 12

@cfi: If you mean something like printf("%2$d", 1, 3) to print out "3", that's specified in POSIX, not C99. The very man page you referenced notes, "The C99 standard does not include the style using '$'…".

Question 13

Something that the modulo operator ( % ) can't do, afaik:

tu = (12,45,22222,103,6)
print '{0} {2} {1} {2} {3} {2} {4} {2}'.format(*tu)

result

12 22222 45 22222 103 22222 6 22222

Very useful.

Another point: format(), being a function, can be used as an argument in other functions:

li = [12,45,78,784,2,69,1254,4785,984]
print map('the number is {}'.format,li)

print

from datetime import datetime,timedelta

once_upon_a_time = datetime(2010, 7, 1, 12, 0, 0)
delta = timedelta(days=13, hours=8,  minutes=20)

gen =(once_upon_a_time +x*delta for x in xrange(20))

print '\n'.join(map('{:%Y-%m-%d %H:%M:%S}'.format, gen))

Results in:

['the number is 12', 'the number is 45', 'the number is 78', 'the number is 784', 'the number is 2', 'the number is 69', 'the number is 1254', 'the number is 4785', 'the number is 984']

2010-07-01 12:00:00
2010-07-14 20:20:00
2010-07-28 04:40:00
2010-08-10 13:00:00
2010-08-23 21:20:00
2010-09-06 05:40:00
2010-09-19 14:00:00
2010-10-02 22:20:00
2010-10-16 06:40:00
2010-10-29 15:00:00
2010-11-11 23:20:00
2010-11-25 07:40:00
2010-12-08 16:00:00
2010-12-22 00:20:00
2011-01-04 08:40:00
2011-01-17 17:00:00
2011-01-31 01:20:00
2011-02-13 09:40:00
2011-02-26 18:00:00
2011-03-12 02:20:00
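The code in this answer is Python 2 (print statement, xrange). For anyone on Python 3, a rough equivalent might look like the sketch below; the only changes are print(), range(), and wrapping map() in list(), since map() returns an iterator there.

```python
from datetime import datetime, timedelta

li = [12, 45, 78, 784, 2, 69, 1254, 4785, 984]
# In Python 3, map() returns an iterator, so wrap it in list() to see the values.
labels = list(map("the number is {}".format, li))
print(labels)

once_upon_a_time = datetime(2010, 7, 1, 12, 0, 0)
delta = timedelta(days=13, hours=8, minutes=20)
# range() replaces Python 2's xrange().
gen = (once_upon_a_time + x * delta for x in range(20))
stamps = list(map("{:%Y-%m-%d %H:%M:%S}".format, gen))
print("\n".join(stamps))
```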

Question 14

You can use old style formatting in map just as easily as format: map('some_format_string_%s'.__mod__, some_iterable)

Question 15

@cfi: please prove you are right by rewriting the example above in C99

Question 16

@MarcH: printf("%2$s %1$s\n", "One", "Two"); compiled with gcc -std=c99 test.c -o test, the output is Two One. But I stand corrected: It is actually a POSIX extension and not C. I cannot find it again in the C/C++ standard, where I thought I'd seen it. The code works even with 'c90' std flag. sprintf man page. This does not list it, but allows libs to implement a superset. My original argument is still valid, replacing C with Posix.

Question 17

My first comment here does not apply to this answer. I regret the phrasing. In Python we cannot use the modulo operator % for reordering placeholders. I'd still like to not delete that first comment for the sake of comment consistency here. I apologize for having vented my anger here. It is directed against the often made statement that the old syntax per se would not allow this. Instead of creating a completely new syntax we could have introduced the std Posix extensions. We could have both.

Question 18

'modulo' refers to the operator that evaluates a remainder after a division. In this case the percent sign is not a modulo operator.

Question 19

Assuming you're using Python's logging module, you can pass the string formatting arguments as arguments to the .debug() method rather than doing the formatting yourself:

log.debug("some debug info: %s", some_info)

which avoids doing the formatting unless the logger actually logs something.
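Here is one way to see the difference, as a minimal sketch. The `Noisy` class and the logger name are hypothetical; the class just counts how often it gets converted to a string, and the logger is set to INFO so DEBUG records are discarded.

```python
import logging


class Noisy:
    """Hypothetical object that counts how often it is turned into a string."""

    def __init__(self):
        self.str_calls = 0

    def __str__(self):
        self.str_calls += 1
        return "roflcopters are active"


log = logging.getLogger("demo_lazy")  # hypothetical logger name
log.setLevel(logging.INFO)            # DEBUG records are dropped
log.addHandler(logging.NullHandler())
log.propagate = False

eager, lazy = Noisy(), Noisy()
log.debug("some debug info: %s" % eager)  # formatted before debug() is called
log.debug("some debug info: %s", lazy)    # never formatted: record is discarded

print(eager.str_calls, lazy.str_calls)
```

The eager call pays for the formatting even though nothing is logged; the second call never touches the object.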

Question 20

This is some useful info that I just learned now. It's a pity it doesn't have its own question, as it seems separate from the main question. Pity the OP didn't split his question into two separate questions.

Question 21

You can use dict formatting like this: log.debug("some debug info: %(this)s and %(that)s", dict(this='Tom', that='Jerry')). However, you can't use the new style .format() syntax here, not even in Python 3.3, which is a shame.

Question 22

@Cito: See this: plumberjack.blogspot.co.uk/2010/10/…

Question 23

The primary benefit of this is not performance (doing the string interpolation will be quick compared to whatever you're doing with the output from logging, e.g. displaying in a terminal or saving to disk). It is that if you have a logging aggregator, it can tell you "you got 12 instances of this error message", even if they all had different 'some_info' values. If the string formatting is done before passing the string to log.debug, then this is impossible. The aggregator can only say "you had 12 different log messages".
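To illustrate the aggregation point, here is a toy sketch. `TemplateCounter` is a made-up stand-in for a real aggregator; it relies on `record.msg` holding the unformatted template when arguments are passed separately.

```python
import logging
from collections import Counter


class TemplateCounter(logging.Handler):
    """Hypothetical stand-in for a log aggregator: groups records by template."""

    def __init__(self):
        super().__init__()
        self.counts = Counter()

    def emit(self, record):
        # record.msg is the unformatted template; record.getMessage() is the merged text.
        self.counts[record.msg] += 1


counter = TemplateCounter()
log = logging.getLogger("demo_aggregate")  # hypothetical logger name
log.setLevel(logging.DEBUG)
log.addHandler(counter)
log.propagate = False

for some_info in ("disk full", "timeout", "bad token"):
    log.error("request failed: %s", some_info)   # one template, three values
    log.error("request failed: %s" % some_info)  # three distinct strings

print(counter.counts)
```

The deferred calls collapse into a single key with a count of three, while the pre-formatted calls show up as three unrelated messages.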

Question 24

If you're concerned about performance, use literal dict {} syntax instead of a dict() class instantiation: doughellmann.com/2012/11/…

Question 25

As of Python 3.6 (2016) you can use f-strings to substitute variables:

>>> origin = "London"
>>> destination = "Paris"
>>> f"from {origin} to {destination}"
'from London to Paris'

Note the f" prefix. If you try this in Python 3.5 or earlier, you'll get a SyntaxError.

See https://docs.python.org/3.6/reference/lexical_analysis.html#f-strings

Question 26

This doesn't answer the question. Another answer that mentions f-strings at least talks about performance: stackoverflow.com/a/51167833/7851470

Question 27

f-strings are cute, and remind me of Ruby syntax. But they don't seem to have a lot of advantages, and, as you've said, they unnecessarily break compatibility with Python < 3.6

Question 28

I just don't like putting expressions into strings which means searching for code is now hidden in strings and syntax errors are not discovered until run time. I think they're saying that f-strings are not any more susceptible to injection attacks but I'm not sure I believe them.
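On the "not discovered until run time" worry: as far as I know, a malformed expression inside an f-string is rejected when the source is compiled, like any other syntax error, while a misspelled name only fails when the string is evaluated. A small sketch using the built-in compile() to check this without running the code:

```python
# compile() parses source without running it, much like importing a module does.
try:
    compile('f"{1 +}"', "<example>", "exec")
    syntax_error_at_compile_time = False
except SyntaxError:
    syntax_error_at_compile_time = True

# A misspelled name, by contrast, compiles fine and only fails when evaluated.
code = compile('f"{undefined_name}"', "<example>", "eval")
try:
    eval(code, {})
    name_error_at_run_time = False
except NameError:
    name_error_at_run_time = True

print(syntax_error_at_compile_time, name_error_at_run_time)
```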

Question 29

PEP 3101 proposes the replacement of the % operator with the new, advanced string formatting in Python 3, where it would be the default.

Question 30

Untrue: "Backwards compatibility can be maintained by leaving the existing mechanisms in place."; of course, .format won't replace % string formatting.

Question 31

No, BrainStorms postulation is true: "intended as a replacement for the existing '%'". Tobias quote means both systems will coexist for some time. RTFPEP

Question 32

But please be careful, just now I've discovered one issue when trying to replace all % with .format in existing code: '{}'.format(unicode_string) will try to encode unicode_string and will probably fail.

Just look at this Python interactive session log:

Python 2.7.2 (default, Aug 27 2012, 19:52:55)
[GCC 4.1.2 20080704 (Red Hat 4.1.2-48)] on linux2
; s='й'
; u=u'й'
; s
'\xd0\xb9'
; u
u'\u0439'

s is just a string (called 'byte array' in Python3) and u is a Unicode string (called 'string' in Python3):

; '%s' % s
'\xd0\xb9'
; '%s' % u
u'\u0439'

When you give a Unicode object as a parameter to the % operator it will produce a Unicode string even if the original string wasn't Unicode:

; '{}'.format(s)
'\xd0\xb9'
; '{}'.format(u)
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
UnicodeEncodeError: 'latin-1' codec can't encode character u'\u0439' in position 0: ordinal not in range(256)

but the .format function will raise "UnicodeEncodeError":

; u'{}'.format(s)
u'\xd0\xb9'
; u'{}'.format(u)
u'\u0439'

and it will work with a Unicode argument fine only if the original string was Unicode.

; '{}'.format(u'i')
'i'

or if the argument string can be converted to a string (so called 'byte array')

Question 33

There is simply no reason to change working code unless the additional features of the new format method are really needed ...

Question 34

Absolutely agree with you, Tobias, but sometimes it's needed when upgrading to newer versions of Python.

Question 35

For instance? AFAIK, it has never been needed; I don't consider it likely that the % string interpolation would ever go away.

Question 36

I consider the .format() function safer than % for strings. Often I see beginners' mistakes like this: "p1=%s p2=%d" % "abc", 2 or "p1=%s p2=%s" % (tuple_p1_p2,). You might think it's the coder's fault but I think it's just weird faulty syntax that looks nice for the quicky-scriptie but is bad for production code.

Question 37

But I don't like the syntax of .format(), I'd be happier with good old %s, %02d like "p1=%s p2=%02d".format("abc", 2). I blame those who invented and approved the curly braces formatting that needs you to escape them like {{}} and looks ugly imho.

Question 38

% gives better performance than format from my test.

Test code:

Python 2.7.2:

import timeit
print 'format:', timeit.timeit("'{}{}{}'.format(1, 1.23, 'hello')")
print '%:', timeit.timeit("'%s%s%s' % (1, 1.23, 'hello')")

Result:

> format: 0.470329046249
> %: 0.357107877731

Python 3.5.2

import timeit
print('format:', timeit.timeit("'{}{}{}'.format(1, 1.23, 'hello')"))
print('%:', timeit.timeit("'%s%s%s' % (1, 1.23, 'hello')"))

Result

> format: 0.5864730989560485
> %: 0.013593495357781649

It looks like in Python 2 the difference is small, whereas in Python 3 % is much faster than format.

Thanks @Chris Cogdon for the sample code.

Edit 1:

Tested again in Python 3.7.2 in July 2019.

Result:

> format: 0.86600608
> %: 0.630180146

There is not much difference. I guess Python is improving gradually.

Edit 2:

After someone mentioned Python 3's f-string in a comment, I did a test for the following code under Python 3.7.2:

import timeit
print('format:', timeit.timeit("'{}{}{}'.format(1, 1.23, 'hello')"))
print('%:', timeit.timeit("'%s%s%s' % (1, 1.23, 'hello')"))
print('f-string:', timeit.timeit("f'{1}{1.23}{\"hello\"}'"))

Result:

format: 0.8331376779999999
%: 0.6314778750000001
f-string: 0.766649943

It seems f-string is still slower than % but better than format.

Question 39

Instead, str.format gives more functionalities (especially type-specialized formatting e.g. '{0:%Y-%m-%d}'.format(datetime.datetime.utcnow())). Performance cannot be the absolute requirement of all jobs. Use the right tool for the job.
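The type-specialized formatting mentioned here works because str.format hands the part after the colon to the object's `__format__` method. A minimal sketch, where the `Money` class and its "usd" spec are invented purely for illustration:

```python
from datetime import datetime


class Money:
    """Hypothetical type that defines its own format-spec mini-language."""

    def __init__(self, cents):
        self.cents = cents

    def __format__(self, spec):
        # Treat the spec "usd" as our own code; delegate anything else to float.
        amount = self.cents / 100
        if spec == "usd":
            return "${:,.2f}".format(amount)
        return format(amount, spec)


when = datetime(2010, 7, 1, 12, 0, 0)
line = "{0:%Y-%m-%d} total {1:usd} ({1:.1f})".format(when, Money(123456))
print(line)
```

The % operator has no comparable hook: it only knows its fixed set of conversion codes.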

Question 40

"Premature optimization is the root of all evil" or so Donald Knuth once said...

Question 41

Sticking with a well-known formatting scheme (as long as it suits the needs, which it does in the vast majority of cases), and which is twice as fast, is no "premature optimization" but simply reasonable. BTW, the % operator lets you reuse printf knowledge; dictionary interpolation is a very simple extension of the principle.

Question 42

From my test there is also a huge difference between Python 3 and Python 2.7, where % is much more efficient than format() in Python 3. The code that I used can be found here: github.com/rasbt/python_efficiency_tweaks/blob/master/test_code/… and github.com/rasbt/python_efficiency_tweaks/blob/master/test_code/…

Question 43

I've actually experienced the opposite in one situation. New-style formatting was faster. Can you provide the test code you used?

Question 44

Yet another advantage of .format (which I don't see in the answers): it can take object properties.

In [12]: class A(object):
   ....:     def __init__(self, x, y):
   ....:         self.x = x
   ....:         self.y = y
   ....:

In [13]: a = A(2,3)

In [14]: 'x is {0.x}, y is {0.y}'.format(a)
Out[14]: 'x is 2, y is 3'

Or, as a keyword argument:

In [15]: 'x is {a.x}, y is {a.y}'.format(a=a)
Out[15]: 'x is 2, y is 3'

This is not possible with % as far as I can tell.
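For what it's worth, % has no attribute-lookup syntax, but mapping keys can get close. The sketch below assumes the object keeps its attributes in an ordinary instance `__dict__`, which is what `vars()` returns; it would not cover properties or `__slots__`.

```python
class A(object):
    def __init__(self, x, y):
        self.x = x
        self.y = y


a = A(2, 3)

new_style = "x is {0.x}, y is {0.y}".format(a)
# vars(a) returns a.__dict__, so %(name)s keys resolve to instance attributes.
old_style = "x is %(x)s, y is %(y)s" % vars(a)

print(new_style)
print(old_style)
```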

Question 45

This looks more unreadable than necessary compared to the equivalent 'x is {0}, y is {1}'.format(a.x, a.y). Should only be used when the a.x operation is very costly.

Question 46

@dtheodor With a tweak to use a keyword argument instead of positional argument... 'x is {a.x}, y is {a.y}'.format(a=a). More readable than both examples.

Question 47

@CivFan Or, if you have more than one object, 'x is {a.x}, y is {a.y}'.format(**vars())

Question 48

Also note this one in the same fashion: '{foo[bar]}'.format(foo={'bar': 'baz'}).

Question 49

This is incredibly useful for customer-facing applications, where your application supplies a standard set of formatting options with a user-supplied format string. I use this all the time. The configuration file, for instance, will have some "messagestring" property, which the user can supply with

Your order, number {order[number]} was processed at {now:%Y-%m-%d %H:%M:%S}, will be ready at about {order[eta]:%H:%M:%S}

or whatever they wish. This is far cleaner than trying to offer the same functionality with the old formatter. It makes user-supplied format strings way more powerful.
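A sketch of how such a configured template might be rendered. The order data and the fixed timestamp are made up for the example; only the template text comes from the post above.

```python
from datetime import datetime

# Pretend this template was read from a configuration file.
messagestring = (
    "Your order, number {order[number]} was processed at "
    "{now:%Y-%m-%d %H:%M:%S}, will be ready at about {order[eta]:%H:%M:%S}"
)

# Made-up data; `now` is fixed so the output is repeatable.
order = {"number": 1042, "eta": datetime(2014, 5, 1, 17, 30, 0)}
now = datetime(2014, 5, 1, 16, 45, 12)

rendered = messagestring.format(order=order, now=now)
print(rendered)
```

The application only decides which objects are exposed (`order`, `now`); the person writing the template picks the fields and the date layout.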

Question 50

As I discovered today, the old way of formatting strings via % doesn't support Decimal, Python's module for decimal fixed point and floating point arithmetic, out of the box.

Example (using Python 3.3.5):

#!/usr/bin/env python3

from decimal import *

getcontext().prec = 50
d = Decimal('3.12375239e-24') # no magic number, I rather produced it by banging my head on my keyboard

print('%.50f' % d)
print('{0:.50f}'.format(d))

Output:

0.00000000000000000000000312375239000000009907464850
0.00000000000000000000000312375239000000000000000000

There surely might be work-arounds but you still might consider using the format() method right away.

Question 51

That's probably because new-style formatting calls str(d) before expanding the parameter, whereas old-style formatting probably calls float(d) first.

Question 52

You'd think so, but str(d) returns "3.12375239e-24", not "0.00000000000000000000000312375239000000000000000000"
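One way to test the guesses in these comments, as a sketch: compare the % result with formatting an explicit float(d). If they match, the % path is going through a binary float, whereas str.format appears to use Decimal's own `__format__` and keeps the exact decimal digits.

```python
from decimal import Decimal, getcontext

getcontext().prec = 50  # same precision setting as the example above
d = Decimal("3.12375239e-24")

via_percent = "%.50f" % d
via_float = "%.50f" % float(d)
via_format = "{0:.50f}".format(d)

print(via_percent == via_float)  # does % behave like an explicit float() conversion?
print(via_format)
```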

Question 53

If your Python is >= 3.6, the f-string formatted literal is your new friend.

It is simpler, cleaner, and performs better.

In [1]: params=['Hello', 'adam', 42]

In [2]: %timeit "%s %s, the answer to everything is %d."%(params[0],params[1],params[2])
448 ns ± 1.48 ns per loop (mean ± std. dev. of 7 runs, 1000000 loops each)

In [3]: %timeit "{} {}, the answer to everything is {}.".format(*params)
449 ns ± 1.42 ns per loop (mean ± std. dev. of 7 runs, 1000000 loops each)

In [4]: %timeit f"{params[0]} {params[1]}, the answer to everything is {params[2]}."
12.7 ns ± 0.0129 ns per loop (mean ± std. dev. of 7 runs, 100000000 loops each)

Question 54

As of Python 3.11, C-style formatting (with %s, %a and %r) is now as fast as the corresponding f-string expression

Question 55

This is only true for "string literals containing only the format codes %s, %r and %a"

Question 56

As a side note, you don't have to take a performance hit to use new style formatting with logging. You can pass any object to logging.debug, logging.info, etc. that implements the __str__ magic method. When the logging module has decided that it must emit your message object (whatever it is), it calls str(message_object) before doing so. So you could do something like this:

import logging


class NewStyleLogMessage(object):
    def __init__(self, message, *args, **kwargs):
        self.message = message
        self.args = args
        self.kwargs = kwargs

    def __str__(self):
        args = (i() if callable(i) else i for i in self.args)
        kwargs = dict((k, v() if callable(v) else v) for k, v in self.kwargs.items())

        return self.message.format(*args, **kwargs)

N = NewStyleLogMessage

# Neither one of these messages are formatted (or calculated) until they're
# needed

# Emits "Lazily formatted log entry: 123 foo" in log
logging.debug(N('Lazily formatted log entry: {0} {keyword}', 123, keyword='foo'))


def expensive_func():
    # Do something that takes a long time...
    return 'foo'

# Emits "Expensive log entry: foo" in log
logging.debug(N('Expensive log entry: {keyword}', keyword=expensive_func))

This is all described in the Python 3 documentation (https://docs.python.org/3/howto/logging-cookbook.html#formatting-styles). However, it will work with Python 2.6 as well (https://docs.python.org/2.6/library/logging.html#using-arbitrary-objects-as-messages).

One of the advantages of using this technique, other than the fact that it's formatting-style agnostic, is that it allows for lazy values e.g. the function expensive_func above. This provides a more elegant alternative to the advice being given in the Python docs here: https://docs.python.org/2.6/library/logging.html#optimization.

Question 57

I wish I could upvote this more. It allows logging with format without the performance hit, does it by overriding __str__ precisely as logging was designed for, shortens the function call to a single letter (N) which feels very similar to some of the standard ways to define strings, AND allows for lazy function calling. Thank you! +1

Question 58

Is this any different in outcome to using the logging.Formatter(style='{') parameter?

Question 59

One situation where % may help is when you are formatting regex expressions. For example,

'{type_names} [a-z]{2}'.format(type_names='triangle|square')

raises IndexError. In this situation, you can use:

'%(type_names)s [a-z]{2}' % {'type_names': 'triangle|square'}

This avoids writing the regex as '{type_names} [a-z]{{2}}'. This can be useful when you have two regexes, where one is used alone without format, but the concatenation of both is formatted.

Question 60

Or just use '{type_names} [a-z]{{2}}'.format(type_names='triangle|square'). It's like saying .format() can help when using strings which already contain a percent character. Sure. You have to escape them then.

Question 61

@Alfe You are right, and that is why the answer starts with "One situation where % may help is when you are formatting regex expressions." Specifically, assume a=r"[a-z]{2}" is a regex chunk that will be used in two different final expressions (e.g. c1 = b + a and c2 = a). Assume that c1 needs to be formatted (e.g. b needs to be formatted at runtime), but c2 does not. Then you need a=r"[a-z]{2}" for c2 and a=r"[a-z]{{2}}" for c1.format(...).
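The trade-off in this exchange can be sketched as follows, reusing the pattern from the answer. The variable names are mine; `chunk` plays the role of the regex piece that is also used on its own.

```python
import re

chunk = r"[a-z]{2}"  # regex piece reused with and without formatting

# str.format treats {2} as a replacement field, so the unescaped chunk fails.
try:
    ("{type_names} " + chunk).format(type_names="triangle|square")
    format_error = None
except IndexError as exc:
    format_error = exc

# %-formatting leaves braces alone, so the chunk can be reused unchanged.
with_percent = ("%(type_names)s " + chunk) % {"type_names": "triangle|square"}
# With str.format the braces of the quantifier must be doubled.
with_format = "{type_names} [a-z]{{2}}".format(type_names="triangle|square")

print(with_percent == with_format)
print(bool(re.fullmatch(with_percent, "square ab")))
```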

Question 62

I would add that since version 3.6, we can use f-strings like the following

foo = "john"
bar = "smith"
print(f"My name is {foo} {bar}")

Which gives

My name is john smith

Everything is converted to strings

mylist = ["foo", "bar"]
print(f"mylist = {mylist}")

Result:

mylist = ['foo', 'bar']

You can call functions inside the braces, as with the other formatting methods

import time
print(f'Hello, here is the date : {time.strftime("%d/%m/%Y")}')

Giving for example

Hello, here is the date : 16/04/2018

Question 63

Python 3.6.7 comparative:

#!/usr/bin/env python
import timeit

def time_it(fn):
    """
    Measure time of execution of a function
    """
    def wrapper(*args, **kwargs):
        t0 = timeit.default_timer()
        fn(*args, **kwargs)
        t1 = timeit.default_timer()
        print("{0:.10f} seconds".format(t1 - t0))
    return wrapper


@time_it
def new_new_format(s):
    print("new_new_format:", f"{s[0]} {s[1]} {s[2]} {s[3]} {s[4]}")


@time_it
def new_format(s):
    print("new_format:", "{0} {1} {2} {3} {4}".format(*s))


@time_it
def old_format(s):
    print("old_format:", "%s %s %s %s %s" % s)


def main():
    samples = (("uno", "dos", "tres", "cuatro", "cinco"), (1,2,3,4,5), (1.1, 2.1, 3.1, 4.1, 5.1), ("uno", 2, 3.14, "cuatro", 5.5),)
    for s in samples:
        new_new_format(s)
        new_format(s)
        old_format(s)
        print("-----")


if __name__ == '__main__':
    main()

Output:

new_new_format: uno dos tres cuatro cinco
0.0000170280 seconds
new_format: uno dos tres cuatro cinco
0.0000046750 seconds
old_format: uno dos tres cuatro cinco
0.0000034820 seconds
-----
new_new_format: 1 2 3 4 5
0.0000043980 seconds
new_format: 1 2 3 4 5
0.0000062590 seconds
old_format: 1 2 3 4 5
0.0000041730 seconds
-----
new_new_format: 1.1 2.1 3.1 4.1 5.1
0.0000092650 seconds
new_format: 1.1 2.1 3.1 4.1 5.1
0.0000055340 seconds
old_format: 1.1 2.1 3.1 4.1 5.1
0.0000052130 seconds
-----
new_new_format: uno 2 3.14 cuatro 5.5
0.0000053380 seconds
new_format: uno 2 3.14 cuatro 5.5
0.0000047570 seconds
old_format: uno 2 3.14 cuatro 5.5
0.0000045320 seconds
-----

Question 64

You should run each example several times; a single run may be misleading, e.g. the operating system may be generally busy so execution of your code gets delayed. See the docs: docs.python.org/3/library/timeit.html. (nice avatar, Guybrush!)
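Following that advice, a repeated measurement might look like the sketch below. It uses `timeit.repeat` and keeps the minimum of several runs; the statements, repeat count and loop count are arbitrary choices for illustration. Values are passed in through `setup` rather than written as literals, on the assumption that literal operands can let the compiler precompute some results and skew the comparison.

```python
import timeit

statements = {
    "%": "'%s%s%s' % (a, b, c)",
    "format": "'{}{}{}'.format(a, b, c)",
    "f-string": "f'{a}{b}{c}'",
}
# Variables instead of literals, so nothing can be precomputed at compile time.
setup = "a, b, c = 1, 1.23, 'hello'"

results = {}
for label, stmt in statements.items():
    # repeat() returns one total per run; the minimum is the least disturbed run.
    runs = timeit.repeat(stmt, setup=setup, repeat=5, number=100_000)
    results[label] = min(runs)
    print(f"{label:>8}: {results[label]:.4f} s per 100,000 loops")
```

Absolute numbers will differ by machine and Python version, so treat them as a relative comparison only.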

Accepted answer by Claudiu, 2022-07-11 00:30:36Z

