- Notifications
You must be signed in to change notification settings - Fork196
📖🐍 Free & Open Source book to master Python 3. Also available: PDF & Web Interface.
License
animator/learn-python
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
Contributors should go through theContributing Guide to learn how you can contribute to the project.
byAnkit Mahato [About]
Version 2022.10
This book can be consumed in 3 ways:
- Introduction
- Python Fundamentals
- Variables, Objects & Data Types
- Input / Output
- Operators & Expressions
- Errors & Exception Handling
- Control Flow
- Strings
- List
- What is a Python List? How to Create a List?
- Accessing Items of a List
- Modifying a List
- Removing Item from a List
- List Operations
- Traversing a List
- Built-in Functions that can be used for a List
- Introduction to List Methods
- Adding Items to a List
- Removing Items from a List
- Counting or Locating Items in a List
- Reversing Items
- Sorting a List
- Copying a List
- Nested List
- List Comprehension
- Sample Programs
- Tuple
- List vs Tuple
- How to Create Tuple?
- What is a Singleton?
- Accessing Items of a Tuple
- Tuples are Immutable
- Tuple Operations
- Traversing a Tuple
- Built-in Functions that can be used for a Tuple
- Locating Items in a Tuple
- Counting the Occurrence of an Item
- New Tuple with Reversed Items
- New Tuple with Sorted Items
- Nested Tuple
- Understanding the Immutable Nature of Tuples
- Dictionary
- Python Standard Library
- File Handling
- User Defined Functions, Modules & Packages
In today's digital era, we depend on computers, smartphones and the internet to perform a plethora of tasks, like:
- A mathematical task, such as finding the square root of a number or solving a set of simultaneous equations.
- A text-based task such as reading a document and performing search/replace.
- Streaming and playing multimedia files containing audio and video.
- Using a search engine to find and visit a website.
- Playing an online multiplayer game with friends.
- and many more...
Softwares play an important role as they translate human activity into corresponding machine instructions which are executed to accomplish these tasks.
Asoftware is a collection of programs where each program provides a sequence of instructions specifying how the computer should act.
These instructions have to be provided inmachine language orlow level language (0s and 1s) that is difficult to read or write for a human being.
This led to the invention ofhigh-level programming languages in which programs can be easily written and managed. The human-readable programs written using high-level languages are converted into computer-readable machine code or byte-code usingcompilers orinterpreters.
There are many high-level programming languages that are currently in wide use.
Some of the popular languages are Java, C, C++, C#, Go, Swift, JavaScript, PHP, Dart, Kotlin and Python.
Guido van Rossum started the development of Python in December 1989. He released the first version (0.9.0) of Python for general public on February 20, 1991.
The language evolved over the next few decades and so did its definition, the current version of which is stated below:
Python is a high-level, interpreted, object-oriented programming language with dynamic semantics.
Let us break down and analyze the above definition to gain a better understanding of Python:
High-level
Python is ahigh-level programming language which can be used to write a program in natural language (english) making it readable, writable, shareable and manageable.
While developing a Python program one is not required to handle the various components of computer architecture like registers, memory addresses and call stacks which have to be handled if an assembly language or a low-level language is used for development.
Python includes high-level language features like variables, data structures (lists, dictionaries, etc.), objects, expressions, modules, classes, functions, loops, threads, file handling, string handling, error handling and other computer science abstraction concepts.
Interpreted
In traditional programming languages like C or C++, codes are compiled into computer-readable machine code before it can be executed.
Python is aninterpreted language where the Python interpreter reads and executes the program line by line.
The process is more time consuming compared to compiled code execution, but allows faster development as one does not have to go through the entire compilation step during testing and debugging. Also, the code can run on any platform as long as it has a valid Python installation (which includes interpreter) as there is no generation of platform dependent binaries.
Object-oriented
Python does not enforceObject-oriented programming (OOP), but completely supports it.
A programmer can define Classes specifying the data in the form of attributes (or properties) and some programming logic in the form of member functions (or methods). Once a class is defined, the user can create an instance of that class which is known as an object.
In Python, everything (int
,list
,dict
, etc.) is an object. We will cover more about objects in detail in the later sections.
Dynamic Semantics
As Python is an interpreted language in which the code is executed line-by-line, a python statement or expression is evaluated during run-time. This allowsdynamic typing (type of a variable can change over its lifetime) and creation of dynamic objects during run-time, which provides more flexibility, usability and fewer lines of code as compared to statically-typed compiled languages like C/C++.
The key advantages of Python are as follows:
1. Easy to Learn
The Python programming language is easy to learn with low technical and conceptual overhead. This makes it an ideal language for beginners to learn programming.
2. Clear Syntax & Fewer Lines of Code
Python's simple and easy to learn syntax increases code readability and leads to fewer lines of code.
A typical task which requires an average of twenty lines of code in C and seven in Java can often be done with just one line in Python.
Also, due to fewer lines of code the chances of making any error is significantly reduced.
3. Open Source
Python is an open source programming language, so anyone can view and contribute to its source code.
4. Portable & Platform Independent
The Python programming language is portable & platform independent as it can work on any Linux, MacOS or Windows device.
5. Standard Library & Python Packages
Python has a rich and extensive Standard Library, a collection of predefined functions for various tasks.
Python programmers also have at their disposal the vast ecosystem of more than 250,000 community contributed libraries in the Python Package Index (PyPI), where one can find a solution to every conceivable task.
6. Web Application Development
Some of the most popular web development frameworks (django, flask, etc.) are written in Python. This coupled with the availability of packages to connect to any database makes Python a great choice for web application development.
Let's start with the Python 3 installation process on Windows operating system.
Step 1: Download Installer
Download the latest Windows installer from thePython Software Foundation website.
Step 2: Install Python 3
Once the download is complete double-click and run it.
Select the checkbox ✅Add Python 3.9 to PATH
. This will enable you to install python packages and run python script via command-line.
Hit 🛡️Install Now
and complete the setup.
Step 3: Verify Installation
Once the setup is complete, click on theStart
menu and openPython 3.9 -> IDLE (Python 3.9 64 bit)
to launch the Python interpreter.
Python 3.9 is now successfully installed on your computer.
Let's start with the Python 3 installation process on macOS operating system.
Step 1: Download Installer
Download the latest macOS installer from thePython Software Foundation website.
Step 2: Install Python 3
Once the download is complete double-click and run it.
HitContinue
and complete the setup.
Step 3: Verify Installation
Once the setup is complete, click on theLaunchpad -> IDLE
to launch the Python interpreter.
Python 3.9 is now successfully installed on your computer.
After installing the latest version of the Python interpreter, we can now write and execute some basic Python codes.
There are two ways to execute a Python program:
- Interactive Mode: When the IDLE application is launched, the Python interpreter or the Python shell pops up on the screen. User can interact with the Python interpreter and execute statements (single line or multiline code snippets) directly in this Python shell.
- Script Mode: This is the most commonly used method for executing a Python program. The entire Python program is written and saved in a file (
.py
extension) which can be executed using the IDLE application.
Let us execute some basic Python statements and interact with the Python shell.
Launching the Python Shell
To launch the IDLE application click[Windows Start Menu Button] -> [Python 3.9 Folder] -> [IDLE (Python 3.9 64 bit)]
.
The Python interpreter or the Python shell will pop-up on the screen.
The version (3.9
) of the Python interpreter is displayed at the top of the window followed by the>>>
symbol which indicates that the interpreter is ready to take instructions.
Python commands or statements can be input on this prompt. The input statements are executed instantaneously and any variable assignments are retained as long as the session is not terminated.
Basic Arithmetic
Let us perform some basic arithmetic operations in the interactive mode using an integer number (2
) and a floating-point number (3.5
):
>>>2+24>>>2*3.57.0
It can be observed that the results of each of the above computations are displayed immediately in the shell.
Storing Values/Results
Instead of immediately displaying the results, they can also be stored in variables using the assignment symbol (=
) as shown below:
>>>a=2+2>>>b=2*3.5
The values ofa
andb
can be accessed later for future calculations as shown below:
>>>a4>>>b7.0>>>a*520>>>b/32.3333333333333335
Basic String Operation
Interactive mode is not just restricted to basic arithmetic or assignments. Let us join two strings -"Hello, "
and"world!"
.
>>>"Hello, "+"world!"'Hello, world!'
The complete functionality of Python is easily accessible to a user via theInteractive Mode.
This makes it convenient for testing and instant execution of small code snippets (single line or few lines of code), a feature not available in compiled languages like C, C++ and Java.
But, the statements cannot be saved for future use and have to retyped for re-execution. This disadvantage is overcome by the use of Python inScript Mode as described in the next section.
To write reusable codes, script mode is the most preferred mode of code execution.
File Creation
To create a new file using the IDLE application click[File] -> [New File]
Write a simple Python program as shown below
a=2+2a
and save the script asexample.py
(.py
file extension for all Python scripts) using[File] -> [Save As...]
Script Execution
Now run this script using[Run] -> [Run Module]
.
It can be observed that the code has been executed, but no output is displayed on the console (or the standard output) as all outputs have to be explicitly specified when running a code in the script mode.
This can be done by using theprint()
function which is used in Python scripts to display output on the output stream. Let us quickly add theprint()
function in the above code and execute it.
a=2+2print(a)
Now, when you run the script you will observe that the value ofa
, that is4
, is now displayed on the console.
When a Python code is executed, the Python interpreter reads each logical line and breaks it into a sequence of lexical units.
These lexical units are better known astokens - the smallest individual units of a program. They are the building blocks of a Python code and can be classified into one of the following categories:
- Keywords : Reserved words that convey special meaning when processed by the Python interpreter.
- Identifiers : Names defined by the programmer to refer to objects that can represent variables, functions, classes, etc.
- Literals : Values specified in the program which belong to exactly one of the Python's built-in data types.
- Delimiters : Symbols that denote grouping, punctuation, and assignment/binding.
- Operators : Symbols that can operate on data and compute results.
Keywords are reserved words that have special meaning when processed by the Python interpreter. They are case-sensitive and cannot be used for naming identifiers (class, function, variable or structure names).
The list of keywords in Python are provided below:
True | False | import |
from | as | None |
and | or | not |
in | is | try |
except | finally | raise |
del | global | nonlocal |
lambda | def | class |
with | if | elif |
else | pass | for |
while | continue | break |
assert | return | yield |
async | await |
Identifiers are used for defining the names of Python objects such as variables, functions, classes, modules, etc. The naming convention for identifiers is as follows:
- Must begin with a lowercase character (
a-z
) or an uppercase character (A-Z
) or underscore sign (_
). - Followed by any number of letters (
a-z
,A-Z
), digits (0-9
), or underscores (_
). - Should not be a keyword.
- No special symbols are allowed like
!
,@
,#
,$
,%
, etc.
Some points to keep in mind while naming identifiers:
- Identifiers are case-sensitive in nature and any difference in case of any character refers to a different identifier. e.g.,
length
andLength
are different identifiers. - Identifiers differing by only underscores are different. e.g.,
unitlength
andunit_length
are different identifiers.
It is also a good practice (although not compulsory) to follow the following procedure while naming identifiers:
- Identifiers should be named carefully with an emphasis on clarity and readability. For example, in a program that calculates the area of a rectangle, a good choice for identifier names are -
length
,breadth
andarea
. - Class names should start with uppercase character.
- Identifiers starting with an underscore have special meaning in a program.
- Variable, function and method names should be in lowercase characters, with underscores separating multiple words like
area_of_square
,area_of_triangle
, etc.
Literals are tokens in the source code which represent fixed or constant values. They are often used in assignment statements for initializing variables or in comparison expressions.
The various types of literals available in Python are as follows:
Numeric literals are used for representing numeric values in the source code. They can be of three types - integers, float point numbers and imaginary numbers.
Integer literals are numbers without any fractional component.
In Python, integer literals can be written in four positional (base) numeral systems:
i. Decimal or base-10 Integer
A decimal integer literal consists of one or more digits (0-9
) and cannot have any zeros preceding the first non-zero digit, except when the number is0
.
Example base-10 integers:
3432832988640
092
is not a valid decimal integer literal as a zero precedes the first non-zero digit9
.
ii. Binary or base-2 Integer
A binary integer or base-2 integer begins with0b
or0B
followed by binary digits0-1
.
For example,27
can be written as a binary integer literal0b11011
.
iii. Octal or base-8 Integer
An octal integer or base-8 integer begins with0o
or0O
followed by octal digits0-7
.
For example,27
can be written as an octal integer literal0o33
.
iv. Hexadecimal or base-16 Integer
A hexadecimal integer or base-16 integer begins with0x
or0X
followed by digits0-9
or lettersA-F
(case insensitive).
For example,27
can be written as a hexadecimal integer literal0x1B
or0x1b
.
Thus, it can be observed that number27
can be written in the program as27
(decimal),0b11011
(binary),0o33
(octal) or0x1B
(hexadecimal).
Underscores in Integer Literals
An optional character_
(underscore) is also allowed in an integer literal to group digits for enhanced readability.
One underscore can occur between digits, and after base specifiers like0o
.
They are ignored while determining the actual numerical value of the literal.
Some valid underscore usages are -10_00_00_000
,0b_1110_0101
,0x23_123
.
Floating point literals are real numbers present in the source code. They contain fractional component and/or exponential component.
The fractional component includes the digits after the decimal point (.
).
Example floating point literals:
3.4.48.3.4E23.4e-2
In the above example,.4
is equivalent to0.4
and8.
is equivalent to8.0
.
The exponential component can be identified by the lettere
orE
followed by an optional sign (+
or-
) and digits (0-9
). This exponent is equivalent to multiplying the real number with the power of10
.
For example,3.4E2
is equivalent to3.4 x 10^2
or340.0
, whereas3.4e-2
is equivalent to3.4 x 10^-2
or.034
.
To specify complex numbers and perform complex number mathematics, Python supports imaginary literals which are given by real or integer number followed by the letterj
orJ
which represents the unit imaginary number.
Example imaginary literals:
3.5j 15.j 12j .005j 3e100j 3.5e-10j
Points to Note
In Python,
- there is no specialized literal such as a complex literal. A complex number is actually represented in the program using an expression comprising a real number (integer/float numeric literal) and an imaginary number (imaginary literal). For example,
1 + 2j
consists of an integer literal (1
) and a imaginary literal (2j
). - numeric literals do not include the minus sign (
-
).-
is actually a unary operator it combines with a numeric literal to represent negative numbers. For example, in-3.14
the numeric literal is3.14
and-
is an operator.
The reserved wordsTrue
andFalse
are also boolean literals which can be used to specify the truth value in a program.
String literals are texts which can be specified in a variety of ways in Python:
- Single quotes:
'python'
- Double quotes:
"python"
- Triple quoted:
'''Triple python'''
,"""Three python"""
.
Triple quoted strings can also span multiple lines.
Example:
s="I am a String"s1="""AmultilineString"""s2='''Also amultilineString'''
The backslash (\
) character can be used in a string literal to escape characters that otherwise have a special meaning, such as newline, linefeed, or the quote character.
Escape Sequence | Meaning |
---|---|
\\ | Backslash (\ ) |
\' | Single quote (' ) |
\" | Double quote (" ) |
\a | ASCII Bell (BEL) |
\b | ASCII Backspace (BS) |
\f | ASCII Form-feed (FF) |
\n | ASCII Linefeed (LF) |
\r | ASCII Carriage Return (CR) |
\t | ASCII Horizontal Tab (TAB) |
\v | ASCII Vertical Tab (VT) |
Although\'
and\"
can be used to specify quote characters, Python allows embedding double quotes inside a single-quoted string ('My name is "Python".'
) and single quotes inside a double-quoted string ("Python's World"
).
String literals also support unicode characters which can be specified using\u
escape sequence followed by the 4 letter unicode.
>>>print("E = mc\u00B2")E=mc²
In the above example,\u00B2
is the unicode character which represents the 'SUPERSCRIPT TWO'.
None
is a special literal which is used to denote the absence of value.
It should not be confused with0
as0
is an integer literal with a defined finite value, whereasNone
implies nothingness.
>>>a=None>>>a>>>
In the above example, the Python shell does not display any value ofa
as it is assigned asNone
which has no value.
Python has the provision for specifying a collection of literals in the source code using a special syntax known as"displays".
One can create specialized containers like list, set and dictionary using this syntax.
Some example collection of literals (displays) are provided below:
- List:
a = ['a', 'b', 'c']
- Set:
a = {'a', 'b', 'c'}
- Dictionary:
a = {'a':1, 'b':2, 'c':3}
List, set and dictionary will be covered in detail in later chapters.
Operators are tokens which can be combined with values and variables to create expressions which evaluate to a single value. Python supports a rich set of operators:
+-***///% @<<>>&|^~ :=<><=>===!=
Each of the above operators are covered in detail in the chapter - Operators.
Delimiters are tokens which are useful for organizing a program and are used in statements, expressions, functions, literal collections, and various other code structures.
They can be classified based on utility as follows:
()
,[]
and{}
are delimiters used for:
- grouping expressions which can be spread across multiple physical lines.
- creating collection of literals like list display, dictionary display, set display.
- creating parenthesized sub-expression having the highest operator precedence (evaluated first) in a complex expression.
Example
days= ['Sunday','Monday','Tuesday','Wednesday','Thursday','Friday','Saturday']sum_6= (1+2+3+4+5+6)
is equivalent to
days= ['Sunday','Monday','Tuesday','Wednesday','Thursday','Friday','Saturday']sum_6= (1+2+3+4+5+6)
Tokens in Python which are used for punctuation, decoration and annotation are:
. , :; @ ->
The assignment or binding delimiters are used for binding objects to names via assignment statements. The complete list of tokens are provided below:
= += -= *=/= //= %= **=@= &= |= ^=<<= >>=
Except=
, the rest of the tokens have an operator followed by=
character.
These delimiters are also known as augmented assignment operators as they perform an operation in combination with assignment.
A set of valid characters that a programming language recognizes is known as itscharacter set.
Python is a new age programming language which supports Unicode encoding standard. The default encoding for Python source code is UTF-8 (Unicode Transformation Format – 8-bit) which enables developers to use Unicode characters not only as literals, but also as identifiers.
This makes Python one of the very few programming languages that support multiple languages as shown in the example below:
Code
message="हिन्दी में print करे"print(message)क=1# Devanagari Letter KAক=2# Bengali Letter KAக=3# Tamil Letter KAક=4# Gujarati Letter KAprint(क+ক+க+ક)
Output
हिन्दी में print करे10
In traditional programming languages like C++ or Java, programs are organised in form of code blocks.
Each code block contains one or more statements which are enclosed between braces -{
and}
and are executed sequentially.
A sample C++/Java code is provided below which checks for an inputx
.
C++
if (x <10) { cout <<"x is less than 10" << endl;if (x <=5) { cout <<"x is less than or equal to 5" << endl; }else { cout <<"x is more than 5 but less than 10" << endl; }}else { cout <<"x is not less than 10" << endl;}
Java
if (x <10) {System.out.println("x is less than 10");if (x <=5) {System.out.println("x is less than or equal to 5"); }else {System.out.println("x is more than 5 but less than 10"); } }else {System.out.print("x is not less than 10");}
It can be seen how indentations (tab
at the beginning of line) are added (not required by programming language) to the code to increase readability, which helps in guiding readers through the code.
Python
Code blocks in Python are inspired by this idea as it makes it easier to understand a Python code.
A block of code is denoted by line indentation, typically4 spaces (preferred) or atab. This indentation is used to determine the logical group of statements, with all statements within a group having the same level of indentation.
The corresponding Python code for the above C++/java examples is provided below.
Notice how the code blocks are indented according to the logic.
ifx<10:print("x is less than 10")ifx<=5:print("x is less than or equal to 5")else:print("x is more than 5 but less than 10")else:print("x is not less than 10")
Python supports single-line comments and multi-line comments to enhance code readability via adding documentation.
A single line comment begins with#
. Everything between#
and the end of line is ignored by the Python interpreter.
# A single line comment.a=1# assign a
A multiline comment begins with'''
or"""
and ends with the same.
"""I ama multilinecomment."""'''I amalso a multilinecomment.'''
As compared to single line comments, multiline comments should begin at the same indentation level corresponding to their code block.
For example,
"""Begin programhere"""ifx<10:""" Enter code block when x is less than 10 """print("x is less than 10")ifx<=5:""" Enter code block when x is less than or equal to 5 """print("x is less than or equal to 5")else:""" Enter code block when x is more than 5 but less than 10 """print("x is more than 5 but less than 10")else:""" Enter code block when x is not less than 10 """print("x is not less than 10")
A program is a sequence of instructions which often acts on information (data) provided by the user.
The process of creating, storing and manipulating this data helps in the computation of new data or the end result.
Variables are the fundamental building blocks of a program which provide a way to store, access and modify values during the life-cycle of a program.
Each variable has:
- a name (handle),
- a type or data-type (kind of data), and
- a value (actual data).
In traditional programming languages like Java or C++, the type of the variable is pre-defined.
For example, if you want to use the value1
inside the program, you can store it in a variable nameda
of typeint
.
int a = 1;
Thisa
is synonymous to a box of fixed dimensions (fixed type) holding something (value1
) inside it.
In case we want to change the contents of the box, we can replace it with something similar (same type).
a = 2;
The contents of this box can be replicated and placed in a similar (same type) box:
int b = a;
Multiple boxes can exist, each containing an item having the same value.
int x = 3;int y = 3;int z = 3;
As shown above, the programming languages in which the variables (named boxes) are declared along with their types (size of the boxes) are known asstatically typed languages.
The size of these boxes cannot change later in the program until the variable is re-initialized with the same name and different type.
Python is a dynamically-typed language, where every value or data item (of any type like numeric, string, etc.) is an object.
The variable names are just name-tags pointing to the actual object containing data of any type.
As there is no need of any variable declaration in Python before usage, there is no concept of default value (an empty box ornull
) which exists in other programming languages.
Whenever a new object is created in Python, it is assigned a unique identity (ID) which remains the same throughout the lifetime of that object. This ID is the address of the object in memory and the built-in functionid()
returns the value of this address.
>>>a=1>>>id(a)140407745943856>>>a=2>>>id(a)140407745943888
In the above example, the ID ofa
changes as it points to a new object (2
).
>>>b=a>>>id(b)140407745943888
Also, whena
is assigned tob
, instead of creating a new copy,b
points to the same object asa
.
A variable is uniquely identified by a name (identifier) and follows the same naming convention:
- Must begin with a lowercase character (
a-z
) or an uppercase character (A-Z
) or underscore sign (_
). - Followed by any number of letters (
a-z
,A-Z
), digits (0-9
), or underscores (_
). - Should not be a keyword.
- No special symbols are allowed like
!
,@
,#
,$
,%
, etc.
Variables can be bound to a reference of an object (of any type) using assignment statements.
You can create an object (data) and bind it's reference to a variable using equal sign (=
):
count=100# integerpi=3.141# real numbername="Python"# string
Here, L-value refers to the assignable variables (count
,pi
,name
) on the left side of the assignment and R-value refers to the expression on the right side of the assignment operator that has a value (100
,3.141
,"Python"
).
As variables are just references, you can rebind them to another object of same or different type:
a=100# integera=3.141# real numbera="Python"# string
Thedel
statement can be used to unbind the reference to an object.
>>>a=10>>>dela>>>aTraceback (mostrecentcalllast):File"<stdin>",line1,in<module>NameError:name'a'isnotdefined
Accessinga
results in aNameError
as the reference (variable) to the object holding value10
is deleted.
The object is also automatically cleaned up from the memory if there is no other variable referencing to it (garbage collection).
In Python, multiple assignment can be used to condense variables which are set to the same value:
>>>x=y=z='foo'>>>x'foo'>>>y'foo'>>>z'foo'
In Python, a temporary variable is not required to swap values of two variables. Values can be directly swapped (tuple swapping) as shown below:
>>>a='Hello'>>>b='World'>>>b,a=a,b>>>a'World'>>>b'Hello'
In Python, thetype
of a data (or value) is not linked to the variable, but to the actual object which contains it. This type is also known as the object's data type and is used for identifying the operations that can be performed on the data.
The following built-in data types are available in Python:
- Numeric Types -
int
,float
,complex
,bool
- Sequence Types -
list
,tuple
,str
- Set Type -
set
- Mapping Type -
dict
- Special Type -
None
Often sequence, set and mapping types are also collectively known asiterables as they are a collection of items on which a user can traverse (iterate).
Numeric data types are used for storing the following types of numbers:
Integer Numbers
Objects holding integer numbers like-1, 0, 200
are ofint
data type.
Real or Floating-point Numbers
Objects holding real or floating point numbers like-1.1, 3e2, 20.0
are offloat
data type.
Complex Numbers
Objects storing complex numbers like2 + 1j, -3j, -1 + 2J
are of typecomplex
.
Each complex number has two parts, the real part which is a numeric integer or floating point literal, and the imaginary part which is an imaginary literal.
Boolean
The boolean data type (bool
) is a subtype ofint
. It stores the evaluated value of expressions represented as keywords -True
(integer value1
) andFalse
(integer value0
).
An ordered collection of items where each item can be accessed using an integer index is known as a sequence. The following three sequence data types are available in Python:
String
A string (str
data type) is a sequence of zero or more unicode characters enclosed within a pair of single ('
) or double ("
) quotes.
Some example strings are -"42", 'hello', "python"
.
List
Alist
is sequence of items of same or different data types which are enclosed within brackets -[ ]
.
Some example lists are -[1, 2, 3]
,['abc', 23, 3.14]
,['edpunk', 'python']
.
Tuple
Atuple
is an immutable sequence of items of same or different data types which are enclosed within parentheses -( )
.
Some example tuples are -(1, 2, 3)
,('abc', 23, 3.14)
,('edpunk', 'python')
.
Aset
is an unordered collection of unique items of same of different data types which are enclosed in curly braces -{ }
.
Some example sets are -{1, 2, 3}
,{'abc', 23, 3.14}
,{'edpunk', 'python'}
.
dict
is a mapping data type which stores values in the form of key-value pairs.
It is used for representing data where you can quickly access the value (any data type) corresponding to a key (any data type exceptlist
,set
ordict
), just like a dictionary where you can lookup the meaning of a given word.
Keys and corresponding values are separated by colon (:
).
The key-value pairs are separated by comma (,
) and enclosed within curly braces -{ }
.
Some example dictionaries are -{1: "a", 2: "b", 3: "c"}
,{"name": "edpunk", "language": "python"}
.
None
is a special data type which is used to denote the absence of value in an object.
It is neither0
norFalse
as these are defined finite values, whereasNone
implies nothingness.
The built-intype()
function can be used to fetch the data type of an object.
Examples:
>>>count=100>>> type(count)<class'int'>>>>pi=3.141>>> type(pi)<class'float'>>>>name="Python">>> type(name)<class'str'>
This function can be used along with theis
operator in an expression to test whether the object is of the given type.
>>>count=100>>>type(count)isintTrue
Thein
operator can be used along with thetype()
function to test if the data type is one of the mentioned types.
# count is of type int or float>>>type(count)in (int,float)True
The process of converting the data type of an object from one type to another is known asType Casting orType Conversion.
The two kinds of type casting supported in Python are:
The Python interpreter automatically converts the data type without the need of user intervention when evaluating expressions to determine the final data type.
In the below example the final type ofc
is automatically determined asfloat
by the Python interpreter.
>>>a=1# int>>>b=2.0# float>>>c=a+b>>>c3.0>>> type(c)<class'float'>
When the type conversion is explicitly specified by the user using the various built-in functions available in Python, it is known as explicit type casting.
The built-in functions which can be used for explicit type casting are as follows:
1.int()
Creates anint
from abool
,float
orstr
containing integer characters (with or without sign).
>>>int(True)1>>>int(2.3)2>>>int("2")2
2.float()
Creates afloat
from abool
,int
orstr
containing floating point literals (with or without sign).
>>>float(True)1.0>>>float(2)2.0>>>float("2.3")2.3
float()
also accepts the following string inputs -
"Infinity"
"inf"
"nan"
(not a number).
>>>float("Infinity")>1True>>>float("nan")nan
Floating point literals can also contain the following characters -
.
, which denotes the fractional part of a number.e
orE
, which denotes the exponent part of a number.
>>>float("3.14")3.14>>>float("10.")10.0>>>float("1e100")1e+100>>>float("3.14e-10")3.14e-10
3.str()
Converts any object into astr
.
>>>str(2)'2'>>>str([1,2,3,4])'[1, 2, 3, 4]'
4.tuple()
Creates atuple
from an iterable of typestr
,list
,set
orrange
.
>>>tuple('hello')('h','e','l','l','o')>>>tuple([1,2,3,4])(1,2,3,4)>>>tuple(range(6))(0,1,2,3,4,5)
5.list()
Creates alist
from an iterable of typestr
,tuple
,set
orrange
.
>>>list('hello')['h','e','l','l','o']>>>list({1,2,3,4})[1,2,3,4]>>>list(range(6))[0,1,2,3,4,5]
6.set()
Creates aset
from an iterable of typestr
,tuple
,list
orrange
.
>>>set('hello'){'o','e','l','h'}>>>set([1,2,3,4]){1,2,3,4}>>>set(range(6)){0,1,2,3,4,5}
A data type is said to be immutable when the value of an object of that type cannot be modified.
The following data types are immutable:
int
float
complex
bool
tuple
str
None
You might be wondering if some of the above types are immutable then how are we able modify the values of variables?
In case of variable re-assignment, the original objects are not modified, but new objects (with new values) are created in a new memory location and are bound to the variables. The object containing the old value is destroyed if no other variable references it.
Let us take an example,
>>>a=1>>>id_a=id(a)>>>a=2>>>id_a2=id(a)>>>id_a==id_a2False
You can witness in the above example how the object containing the value1
is different from the object containing the value2
, anda
points to the latest object.
Sequence data types like strings and tuples are also immutable, i.e., no modifications are permitted to any item once it is created and any attempt to do so raises an error.
>>>s="Hello">>>s[1]="P"Traceback (mostrecentcalllast):File"<stdin>",line1,in<module>TypeError:'str'objectdoesnotsupportitemassignment>>>t= (1,2,3)>>>t[1]=0Traceback (mostrecentcalllast):File"<stdin>",line1,in<module>TypeError:'tuple'objectdoesnotsupportitemassignment
Although, similar to numeric types the variables can be re-assigned to new sequences.
>>>s="Hello">>>id_s=id(s)>>>s="Help">>>id_s2=id(s)>>>id_s==id_s2False>>>t= (1,2,3)>>>id_t=id(t)>>>t= (0,2,3)>>>id_t2=id(t)>>>id_t==id_t2False
In Python, the following data types are mutable, i.e., any modification does not create a new object but modifies the existing object:
list
set
dict
Let us take a list and modify its contents.
>>>l= [1,2,3]>>>id_l=id(l)>>>l[0]=0>>>l[0,2,3]>>>id_l2=id(l)>>>id_l==id_l2True
Let us take an example of a dictionary and add a newkey:value
pair.
>>>d= {"a":"apple","b":"boy"}>>>id_d=id(d)>>>d["c"]="cat">>>d{'a':'apple','b':'boy','c':'cat'}>>>id_d2=id(d)>>>id_d==id_d2True
Let us take an example of a set and add new item.
>>>s= {"apple","bat"}>>>id_s=id(s)>>>s.add("cat")>>>s{'cat','apple','bat'}>>>id_s2=id(s)>>>id_s==id_s2True
In the above examples, theid
of the objects (list
,dict
,set
) do not change, which implies that no new objects are created and the original objects are modified.
input()
function is used to accept new input data from the user.
When this function is encountered in the code, the python interpreter waits for the user to type a response which is read as a string and assigned to a variable.
>>>name=input()edpunk>>>name'edpunk'
The function also has an optional string argument which is used as a prompt message for the user.
>>>name2=input("Enter name: ")Entername:EdPunk>>>name2'EdPunk'
User input can be converted into integer or floating point numbers using the type conversion functionsint()
andfloat()
.
>>>num=int(input("Enter n: "))Entern:10>>> type(num)<class'int'>>>>num10>>>pi=float(input("Enter pi: "))Enterpi:3.14>>> type(pi)<class'float'>>>>pi3.14
The built-inprint()
function is used to display an output (value of variables, expressions, etc.) on the standard output.
Let us go through a program which computes the area of a rectangle and displays it:
Code
length=10breadth=5area=length*breadthprint("Area:",area)
Output
Area: 50
print()
function can also be used to output the value of multiple objects when they are provided as arguments to the function.
Code
a=2+2b=2*3.5print(a,b)
Output
4 7.0
In the above code, the values ofa
andb
are separated by a blank space (the default value ofsep
).
This property can be modified by providing any user defined separator using thesep
option.
Let us modify the code and provide","
as the separator.
Code
a=2+2b=2*3.5print(a,b,sep=",")
Output
4,7.0
When expressions are provided as arguments to theprint()
function, output is the evaluated value of those expressions.
For example,
Code
print(2+2)print(2*3.5)print("Hello, "+"world!")
Output
47.0Hello, world!
In the above code snippet, eachprint()
function invocation creates a new line of output. This is becauseend
parameter has the newline character ('\n'
) as the default value in theprint()
function.
This can be modified by the user as shown below:
Code
print(2+2,end=",")print(2*3.5,end=";")print("Hello, "+"world!")
Output
4,7.0;Hello, world!
Note
All non-keyword arguments or expressions are converted to strings and written to the output stream by theprint()
function. They are separated bysep
and followed byend
. An emptyprint()
invocation writesend
parameter (an empty line asend
defaults to the newline character'\n'
).
Operators are symbols that perform a single simple task or operation on one or more values resulting in a single evaluated value.
The values on which these operators are applied are calledoperands.
Unary operators are applied on a single operand positioned on right side of the operator.
Following are some unary operators available in Python:
+
(plus): The unary+
operator does not change the value of the operand.-
(minus): The unary-
operator changes the sign of the operand.~
(bitwiseNOT
): The unary~
results in the bit-wise inversion of the integer operand. The bit-wise inversion ofx
is defined as-(x+1)
.
>>>x=5>>>-x-5>>>+x5>>>~x-6
Binary operators are applied on two operands.
For example, arithmetic operators (+
,–
,*
,/
) evaluate the result of mathematical computation of two values.
A rich set of operators are available in Python which can be categorized as follows:
- Arithmetic Operators -
+
,–
,*
,/
,%
,**
,//
- Relational Operators -
==
,!=
,>
,>=
,<
,<=
- Assignment Operators -
+=
,-=
,*=
,/=
,%=
,**=
,//=
- Logical Operators -
not
,or
,and
- Identity Operators -
is
,is not
- Membership Operators -
in
,not in
- Bitwise and Shift Operators -
&
,|
,^
,~
,<<
,>>
Arithmetic operations can be performed in Python using the following arithmetic operators:
Addition
The+
operator adds the values of numeric operands.
>>>2+35>>>2+3.05.0
In case the operands are of typestr
,list
ortuple
, the+
operator concatenates these two sequences or strings.
>>>'edpunk'+'python''edpunkpython'>>> ["ed","punk"]+ ["python", ]['ed','punk','python']
Subtraction
The-
operator subtracts the value of operand on right from the value of operand on left.
>>>2-3-1
Multiplication
The*
operator multiplies the values of numeric operands.
>>>2*36
In case the operands are of typestr
,list
ortuple
, the*
operator returns a sequence or string self-concatenated the specified number of times.
>>>"python"*3'pythonpythonpython'>>> ['ed','py']*3['ed','py','ed','py','ed','py']
Division
The/
operator divides the value of operand on left by the value of operand on right and returns the real number quotient.
>>>6/23.0>>>5/22.5
Floor Division
The//
operator divides the value of operand on left by the value of operand on right and returns the integer quotient.
>>>5//22
Modulus
The%
operator divides the value of operand on left by the value of operand on right and returns the remainder.
>>>5%21
Exponent
The**
operator raises the left operand to the power of the right operand.
>>>5**225
Relational operators are useful for comparing the values of the operands to determine their relationship. Following relational operators are available in Python:
Equals to
The==
operator returnsTrue
if the value of operand on left is same as the value of operand on right.
>>>2==2True>>>2==3False
In case of sequence operands likestr
,list
ortuple
, the result isTrue
if both the sequences are exactly the same.
>>>"python"=="python"True>>>"pypi"=="python"False>>> [1,2,3]== [1,2,3]True
As a sequence is an ordered collection of items, so the order in which the items are positioned is very important.
>>> [2,1,3]== [1,2,3]False
Not equal to
The!=
operator returnsTrue
if the value of operand on left is not equal to the value of operand on right.
>>>2!=2False>>>2!=3True>>>'py'!='oy'True>>> [2,1,3]!= [1,2,3]True>>> [1,2,3]!= [1,2,3]False
Greater than
The>
operator returnsTrue
if the value of operand on left is greater than the value of operand on right.
>>>3>2True>>>2>2False
In case of strings operands,>
operator perform comparison according to the Unicode code point (integer) of each character one-by-one.
The Unicode code point of a character can be obtained using theord()
function in Python.
The code point of first character of both operands are compared. In case they are equal, the code point of next character of both operands are compared and the process continues.
For example,
>>>"python">"Python"True
The code point of"p"
(112
) is greater than the code point of"P"
(80
). As112
is greater than80
the expression evaluates toTrue
.
Let us take another example:
>>>"pYthon">"python"False
The code point of first character is same (112
), so the next set of characters are compared. The code point of"Y"
(89
) is not greater than the code point of"y"
(121
) so the expression evaluates toFalse
.
If two string operandsp
andq
are of unequal lengths (len(p) < len(q)
) andp
is a substring ofq
such thatq = pt
where t is any string of length greater than0
, thenq > p
returnsTrue
.
>>>"python">"py"True
In case of sequence operands likelist
ortuple
, the items are compared one-by-one starting from index0
.
>>> ["p","py","PY"]> ["p","Py","PY"]True>>> [1,3]> [1,2]True>>> [1,3,4]> [1,2]True
In the above examples,"py"
is greater than"Py"
and3
is greater than2
respectively.
If two sequences are of unequal lengths and the smaller sequence is the starting subsequence of the larger one, then the larger sequence is considered greater than the smaller one.
>>> [1,2,4]> [1,2]True
Greater than or equal to
The>=
operator returnsTrue
if the value of operand on left is greater than or equal to the value of operand on right.
>>>3>=3True>>>2>=3False
In case of sequence operands (str
,list
,tuple
), the comparison operation performed is along the same lines as the>
operator discussed above.
>>>"python">="Python"True>>>"python">="python"True>>> ["py","py","PY"]>= ["py","Py","PY"]True>>> [1,2]>= [1,2]True>>> [1,2,4]>= [1,2]True
Less than
The<
operator returnsTrue
if the value of operand on left is less than the value of operand on right.
>>>2<3True>>>3<3False
In case of sequence operands (str
,list
,tuple
), the comparison operation performed is along the same lines as the>
operator discussed above.
>>>"file"<"Pile"False# f(102) is > P(80)>>>"py"<"python"True>>> ["Py","PY"]< ["py","PY"]True>>> ['a',2]< ['a',3]True>>> [1,2]< [1,2,4]True
Less than or equal to
The<=
operator returnsTrue
if the value of operand on left is lesser than or equal to the value of operand on right.
>>>2<=3True>>>3<=3True
In case of sequence operands (str
,list
,tuple
), the comparison operation performed is along the same lines as the>
operator discussed above.
>>>"file"<="Pile"False# f(102) is > P(80)>>>"py"<="python"True>>> ["Py","PY"]<= ["py","PY"]True>>> ['a',3]<= ['b',2]True>>> [1,2]<= [1,2,4]True
The assignment symbol (=
) serves as a delimiter between the name and value in an assignment statement.
It binds (or rebinds) a value (data, variable, expression) on the right to the target variable on the left.
>>>x=1>>>x1>>>y=x>>>y1>>>y="python">>>y'python'
Binary operators can be combined with the assignment symbol to createAugmented Assignment Operators.
These operators perform the binary operation on the two operands and assign the result to the original target (left operand).
If<op>
is a binary operator, then the expressiona <op>= b
containing the augmented assignment operator is equivalent toa = a <op> b
.
+=
The+=
operator adds a value (right operand) to the variable (left operand) and assigns the result to that variable.
>>>a=2>>>a+=3>>>a5>>>x="hello">>>y="world">>>x+=y>>>x'helloworld'
-=
The-=
operator subtracts a value (right operand) from the variable (left operand) and assigns the result to that variable.
>>>a=3>>>a-=2>>>a1
*=
The*=
operator multiplies a value (right operand) to the variable (left operand) and assigns the result to that variable.
>>>a=3>>>a*=2>>>a6>>>x="hi">>>x*=3>>>x'hihihi'
/=
The/=
operator divides the variable (left operand) by a value (right operand) and assigns the result to that variable.
>>>a=4>>>a/=2>>>a2.0
//=
The//=
operator floor divides the variable (left operand) by a value (right operand) and assigns the result to that variable.
>>>a=5>>>a//=2>>>a2
**=
The**=
operator raises the variable (left operand) to a power (right operand) and assigns the result to that variable.
>>>a=4>>>a**=2>>>a16
%=
The%=
operator computes modulus of the variable (left operand) and a value (right operand) and assigns the result to that variable.
>>>a=4>>>a%=3>>>a1
Expressions using logical operators evaluate to a boolean value (True
orFalse
) based on the logical state of the operands.
In Python, all values except0
,None
,False
,""
,''
,()
,[]
,{}
have their logical state asTrue
.
bool()
built-in function can be used to determine the logical state of literals, variables or expressions.
The logical state of the following literals isFalse
.
>>>bool(False)False>>>bool(0)False>>>bool([])False>>>bool(None)False>>>bool("")False>>>bool([])False>>>bool(())False>>>bool({})False
Some example literals having boolean state asTrue
are provided below.
>>>bool(True)True>>>bool(1)True>>>bool(2.0)True>>>bool(100)True>>>bool("python")True>>>bool(["py","thon"])True
not
The logical state of an operand can be reversed (False
toTrue
, and vice versa) using the logicalnot
operator.
>>>n=5>>>bool(n)True>>>bool(notn)False
or
The logicalor
operator returnsTrue
if the logical state of any of the two operands isTrue
.
>>>TrueorFalseTrue>>>bool(1or0)True>>>FalseorFalseFalse
and
The logicaland
operator returnsTrue
if the logical state of both the operands isTrue
.
>>>TrueandTrueTrue>>>TrueandFalseFalse>>>bool(10and20)True>>>bool(1and0)False
We have already witnessed how Python treats every value or data item as an object.
The relational operator==
can be used to test whether the operands contain the same value.
>>>n=1>>>n2=1>>>n==n2True
This operator however does not check if both the operands are referring to the same object or different objects.
The identity operatorsis
andis not
are used to test whether two objects have the same or different identity (pointing to the same location in memory) respectively.
a is b
is equivalent toid(a) == id(b)
, whereid()
is the built-in function which returns the identity of an object.
>>>n=1>>>n2=1>>>nisn2True
In the above example, both variablesn
andn2
point to that same memory location (same object).
>>>l= [1,2,3]>>>l2= [1,2,3]>>>l==l2True>>>lisl2False
In the above example, both listsl
andl2
although contain items with same values, are actually two different objects occupying different memory locations.
The operatorsin
andnot in
test whether a value is present or not present in an iterable (string, list, tuple, set, dictionary).
>>>1in [1,2,3]True>>>"ed"in ["ed","py","hi"]True>>>"ed"in ("ed","py","hi")True>>>'ed'in {'ed':1,'py':2}True>>>"pen"notin ["pencil","ink"]True>>>"pen"notin ["pen","ink"]False
Literals (constants), identifiers (variables) and operators can be combined to form an expression which always evaluates to a single value.
For example,40 + marks
is an expression containing a literal (40
), a variable (marks
) and an operator (+
).
Some valid expressions are provided below:
10
a
-a
a - 10
a + b
4.0 * 3.5
a == b
c in d
a is T
"Hello" + "World"
15 - a*4
3*num + 9/4 - 10%count**2
As shown above, standalone literals (like10
) and variables (likea
) are considered as expressions, but standalone operators are not expressions.
Comparison operators can be chained together in Python.
For example,lower <= age <= upper
is a valid chained expression which is equivalent to the expression -
lower <= age and age <= upper
.
Ifa
,b
,c
, …,y
,z
are expressions andop1
,op2
, …,opN
are comparison operators, then the chained expressiona op1 b op2 c ... y opN z
is equivalent toa op1 b and b op2 c and ... y opN z
.
Python does not have ternary operators (?:
) like other programming languages. Hence, the keywordsif
andelse
are used to create conditional expressions which evaluates to a value based on the given condition.
For example,
var=t_valifcondelsef_val
If the above conditioncond
evaluates toTrue
, then the variablevar
is assignedt_val
, else it is assignedf_val
.
>>>value=1if2>3else-1>>>value-1
While studying mathematics in middle school, we came across theBODMAS (Bracket, Of, Division, Multiplication, Addition, and Subtraction) rule which helps us in understanding how mathematical expressions are computed in the presence of multiple operators (of
,x
,/
,+
,-
).
In Python, we have a large number of operators and a similar rule to determine the order of evaluation of an expression. This is known asoperator precedence where the operator with higher precedence is evaluated before the operator with lower precedence in an expression.
The table below presents the precedence of operators in Python from highest to lowest. Operators in the same row have the same precedence, so in such cases the expression is evaluated from left to right.
Operator | Description |
---|---|
(expressions...) | Parenthesized expression (Group) |
** | Exponentiation |
+x ,-x ,~x | Unary positive, Unary negative, BitwiseNOT |
* ,@ ,/ ,// ,% | Multiplication, Matrix multiplication, Division, Floor division, Remainder |
+ ,- | Addition, Subtraction |
<< ,>> | Shifts |
& | BitwiseAND |
^ | BitwiseXOR |
| | BitwiseOR |
in ,not in ,is ,is not ,< ,<= ,> ,>= ,!= ,== | Membership, Identity & Comparisons |
not x | BooleanNOT |
and | BooleanAND |
or | BooleanOR |
:= | Assignment expression |
Example 1
Evaluate the expression
15 - 2 * 4
Solution
Step:*
has higher precedence over-
15 - 2 * 4
=15 - 8
=7
Example 2
Evaluate the expression
15 - 2 + 4
Solution
Step:-
and+
have the same order of precedence so the expression is evaluated left to right15 - 2 + 4
=13 + 4
=17
Example 3
Evaluate the expression
15 - (2 + 4)
Solution
Parenthesized expression(...)
has the highest precedence so+
is evaluated first15 - (2 + 4)
=15 - 6
=9
Example 4
Evaluate the expression
3 * 2 + 9 / 4 - 10 % 2 ** 2
Step 1
**
takes precedence3 * 2 + 9 / 4 - 10 % 2 ** 2
=3 * 2 + 9 / 4 - 10 % 4
Step 2
*
,/
and%
have the same precedence so they are evaluated left to right.3 * 2 + 9 / 4 - 10 % 4
=6 + 2.25 - 2
Step 3
+
and-
evaluation6 + 2.25 - 2
=6.25
Example 5
Evaluate the expression
20 / 4 // 2 * 2 - 4 + 20
Step 1
*
,/
,//
and%
have the same precedence so they are evaluated left to right.20 / 4 // 2 * 2 - 4 + 20
=5 // 2 * 2 - 4 + 20
=2 * 2 - 4 + 20
=4 - 4 + 20
Step 2
+
and-
evaluation4 - 4 + 20
=20
Example 6
Evaluate the expression
not 6 <= 4 and 3 ** 3 > 12 / 3
Step 1
**
takes precedencenot 6 <= 4 and 3 ** 3 > 12 / 3
=not 6 <= 4 and 27 > 12 / 3
Step 2
/
is next in line of precedencenot 6 <= 4 and 27 > 12 / 3
=not 6 <= 4 and 27 > 4
Step 3
Comparison operators are next in line of precedencenot 6 <= 4 and 27 > 4
=not False and True
Step 4
BooleanNOT
is evaluatednot False and True
=True and True
Step 5
BooleanAND
is evaluatedTrue and True
=True
A program contains"bug(s)" when it is unable to execute or produces an output which is different from what is expected. These bugs are generally introduced by a programmer unknowingly.
The process of identifying and eliminating these bugs or errors is known asdebugging.
The three major types of errors are:
- Syntax Error
- Runtime Error
- Logical Error
Syntax error occurs when the program contains any statement that does not follow the prescribed Python rules or syntax which makes it difficult for the Python interpreter to parse (understand) and execute it.
Some common syntax errors are:
- Missing/Misspelled keyword
- Missing colon or brackets
- Empty block
- Incorrect position of keyword
- Incorrect block indentation
When a code containing syntactically incorrect statement is executed using script mode via IDLE, an error dialog box is displayed.
On closing the dialog box, the incorrect part of the code, the potential cause of error, is highlighted in red.
This error has to be rectified to execute the program correctly.
When a syntactically incorrect statement is executed in the Python console (interactive mode), the Python interpreter displays it and also adds a little arrow (^
) pointing at the entry point or token where the error was detected.
Example
>>>whileTrueprint('Hi!')File"<stdin>",line1whileTrueprint('Hi!')^SyntaxError:invalidsyntax
In the above example there is a syntax error with^
pointing toprint
function which the parser is unable to understand as there is a missing:
(colon) afterTrue
.
A runtime error occurs when the program is terminated prematurely by the Python interpreter as it is unable to execute a statement although it is correct syntactically.
Some runtime error examples are:
- ImportError: Raised when the
import
statement has trouble loading a module or any definition from a module. - IOError: Raised when the interpreter is not able to open the file specified in the program.
- ZeroDivisionError: Raised when a number is divided or mod by zero.
- NameError: Raised when an identifier is encountered which has not been defined.
- ValueError: Raised when an argument or operand is of required data type, but has undesired value.
- IndexError: Raised when the provided index in a sequence (string, list, tuple, etc.) is out of range.
- KeyError: Raised when a dictionary key is not found in the set of existing keys.
- TypeError: Raised while performing an operation on incompatible types.
- IndentationError: Raised when the indentation of a statement or code block is incorrect.
ZeroDivisionError
n=100d=0print(n/d)
Traceback (mostrecentcalllast):File"/Users/name/Desktop/test.py",line3,in<module>print(n/d)ZeroDivisionError:divisionbyzero
NameError
n=100print(d)
Traceback (mostrecentcalllast):File"/Users/name/Desktop/test.py",line2,in<module>print(d)NameError:name'd'isnotdefined
KeyError
d= {1:"1st",2:"2nd"}print(d[3])
Traceback (mostrecentcalllast):File"/Users/name/Desktop/test.py",line2,in<module>print(d[3])KeyError:3
TypeError
n=1s="a"tot=n+s
Traceback (mostrecentcalllast):File"/Users/name/Desktop/test.py",line3,in<module>tot=n+sTypeError:unsupportedoperand type(s)for+:'int'and'str'
Logical error or Semantic error is caused when the issue lies in the underlying meaning of the code which leads to an incorrect output.
As compared to syntax or runtime error there is no termination of the program.
Debugging a logical error requires inspection of the entire code as no guiding error message is displayed.
Let us write a program to calculate the average of two numbers
n=10m=20avg=n+m/2print("Average:",avg)
On executing the script the result is
Average: 20.0
This is incorrect as there is a logical error in the code.
Since/
has a higher precedence over+
,m / 2
is being computed first.
We can modify the code to get rid of the logical error.
n=10m=20avg= (n+m)/2print("Average:",avg)
On executing the script, we now obtain the correct result
Average: 15.0
We have witnessed that even if a program is syntactically correct, its execution may lead to a run-time error.
This error detected during execution is known as anexception which is an object created by the Python interpreter containing information regarding the error like type of error, file name and the location of the error (line number, token) in the program.
Some of the built-in exceptions that are raised by the Python interpreter are -ImportError
,ZeroDivisionError
,NameError
,ValueError
,IndexError
,KeyError
,TypeError
andIndentationError
.
Apart from the Python interpreter, a programmer can also trigger and raise an exception (along with a custom message) in the code usingraise
orassert
statement.
Theraise
statement can be used to throw an exception in a program. The exception may or may not contain the custom error message (recommended).
Let us consider a program which accepts two numbers (a
andb
) from the user and prints the resulta/b
.
Code
a=int(input("Enter a: "))b=int(input("Enter b: "))print("a/b =",a/b)
Output
Enter a: 10Enter b: 0Traceback (most recent call last): File "/Users/name/test.py", line 3, in <module> print("a/b =", a/b)ZeroDivisionError: division by zero
It can be observed that the Python interpreter raises aZeroDivisionError
when the value ofb
is entered as0
.
Now we can modify the above code to raise an exception for such scenarios.
Code
a=int(input("Enter a: "))b=int(input("Enter b: "))ifb==0:raiseException()print("a/b =",a/b)
Output
Enter a: 10Enter b: 0Traceback (most recent call last): File "/Users/name/test.py", line 4, in <module> raise Exception()Exception
An exception is raised, but it is not helpful.
Let us add some custom error message.
Code
a=int(input("Enter a: "))b=int(input("Enter b: "))ifb==0:raiseException("b is zero")print("a/b =",a/b)
Output
Enter a: 10Enter b: 0Traceback (most recent call last): File "/Users/name/test.py", line 4, in <module> raise Exception("b is zero")Exception: b is zero
We can also raise any specific type of error as per the program logic as shown below:
Code
a=int(input("Enter a: "))b=int(input("Enter b: "))ifb==0:raiseValueError("The value of b cannot be zero")print("a/b =",a/b)
Output
Enter a: 10Enter b: 0Traceback (most recent call last): File "/Users/name/test.py", line 4, in <module> raise ValueError("The value of b cannot be zero")ValueError: The value of b cannot be zero
Anassert
statement is often used during code development to act like a safety valve which notifies the programmer in case the test expression is evaluated asFalse
.
If the test expression’s value isTrue
, the code execution continues normally.
AnAssertionError
is raised if the value isFalse
.
Code
a=3b=4asserta==bc=5
Output
Traceback (most recent call last): File "/Users/name/test.py", line 3, in <module> assert a == bAssertionError
The statement also allows for a message to be attached to theAssertionError
.
Code
a=3b=4asserta==b,"a is not equal to b"c=5
Output:
Traceback (most recent call last): File "/Users/name/test.py", line 3, in <module> assert a == b, "a is not equal to b"AssertionError: a is not equal to b
Exception handling is the process of properly handling an exception which can potentially crash a program during execution.
When an error occurs, the program throws an exception.
The runtime system attempts to find anexception handler, a block of code that can handle a particular type of error. Once located, the suitable exception handlercatches the exception and executes the code block which can attempt to recover from the error. In case the error is unrecoverable, the handler provides a way to gently exit the program.
Thetry
statement in Python specifies the exception handlers and/or cleanup code for a code block.
The various parts of a try statement are:
try
block: The block of statements within which an exception might be thrown.except
clause(s): One or more exception handlers. Eachexcept
clause handles a particular type of exception. In case an exception of a particular type occurs in thetry
block, the correspondingexcept
clause code block is executed.else
clause: An optionalelse
clause can also be included after the lastexcept
block. In case no exception is raised, none of theexcept
blocks are executed. In this case, theelse
code block is executed.finally
clause: An optionalfinally
clause can be added at the end of the try statement which includes a block of statements that are executed regardless of whether or not any error occurred inside the try block. This block is usually setup for code cleanup and closing all open file objects.
Here's the general form of these statements:
try: [codeblock]except [exception1 [asidentifier1]]: [exceptioncodeblock1]except [exception2 [asidentifier2]]: [exceptioncodeblock2]......else: [codeblockexecutesifnoerror]finally: [codeblockalwaysexecuted]
A simple Python program can be treated as a block of code where each statement is executed by the Python interpreter in a sequential order from top to bottom.
But, in real world we would like to have some control over the execution of code such as:
- skip or execute a block (set of statements) based on certain conditions
- execute a block repeatedly
- redirect execution to another set of statements
- breaking up the execution
This control over the flow of execution is provided byControl Flow Statements.
They can be categorized as:
- Sequential
- Selection
- Iteration/Repetition
- Jump
- Procedural Abstraction - A sequence of statements are referenced as a single function or method call
- Recursion - Calling a method/function in the same method/function
- Exception Handling
By default the code statements in Python are executed in Sequential order.
The below flow chart demonstrates how 3 statements are executed in a sequential order.
For example,
a=2b=3c=a*bprint(c)
The above code will be executed in the following sequential order:
Selection statements, also known as Decision making statements, control the flow of a program based on the outcome of one or many test expression(s). If the condition is satisfied (True
) then the code block is executed. There is also a provision to execute another code block if the condition is not satisfied.
This process can be demonstrated using the below flowchart:
Python supportsif
compound statement which provides this control. Theif
statement comprises:
if
keyword followed by the test expression, a colon:
and an indented block of code which gets executed if the condition is satisfied- (optional) one or many
elif
clause followed by their test conditions and their corresponding code blocks - (optional)
else
clause and the corresponding code block which gets executed if none of the above conditions (if
,elif
) are satisfied
An exampleif
statement is provided below:
'''age - age of loan applicantemp - is employed (bool)cscore - credit scrore of applicant'''result=Noneifage<26andnotemp:result="Loan rejected"elifage>35andcscore<600:result="Loan rejected"else:result="Loan approved"print(result)
The control flow view of the above code is:
Let us go through some programming problems which utilize selection statements.
1. Absolute Value
Write a program to output the magnitude of difference between two numbers using conditional statement.
Code
n1=int(input("Enter 1st number: "))n2=int(input("Enter 2nd number: "))ifn1>n2:diff=n1-n2else:diff=n2-n1print("The difference of",n1,"and",n2,"is",diff)
Output
Enter 1st number: 12Enter 2nd number: 15The difference of 12 and 15 is 3
2. Sorting 3 Numbers
Write a program to accept 3 numbers from the user and print them in ascending order of value.
Code
a=int(input("Enter 1st number: "))b=int(input("Enter 2nd number: "))c=int(input("Enter 3rd number: "))ifb<a:# Swapping the values of a and ba,b=b,aifc<b:b,c=c,bifb<a:a,b=b,aprint("The numbers in sorted order:",a,",",b,",",c)
Output
Enter 1st number: 9Enter 2nd number: 2Enter 3rd number: 6The numbers in sorted order: 2 , 6 , 9
3. Divisibility
Write a program to accept two numbers and test if the first number is divisible by the second number.
Code
a=int(input("Enter 1st number: "))b=int(input("Enter 2nd number: "))ifa%b==0:print(a,"is divisible by",b)else:print(a,"is not divisible by",b)
Output
Enter 1st number: 9Enter 2nd number: 29 is not divisible by 2Enter 1st number: 9Enter 2nd number: 39 is divisible by 3
Iteration statements, also known as Looping statements, allow repeated execution of a code block.
Python providesfor
andwhile
statements to perform iteration.
Thefor
statement can be used to iterate over the items of a sequence (list
,string
,tuple
,range
). It can also be used to iterate over unordered sequences likeset
anddict
.
This process can be demonstrated using the below flowchart:
Let us go through some code examples to demonstrate howfor
statement can be used to iterate over sequences.
Code
cars= ["Hyundai","Honda","Ford","Toyota","BMW","Volkswagen"]formakeincars:print(make)
Output
HyundaiHondaFordToyotaBMWVolkswagen
Code
cars= ("Hyundai","Honda","Ford","Toyota","BMW","Volkswagen")formakeincars:print(make)
Output
HyundaiHondaFordToyotaBMWVolkswagen
Code
name="python"forcharinname:print(char)
Output
python
Therange
type represents an immutable sequence of numbers that is usually used in for loops for looping a certain number of times.range
object always take the same (small) amount of memory, no matter the size of the range it represents, which is an advantage over a regularlist
ortuple
.
Syntax:range(stop)
orrange(start, stop[, step])
>>>range(10)range(0,10)>>>list(range(10))[0,1,2,3,4,5,6,7,8,9]>>>list(range(1,10,2))[1,3,5,7,9]
range()
function is widely used in afor
statement to control the number of iterations and provide the index value (i
) of each iteration.
Example #1
Print multiples of 5 starting from 0 to 20.
Code
foriinrange(5):print(i*5)
Output
05101520
Example #2
Print all integers from 2 to 5 including the boundary values.
Code
foriinrange(2,6):print(i)
Output
2345
Example #3
Print all odd numbers between 2 and 10.
Code
foriinrange(3,10,2):print(i)
or
foriinrange(2,10):ifi%2!=0:print(i)
Output
3579
Example #4
Print the index of all occurrences ofo
inpython programming
.
Code
s="python programming"foriinrange(len(s)):ifs[i]=="o":print(i)
Output
49
Let us go through some programming problems which utilize thefor
iteration statement.
1. Compound Interest
Write a program to calculate the total compound interest payable for given principal, interest rate (compounded annually) and total time (in years).
Code
prin=float(input("Enter the principal amount: "))rate=float(input("Enter the annual interest rate: "))time=int(input("Enter the loan duration (in years): "))amt=prinforninrange(time):amt+=rate*amt/100print("Total interest payable:",amt-prin)
Output
Enter the principal amount: 500000Enter the annual interest rate: 5Enter the loan duration (in years): 3Total interest payable: 78812.5
2. Factorial
The factorial of a positive integern
, denoted byn!
, is the product of all positive integers less than or equal ton
.n! = n×(n-1)×(n-2)...3×2×1
Write a program to calculaten!
for a givenn
(assumen
is greater than0
).
Code
n=int(input("Enter n: "))factorial=1foriinrange(1,n+1):factorial*=iprint("n! :",factorial)
Output
Enter n: 6n! : 720
while
statement repeatedly executes a code block as long as the test condition is satisfied.
Usually there is a statement at the end of the code block which updates the value of the variable being used in the test expression, so the the loop does not execute infinitely.
A flowchart of the process is provided below:
For example, let us traverse a list and print the position(index) and value of each element until we reach the end of the list.
Code
cars= ["Hyundai","Honda","Ford","Toyota","BMW","Volkswagen"]i=0whilei<len(cars):print(i,cars[i])i+=1
Output
0 Hyundai1 Honda2 Ford3 Toyota4 BMW5 Volkswagen
In the above example, the test condition isi<len(cars)
and the update statement isi+=1
.
Let us go through some programming problems which utilize thewhile
iteration statement.
1. Compound Interest
Write a program to calculate the total compound interest payable for given principal, interest rate (compounded annually) and total time (in years).
Code
prin=float(input("Enter the principal amount: "))rate=float(input("Enter the annual interest rate: "))time=int(input("Enter the loan duration (in years): "))amt=prinwhiletime>0:amt+=rate*amt/100time=time-1print("Total interest payable:",amt-prin)
Output
Enter the principal amount: 500000Enter the annual interest rate: 5Enter the loan duration (in years): 3Total interest payable: 78812.5
2. Factorial
The factorial of a positive integern
, denoted byn!
, is the product of all positive integers less than or equal ton
.n! = n×(n-1)×(n-2)...3×2×1
Write a program to calculaten!
for a givenn
(assumen
is greater than0
).
Code
n=int(input("Enter n: "))factorial=1whilen>0:factorial*=nn=n-1print("n! :",factorial)
Output
Enter n: 6n! : 720
Jump statements are used to (abruptly) alter the flow of execution.
Some of the jump statements available in Python are:
Apass
statement acts as a placeholder and performs null (no) operation.
Various reasons for using the keywordpass
are provided below:
1. Syntactical Requirement
Usingpass
becomes a syntactical requirement for cases where the Python interpreter can raise aSyntaxError
due to missing statements.
The below code will execute successfully without any operation in the loop
foriinrange(6):pass
whereas withoutpass
foriinrange(6):
will throw the followingSyntaxError
File "<ipython-input-18-f2ba5099d499>", line 1 for i in range(6): ^SyntaxError: unexpected EOF while parsing
Similarly, inside anif
if2<3:pass
whereas withoutpass
if2<3:
will throw the followingSyntaxError
File "<ipython-input-20-068861cce0a8>", line 1 if 2 < 3: ^SyntaxError: unexpected EOF while parsing
2. Skipping Code Execution
pass
can be used to skip code execution for certain cases.
For example,
Code
l= [2,3,4,5,6]foriinl:ifi%3==0:passelse:print(i,"is not divisible by 3")
Output
2 is not divisible by 34 is not divisible by 35 is not divisible by 3
3. Placeholders
pass
can be used to create valid empty functions and classes as placeholders which can be modified in the future versions of code.
defemptyFunction():passclassEmptyClass:pass
Thebreak
statement is used to terminate the execution of immediately enclosingfor
orwhile
statement.
The below code will terminate thefor
loop wheni
is equal to4
foriinrange(10):print(i)ifi==4:break
01234
In a while statement,
i=0whilei<10:print(i)ifi==4:breaki+=1
break
will terminate thewhile
loop wheni
is equal to4
01234
continue
statement is used to skip the execution of successive statements and start the next iteration.
The below code will skip all candidates for an interview who have less than 4 years of work experience.
people= [{"name":"ABC","experience":6}, {"name":"EFG","experience":2}, {"name":"JKL","experience":5}, {"name":"XYZ","experience":3},]forcandidateinpeople:ifcandidate["experience"]<4:continueprint(candidate["name"],"is selected for interview")
Output:
ABC is selected for interviewJKL is selected for interview
When a loop is present inside another loop, it is known as a nested loop.
For each iteration of the outer loop, the inner loop undergoes complete iteration. Thus, if the outer loop has to undergon
iterations and the inner loop has to undergom
iterations, the code block inside the inner loop executesn x m
times.
Let us go through a nested loop example:
Write a program to print the factorial of all numbers in the range1
to10
(inclusive).
Code
forninrange(1,11):factorial=1foriinrange(1,n+1):factorial*=iprint(n,"! =",factorial)
Output
1 ! = 12 ! = 23 ! = 64 ! = 245 ! = 1206 ! = 7207 ! = 50408 ! = 403209 ! = 36288010 ! = 3628800
Abreak
statement inside the inner loop terminates only the inner loop whereas the outer loop is not affected.
To develop a better understanding, let us write a program to find all prime numbers between 2 and 40.
Code
forninrange(2,40):i=2whilei<n/2:ifn%i==0:breaki+=1ifi>n/2:print(n,"is prime")
Output
2 is prime3 is prime5 is prime7 is prime11 is prime13 is prime17 is prime19 is prime23 is prime29 is prime31 is prime37 is prime
A String (str
) is animmutable sequence of Unicode characters which is used to handle textual data in Python.
They can be specified by enclosing within:
- Single quotes:
'embedded "double" quotes are allowed'
- Double quotes:
"embedded 'single' quotes are allowed"
- Triple quotes:
'''Three single quotes'''
,"""Three double quotes"""
.
Triple quoted strings can also span multiple lines.
Some examples are provided below:
s="I am a String"s1="""AmultilineString"""s2='''Also amultilineString'''
The backslash (\
) character can be used in a string to escape characters that otherwise have a special meaning, such as newline, linefeed, or the quote character.
Escape Sequence | Meaning |
---|---|
\\ | Backslash (\ ) |
\' | Single quote (' ) |
\" | Double quote (" ) |
\a | ASCII Bell (BEL) |
\b | ASCII Backspace (BS) |
\f | ASCII Form-feed (FF) |
\n | ASCII Linefeed (LF) |
\r | ASCII Carriage Return (CR) |
\t | ASCII Horizontal Tab (TAB) |
\v | ASCII Vertical Tab (VT) |
Although\'
and\"
can be used to specify quote characters, Python allows embedding double quotes inside a single-quoted string ('My name is "Python".'
) and single quotes inside a double-quoted string ("Python's World"
).
Python string objects support Unicode characters.
A unicode character can be specified as\u
followed by the 4 letter unicode (\uXXXX
).
>>>print("E = mc\u00B2")E=mc²
In the above example,\u00B2
is the unicode character which represents the 'SUPERSCRIPT TWO'.
In case you want to create a string object from other data types, just use the built-instr()
function as follows:
>>>str(9)'9'>>>str(10.0)'10.0'
Python strings are "immutable", i.e., the state (value) of the objects cannot be modified after they are created.
Using the standard[ ]
syntax and zero-based indexing, characters can be accessed in a string.
Ifs = "hello"
,
s[0]
will result inh
s[2]
will result inl
s[5]
will result inIndexError: string index out of range
as the length of string is5
(index0
to4
)s[2] = 'p'
will result inTypeError: 'str' object does not support item assignment
ass
is immutable
Python also supports negative indexing, i.e., you can access the values of a string from right to left.
Index of-1
denotes the last character of the string,-2
is the second last character and so forth.
Ifs = "hello"
,
s[-1]
will result ino
s[-4]
will result ine
s[-6]
will result inIndexError: string index out of range
as the length of string is5
(negative index-1
to-5
)
The built-in functionlen()
returns the length of a string which is useful during string traversal or other string operations.
>>>len("hello")5>>>s="sample text">>>len(s)11>>>p="python">>>l=len(p)>>>p[l-1]'n'
We can perform various operations on a string (sequence of characters) such as slicing, membership, concatenation and repetition.
In Python, a character in a string can be easily accessed using its index.
>>>s="Hello">>>s[1]'e'
Python also provides a way to access a substring from a string. This substring is known as aslice and it can be obtained using the slice operator[n:m]
which returns the part of the string from the start index (n
) to the end index (m
), including the first but excluding the last.
>>>s="Hello">>>s[1:3]'el'
If the start index (n
) is omitted, the default value ofn
is set as0
which denotes the beginning of the string. If the end index (m
) is omitted, the substring ends at the last character of the string.
>>>s="Hello">>>s[:3]'Hel'>>>s[3:]'lo'>>>s[:]'Hello'
Negative indexing is also supported in the slice operator.
>>>s="Hello">>>s[-4:-2]'el'
In the above example,-4
is equivalent tolen(s) - 4 = 5 - 4 = 1
and-2
is equivalent to5 - 2 = 3
. Thus,s[-4:-2]
is same ass[1:3]
.
The slice operator also allows the usage of a third index which is known as step as it allows a user to step over (skip) characters.
>>>s="Hello">>>s[0:5:2]'Hlo'
In the above example, the substring begins at the start of the string, takes a step size of2
skippinge
and ends at the last character again skipping the 4th characterl
.
in
andnot in
operators can be used to determine whether a substring is present/not present in a string.
>>>s="Hello">>>"lo"insTrue>>>"lp"notinsTrue
The+
operator can be used to join two strings.
>>>s1="Hello">>>s2="Python">>>s1+s2'HelloPython'>>>s1+"World"'HelloWorld'
The*
operator repeats a string the number of times as specified by the integer operand.
>>>s="Hello">>>s*3'HelloHelloHello'
Apart from the built-in functionlen()
which returns the length of the string, String objects have access to several specialized functions (methods) that can:
- Convert Case of Strings
- Check Characters of a String
- Split a String
- Strip Characters from a String
- Checking for Prefix or Suffix in a String
- Find & Replace Characters in a String
Let us discuss these methods in detail in the coming sections.
The following methods are useful in converting the case of the characters in a string:
All cased characters are converted into lowercase.
>>>"PYthon".lower()'python'
All cased characters are converted into uppercase.
>>>"PYthon".upper()'PYTHON'
Uppercase characters are converted into lowercase and lowercase characters are converted into uppercase.
>>>"PYthon".swapcase()'pyTHON'
First character is capitalized and rest all characters are lowercased.
>>>"hello py".capitalize()'Hello py'
For each word present in the string, the first character is uppercased and the remaining characters are lowercased.
>>>"hello python".title()'Hello Python'
The following methods are used to check the type of characters in a string.
ReturnsTrue
if all characters in the string are letters (A-Z a-z
).
>>>"HelloPython".isalpha()True>>>"Hello Python".isalpha()False# contains whitespace>>>"HelloPython2".isalpha()False# contains digit
ReturnsTrue
if all characters in the string are digits.
>>>"Hello24".isdigit()False# contains alphabets>>>"24".isdigit()True
ReturnsTrue
if all characters in the string are alphanumeric (Alphabets or Numeric).
>>>"02".isalnum()True>>>"HelloPython".isalnum()True>>>"Hello Python v2".isalnum()False# contains whitespace>>>"HelloPythonv2".isalnum()True
ReturnsTrue
if the string is empty or all characters in the string are ASCII.
>>>"".isascii()True>>>"HelloPython".isascii()True>>>"Hello Py\u00B2".isascii()False>>>"पा से python".isascii()False
ReturnsTrue
if all characters are lowercase.
>>>"hello".islower()True>>>"Hello".islower()False
ReturnsTrue
if all characters are uppercase.
>>>"HELLO".isupper()True>>>"Hello".isupper()False
ReturnsTrue
if there are only whitespace characters in the string. Some popular whitespace characters are (space),
\t
(tab),\n
(newline),\r
(carriage return),\f
(form feed) and\v
(vertical tab).
>>>" ".isspace()True
ReturnsTrue
if the string is title-cased, i.e., the first character of every word in the string is uppercased and the remaining characters are lowercased.
>>>"Hello World".istitle()True>>>"Hello world".istitle()False>>>"hello world".istitle()False
Split methods help in splitting/partitioning a string.
partition(sep)
method splits the string when the separator (sep
) is encountered for the first time, and returns a tuple with three items(string before separator, separator, string after separator)
.
>>>"Hi|Ed|Punk".partition('|')('Hi','|','Ed|Punk')
split(sep=None, maxsplit=-1)
method splits a string into a list based on a string separator (sep
).
Ifsep
is not specified, it defaults toNone
, where whitespace is regarded as separator, and the string is stripped of all leading and trailing whitespaces after which it is split into words contained in the string.
>>>"Hi|Ed|Punk".split('|')['Hi','Ed','Punk']>>>"Hi Ed Punk".split()['Hi','Ed','Punk']>>>" Hi Ed Punk ".split()['Hi','Ed','Punk']
Ifmaxsplit
is provided, at mostmaxsplit
number of splits are performed and the list will contain a maximum ofmaxsplit+1
elements.
maxsplit
when not specified defaults to-1
, which implies that there is no limit on the number of splits.
>>>"Hi|Ed|Punk|v2".split('|',2)['Hi','Ed','Punk|v2']
Strip methods are useful in removing leading and/or trailing characters in a string.
They accept an optional argumentchars
which specifies the set of characters which have to be removed.
If no arguments are provided thenchars
defaults to ASCII whitespace and the method removes all leading and/or trailing spaces from the string.
All leading (on the left) characters are removed from the string.
>>>" Hello|World ".lstrip()'Hello|World '>>>"www.edpunk.cc".lstrip('w.')'edpunk.cc'
All trailing (on the right) characters are removed from the string.
>>>" Hello|World ".rstrip()' Hello|World'>>>"www.edpunk.cc".rstrip('.c')'www.edpunk'
All leading and trailing characters are removed from the string.
>>>" Hello|World ".strip()'Hello|World'>>>"www.edpunk.cc".strip('cw.')'edpunk'
startswith()
andendswith()
methods are used check whether a string starts or ends with the provided substring (or a tuple of substrings).
>>>"Hello Py".startswith("He")True>>>"Hello Py".startswith(("He","P"))True>>>"Py Hello".startswith(("He","P"))True>>>"Hello Py".endswith("y")True>>>"Hello Py".endswith(("p","y"))True>>>"Py Hello".endswith(("o","n"))True
The following string methods are useful for locating substring in a string.
count(sub[, start[, end]])
returns the number of non-overlapping occurrences of a substringsub
in the range[start, end]
.
start
andend
are optional parameters and they default to0
andlen(string)
respectively.
>>>s="she sells sea shells">>>s.count("she")2>>>s.count("she",5)1>>>s.count("she",5,10)0>>>s.count("she",5,17)1
It has to be noted that the method counts non-overlapping occurrences, so it does not start a new matching process until the current substring matching is complete.
>>>s="valhala alala">>>s.count("al")4>>>s.count("ala")2
In the above example,ala
is counted twice as the first occurrence is invalh"ala"
and the next occurrence is in"ala"la
. Althoughala
can be located again inal"ala"
, it overlaps with the occurrence"ala"la
, hence it is not counted.
find(sub[, start[, end]])
returns the lowest index in the string where substringsub
is located in the range[start, end]
.
start
andend
are optional parameters and they default to0
andlen(string)
respectively.
The method returns-1
in case the substring is not present.
>>>s="she sells sea shells">>>s.find("she")0>>>s.find("she",5)14>>>s.find("see")-1>>>s.find("she",5,10)-1>>>s.find("she",5,17)14
rfind(sub[, start[, end]])
returns the highest index in the string where substringsub
is located in the range[start, end]
.
start
andend
are optional parameters and they default to0
andlen(string)
respectively.
The method returns-1
in case the substring is not present.
>>>s="she sells sea shells">>>s.rfind("she")14>>>s.rfind("she",0,12)0>>>s.rfind("see")-1>>>s.rfind("she",5)14
index(sub[, start[, end]])
is similar tofind(sub[, start[, end]])
, but instead of returning-1
it raisesValueError
when the substring is not found.
>>>s="she sells sea shells">>>s.index("she")0>>>s.index("she",5)14>>>s.index("see")Traceback (mostrecentcalllast):File"<stdin>",line1,in<module>ValueError:substringnotfound>>>s.index("she",5,10)Traceback (mostrecentcalllast):File"<stdin>",line1,in<module>ValueError:substringnotfound>>>s.index("she",5,17)14
s.rindex(sub[, start[, end]])
is similar torfind(sub[, start[, end]])
, but instead of returning-1
it raisesValueError
when the substring is not found.
>>>s="she sells sea shells">>>s.rindex("she")14>>>s.rindex("she",0,12)0>>>s.rindex("see")Traceback (mostrecentcalllast):File"<stdin>",line1,in<module>ValueError:substringnotfound>>>s.rindex("she",5)14
replace(oldsub, newsub[, count])
returns a copy of the string with all occurrences ofoldsub
substring replaced bynewsub
.
count
is an optional parameter which when provided, only replaces the firstcount
occurrences from the string.
>>>s="Oh Python! Oh">>>s.replace("Oh","Hi")'Hi Python! Hi'>>>s.replace("Oh","Hi",1)'Hi Python! Oh'
for
andwhile
statements are useful for traversing a string.
As a string is a sequence of characters, thefor
statement can be used to iterate over a string as shown below.
Code
name="python"forcharinname:print(char)
Output
python
Awhile
statement can be used to traverse a string by iterating on the value of index till the last character index.
Code
name="python"i=0whilei<len(name):print(name[i])i+=1
Output
python
The most common and widely used collections in Python arelists which store an ordered group of objects (of any datatype) which might have some logical relation. This marks a considerable difference from arrays (in traditional languages) and makes Python an ideal language for handling real-life data which is not type-bound.
Let us create a list of attributes of a particular vehicle available for sale in a car dealership:
>>>l= ["BMW","Z4",2019, ...4,"Red",True]
In this list:
"BMW"
is the make of the vehicle,"Z4"
is the model of the vehicle,2019
is the year when the vehicle was manufactured,4
represents the number of wheels,"Red"
is the color of the vehicle, andTrue
tells us that the vehicle up for sale is brand new.
This method of creating a list from a collection of literals is known aslist display.
Notice, how this list contains items of multiple data types -str
,int
andbool
.
Apart from the list display shown above, the built-inlist()
function can also be used to create new lists.
If no arguments are provided to thelist()
function, an empty list is created.
>>>l=list()>>>l[]
If a string, tuple or set is passed as an argument,list()
functions converts them into a list.
>>>l=list("abcd")>>>l['a','b','c','d']>>>l=list({1,2,3})>>>l[1,2,3]>>>l=list((1,2,3,4))>>>l[1,2,3,4]
Lists use zero-based indexing, where 0 is the index of the first element andlen(list) - 1
is the index of the last element.
For example,
>>>l= ["BMW","Z4",2019, ...4,"Red",True]>>>l[0]'BMW'>>>l[2]2019
The built-inlen()
function can be used to return the length of the list.
>>>l= ["BMW","Z4",2019, ...4,"Red",True]>>>len(l)6
Just likestr
, sequence types likelist
support negative indexing, i.e., you can access the values of list from the end.
Index of-1
denotes the last item in the list,-2
the second last item and so forth.
>>>l= ["BMW","Z4",2019, ...4,"Red",True]>>>l[-1]True>>>l[-3]4
As lists are mutable, the assignment operator can be used to modify or replace a single item in the list at a given index.
>>>l= ["BMW","Z4","Red"]>>>l[2]="Black">>>l['BMW','Z4','Black']
del
statement can be used to delete an item at a given index of a list.
>>>l= ["BMW","Z4","Red"]>>>dell[1]>>>l['BMW','Red']
We can perform various operations on a list (collection of items) such as membership, concatenation, repetition and slicing.
in
andnot in
can be used to detect the membership of an item in a list.
>>>l= ["BMW","Z4","Red"]>>>"Red"inlTrue>>>"Gold"inlFalse>>>"Gold"notinlTrue
+
operator can be used to join two lists to create a new list.
>>>l= ["BMW","Z4"]>>>k= [2019,"Red"]>>>l+k['BMW','Z4',2019,'Red']
+=
operator joins two lists and assigns it to the target list.
>>>l= ["BMW","Z4"]>>>l+= [2019,"Red"]>>>l['BMW','Z4',2019,'Red']
The*
operator repeats the items of a list the number of times as specified by the integer operand.
>>>l= [1,2]>>>l*3[1,2,1,2,1,2]
A subset of listl
can be obtained using the list slice notation given asl[i:j]
, where the item at start indexi
is included, but the item at end indexj
is excluded.
For example, the slice notation[1:4]
refers to items from index1
to index3
(i.e.4-1
).
>>>l= ["BMW","Z4",2019, ...4,"Red",True]>>>l[1:4]['Z4',2019,4]
The slice notationl[i:j:k]
can also include a third number known as the stepper. Here, a list is sliced from start indexi
to end index (j
) - 1 with a step ofk
items.
>>>l= ["BMW","Z4",2019, ...4,"Red",True]>>>l[1:4:2]['Z4',4]
Slice notations also have some useful defaults. 0 is the default for the first number and size of the list is the default for the second number.
>>>l= ["BMW","Z4",2019, ...4,"Red",True]>>>l[2:][2019,4,'Red',True]>>>l[:4]['BMW','Z4',2019,4]
Slice notation also supports negative indexing.
>>>l= ["BMW","Z4",2019, ...4,"Red",True]>>>l[-4:][2019,4,'Red',True]>>>l[:-2]['BMW','Z4',2019,4]>>>l[-4:-1][2019,4,'Red']
Slice notation can be used to replace multiple items in a list.
>>>l= ["BMW","Z4",2019, ...4,"Red",True]>>>l[:2]= ["Kia","Sonet"]>>>l['Kia','Sonet',2019,4,'Red',True]>>>l= ["BMW","Z4",2019, ...4,"Red",True]>>>l[1:5:2]= ["Sonet",2]>>>l['BMW','Sonet',2019,2,'Red',True]
Slice notation can also be used to delete multiple items in a list.
>>>l= ["BMW","Z4",2019, ...4,"Red",True]>>>dell[:2]>>>l[2019,4,'Red',True]>>>l= ["BMW","Z4",2019, ...4,"Red",True]>>>dell[:5:2]>>>l['Z4',4,True]
List Traversal is the process of visiting every item in a list, usually from the first item to the last item, and executing some instruction on the accessed item.
Python provides 2 ways to traverse a list:
As a list is an ordered collection of items, thefor
statement can be used to directly and sequentially access each item.
Code
l= ["BMW","Z4",2019,4,"Red",True]foriteminl:print(item)
Output
BMWZ420194RedTrue
In location based or index based traversal the value of index starts at0
and increments as long as it is lesser than the length of the list.
This index value can be used to access the item at that index in a list using the index operator[]
.
for
statement can be used to iterate over the index of the list using therange()
andlen()
functions.
Code
l= ["BMW","Z4",2019,4,"Red",True]foriinrange(len(l)):print(i,l[i])
Output
0 BMW1 Z42 20193 44 Red5 True
This method is also useful when you need to modify the items of a list (without altering the length of list) during traversal.
For example, let us convert each item of the list into a string.
Code
l= ["BMW","Z4",2019,4,"Red",True]foriinrange(len(l)):# modifying item valuel[i]=str(l[i])print(l)
Output
['BMW', 'Z4', '2019', '4', 'Red', 'True']
Awhile
statement can be used to traverse a list by iterating on the value of index till the last item index.
Code
l= ["BMW","Z4",2019,4,"Red",True]i=0whilei<len(l):print(l[i])i+=1
Output
BMWZ420194RedTrue
Built-in functionsmax()
,min()
andsum()
are used to calculate the maximum, minimum and sum of items in a list, respectively.
>>>l= [4,8,2,1,3]>>>min(l)1>>>max(l)8>>>sum(l)18
List objects have access to several specialized functions (methods) that can be used to:
- Add Items to a List
- Remove Items from a List
- Count or Locate Items in a List
- Sort a List
- Reverse a List
- Copy the Contents of a List
Let us discuss these methods in detail in the coming sections.
append()
method adds an item (passed as an argument) to the end of a list.
>>>l= ["T","C",2,4]>>>l.append(2.5)>>>l['T','C',2,4,2.5]
If the item has to be added at a particular index,insert()
method can be used.
>>>l= ["T","C",2,4]>>>l.insert(3,2.5)>>>l['T','C',2,2.5,4]
extend()
method appends each item of the argument list at the end of the target list.
>>>l= ["T","C",2,4]>>>l.extend([2.5,"SE"])>>>l['T','C',2,4,2.5,'SE']
clear()
method removes all items from the list.
>>>l= ["T","C",2,4,"S"]>>>l.clear()>>>l[]
remove()
method removes the first occurrence of an item from the list.
>>>l= ["Hi","Ed","Py","Hi"]>>>l.remove("Hi")>>>l['Ed','Py','Hi']
If an index is provided,pop()
method removes the item at that index, else the last item is removed from the list.
>>>l= ["T","C",2,4,"S"]>>>item=l.pop()>>>item'S'>>>l['T','C',2,4]>>>l= ["T","C",2,4,"S"]>>>item=l.pop(2)>>>item2>>>l['T','C',4,'S']
An itemx
can be located in a list using theindex(x[, i[, j]])
method which returns the first occurrence of the item at or after indexi
and before indexj
.
In casei
andj
are not specified they default toi=0
andj=len(l)
.
>>>l= [34,4,6,23,4]>>>l.index(4)1>>>l.index(4,3)4>>>l.index(6,1,4)2
count()
method can be used to count the occurrence(s) of an item in a list.
>>>l= [34,4,6,23]>>>l.count(4)1>>>l= ["it","is","it","I"]>>>l.count("it")2
reverse()
method can be used to reverse a list in-place.
>>>l= ["T","C",2,4,"S"]>>>l.reverse()>>>l['S',4,2,'C','T']
If you do not wish to modify the existing list and create a new list with items in reverse order, use the built-in functionreversed()
nested in the built-inlist()
.
>>>l= ["T","C",2,4,"S"]>>>new_l=list(reversed(l))>>>new_l['S',4,2,'C','T']>>>l['T','C',2,4,'S']
Python lists have a built-insort()
method which sorts the items in-place using<
comparisons between items.
The method also accepts 2 key-worded arguments:
key
is used to specify a function which is called on each list element prior to making the comparisons.reverse
is a boolean which specifies whether the list is to be sorted in descending order.
>>>l= [34,4,6,23]>>>l.sort()>>>l[4,6,23,34]>>>l= [34,4,6,23]>>>l.sort(reverse=True)>>>l[34,23,6,4]>>>l= ["Oh","Hi","Py","ed"]>>>l.sort()>>>l['Hi','Oh','Py','ed']>>>l= ["Oh","Hi","Py","ed"]# lowercase the words before sorting>>>l.sort(key=str.lower)>>>l['ed','Hi','Oh','Py']
If you do not wish to modify the existing list and create a new list with sorted items in, use the built-insorted()
function which returns a new sorted list.
>>>l= [34,4,6,23]>>>new_l=sorted(l)>>>new_l[4,6,23,34]
In Python, we can create an object (data) and bind its reference to a variable using the assignment operator (=
).
As multiple collections or items in collections can point to the same mutable object, a copy is required so one can change one copy without changing the other.
Let us take an example:
>>>old_l= [1,2,3]# Copying old list into a new list>>>new_l=old_l# Checking if both lists are# pointing to the same object>>>id(new_l)==id(old_l)True# Adding element to new list>>>new_l.append(4)>>>new_l[1,2,3,4]>>>old_l[1,2,3,4]
It can be seen how the assignment operatordoes not create a new copy of the list.
Thecopy()
method can be used to create a newlist
containing the items of the original list.
>>>old_l= [1,2,3]# Copying old list into a new list>>>new_l=old_l.copy()# Checking if both lists are# pointing to the same object>>>id(new_l)==id(old_l)False# Adding element to new list>>>new_l.append(4)>>>new_l[1,2,3,4]>>>old_l[1,2,3]
Assigning a slice of the entire list ([:]
) is also equivalent to creating a new copy.
>>>old_l= [1,2,3]# Copying old list into a new list# using slice notation>>>new_l=old_l[:]# Checking if both lists are# pointing to the same object>>>id(new_l)==id(old_l)False# Adding element to new list>>>new_l.append(4)>>>new_l[1,2,3,4]>>>old_l[1,2,3]
A list of lists is also known as a nested list.
Previously, we learnt how to create a list of attributes of a particular vehicle available for sale in a dealership.
Let us now create a nested list containing some details of vehicles available for sale.
# Make Model Year No_of_wheelsl= [["Kia","Sonnet",2019,4], ["Toyota","Camry",2018,4], ["BMW","Z4",2015,4], ["BMW","S1000",2016,2], ["KTM","390",2019,2], ]
Thus, nested list is useful in representing a dataset where each item is a list (datum) containing the attributes of an observation.
l[i][j]
is the syntax to fetch thej+1th
item of the list at indexi
of the nested listl
.
>>>l[2][1]'Z4'
A short and succinct way of creating a new list by transforming an existing list is via list comprehension.
Also, due to the optimizations present in the Python interpreter, there are performance benefits of using list comprehension.
List comprehension can be used to make a new list where each element is the result of some operations applied to each member of another sequence or iterable.
For example, to create a new list where each item is squared.
>>>l= [2,3,4,5,6,8]>>>l2= [i**2foriinl]>>>l2[4,9,16,25,36,64]
List comprehension can also be used to create a subsequence of those elements that satisfy a certain condition.
For example, to create a new list where each item is divisible by 2.
>>>l= [2,3,4,5,6,8]>>>l2= [iforiinlifi%2==0]>>>l2[2,4,6,8]
Write a program to find the minimum, maximum and mean of values stored in a list input by user.
a) Without Using Built-in Functions
Code
#create an empty listl= []n=int(input("Enter length of list: "))foriinrange(n):val=float(input("Enter item: "))#append marks in the listl.append(val)# Set first item as the default min valueminval=l[0]# Set first item as the default max valuemaxval=l[0]# Store sum of itemss=0forvalinl:# check if item value is smaller than current min valifval<minval:minval=val# check if item value is greater than current max valifval>maxval:maxval=vals+=valprint("Minimum :",minval)print("Maximum :",maxval)print("Mean :",s/n)
Output
Enter length of list: 5Enter item: 3Enter item: 2Enter item: 6Enter item: 9Enter item: 1Minimum : 1.0Maximum : 9.0Mean : 4.2
b) Using Built-in Functions
Code
#create an empty listl= []n=int(input("Enter length of list: "))foriinrange(n):val=float(input("Enter item: "))#append marks in the listl.append(val)print("Minimum :",min(l))print("Maximum :",max(l))print("Mean :",sum(l)/n)
Output
Enter length of list: 5Enter item: 3Enter item: 2Enter item: 6Enter item: 9Enter item: 1Minimum : 1.0Maximum : 9.0Mean : 4.2
Write a program to enter a list and then check if the number entered by a user is present in that list.
Code
#create an empty listl= []n=int(input("Enter length of list: "))foriinrange(n):val=input("Enter item: ")#append marks in the listl.append(val)search=input("Enter search value: ")found=Falseforiinrange(n):ifl[i]==search:found=Truebreakiffound:print("Search value located at index",i)else:print("Search value not found")
Output
Enter length of list: 4Enter item: 2Enter item: 6Enter item: 3Enter item: 7Enter search value: 3Search value located at index 2Enter length of list: 4Enter item: 2Enter item: 6Enter item: 3Enter item: 7Enter search value: 1Search value not found
Write a program to enter a list and then print the frequency of elements present in that list.
a) Without Using Built-in Functions
Code
#create an empty listl= []n=int(input("Enter length of list: "))foriinrange(n):val=input("Enter item: ")#append marks in the listl.append(val)# Let us create two lists# 1. to store unique itemsitems= []# 2. to store the count of unique itemscounts= []forvalinl:ifvalnotinitems:items.append(val)counts.append(1)else:foridxinrange(len(items)):ifitems[idx]==val:counts[idx]+=1print("Frequency of Elements")foridxinrange(len(items)):print(items[idx],"-",counts[idx])
Output
Enter length of list: 8Enter item: 2Enter item: 3Enter item: 3Enter item: 4Enter item: 2Enter item: 5Enter item: 6Enter item: 6Frequency of Elements2 - 23 - 24 - 15 - 16 - 2
b) Using Built-in Functions
Code
l= []n=int(input("Enter length of list: "))foriinrange(n):val=input("Enter item: ")#append marks in the listl.append(val)# List to store unique itemsitems= []print("Frequency of Elements")forvalinl:ifvalnotinitems:items.append(val)print(val,"-",l.count(val))
Output
Enter length of list: 8Enter item: 2Enter item: 3Enter item: 3Enter item: 4Enter item: 2Enter item: 5Enter item: 6Enter item: 6Frequency of Elements2 - 23 - 24 - 15 - 16 - 2
Tuples are like lists, with a difference - they are immutable. This means that once initialized a user cannot modify its value, which makes it a useful feature to ensure the sanctity of data and guarantee that it is not being modified by the program.
Tuples are declared using parentheses( )
, with successive items separated by a comma,
.
>>>l= ("Hi","Ed","Punk")>>> type(l)<class'tuple'>
Apart from the method shown above, the built-intuple()
function can also be used to create new tuples.
If no arguments are provided to thetuple()
function, an empty tuple is created.
>>>t=tuple()>>>t()
If a string, tuple or set is passed as an argument,tuple()
functions converts them into a tuple.
>>>t=tuple("abcd")>>>t('a','b','c','d')>>>t=tuple({1,2,3})>>>t(1,2,3)>>>t=tuple([1,2,3,4])>>>t(1,2,3,4)
In case a tuple has only 1 item, it is known as a singleton.
It is a good practice to include a trailing comma to avoid the Python interpreter from treating it as a value inside regular parentheses as shown in the examples below.
>>>regular_string= ("Hi")>>>regular_string'Hi'>>> type(regular_string)<class'str'>>>>regular_int= (1)>>>regular_int1>>> type(regular_int)<class'int'>>>>str_tuple= ("Hi", )>>> type(str_tuple)<class'tuple'>>>>int_tuple= (1, )>>> type(int_tuple)<class'tuple'>
Tuples use zero-based indexing, where 0 is the index of the first element andlen(tuple) - 1
is the index of the last element.
>>>t= ("BMW","Z4",2019, ...4,"Red",True)>>>t[0]'BMW'>>>t[2]2019
The built-inlen()
function can be used to return the length of the tuple.
>>>t= ("BMW","Z4",2019, ...4,"Red",True)>>>len(t)6
Just likelist
,tuple
supports negative indexing, i.e., you can access the values of tuple from the end. Index of -1 denotes the last item in the tuple, -2 is the second last item and so forth.
>>>t= ("BMW","Z4",2019, ...4,"Red",True)>>>t[-1]True>>>t[-3]4
Tuples are immutable, i.e., no modifications are permitted once it is created and any attempt to do so raises an error.
>>>t= ("BMW","Z4",2019)>>>t[1]="Charger"Traceback (mostrecentcalllast):File"<stdin>",line1,in<module>TypeError:'tuple'objectdoesnotsupportitemassignment
We can perform various operations on a tuple such as membership, concatenation, repetition and slicing.
in
andnot in
can be used to detect the membership of an item in a tuple.
>>>t= ("BMW","Z4",2019, ...4,"Red",True)>>>"Red"intTrue>>>"Gold"intFalse>>>"Gold"notintTrue
+
operator can be used to join two tuples to create a new tuple.
>>>t= ("BMW","Z4")>>>u= (2019,"Red")>>>t+u('BMW','Z4',2019,'Red')
The*
operator creates a new tuple with items of a tuple repeated the number of times as specified by the integer operand.
>>>t= (1,2)>>>t*3(1,2,1,2,1,2)
A subset of tuplet
can be obtained using the tuple slice notation given ast[i:j]
, where the item at indexi
is included, but the item at indexj
is excluded.
For example, the slice notation[1:4]
refers to items from index1
to index3
(i.e.4-1
).
>>>t= ("BMW","Z4",2019, ...4,"Red",True)>>>t[1:4]('Z4',2019,4)
The slice notationt[i:j:k]
can also include a third number known as the step. Here, a tuple is sliced from start indexi
to end index (j
) - 1 with a step ofk
items.
>>>t= ("BMW","Z4",2019, ...4,"Red",True)>>>t[1:4:2]('Z4',4)
Slice notations also have some useful defaults.
0
is the default for the first number and size of the tuple is the default for the second number.
>>>t= ("BMW","Z4",2019, ...4,"Red",True)>>>t[2:](2019,4,'Red',True)>>>t[:4]('BMW','Z4',2019,4)
Slice notation also supports negative indexing.
>>>t= ("BMW","Z4",2019, ...4,"Red",True)>>>t[-4:](2019,4,'Red',True)>>>t[:-2]('BMW','Z4',2019,4)>>>t[-4:-1](2019,4,'Red')
Tuple Traversal is the process of visiting every item in a tuple, usually from the first item to the last item, and executing some instruction on the accessed item.
Python provides 2 ways to traverse a tuple:
As a tuple is an ordered collection of items, thefor
statement can be used to directly and sequentially access each item.
Code
t= ("BMW","Z4",2019,4,"Red",True)foritemint:print(item)
Output
BMWZ420194RedTrue
In location based or index based traversal the value of index starts at0
and increments as long as it is lesser than the length of the tuple.
This index value can be used to access the item at that index in a tuple using the index operator[]
.
for
statement can be used to iterate over the index of the tuple using therange()
andlen()
functions.
Code
t= ("BMW","Z4",2019,4,"Red",True)foriinrange(len(t)):print(t[i])
Output
BMWZ420194RedTrue
Awhile
statement can be used to traverse a tuple by iterating on the value of index.
Code
t= ("BMW","Z4",2019,4,"Red",True)index=0whileindex<len(t):print(t[index])index+=1
Output
BMWZ420194RedTrue
Built-in functionsmax()
,min()
andsum()
can be used to calculate the maximum, minimum and sum of items in a tuple, respectively.
>>>t= (4,8,2,1,3)>>>min(t)1>>>max(t)8>>>sum(t)18
As compared to alist
,tuple
supports only methods that do not try to modify it.
An itemx
can be located in a tuple using theindex(x[, i[, j]])
method which returns the first occurrence of the item at or after indexi
and before indexj
. In casei
andj
are not specified they default toi=0
andj=len(t)
.
>>>t= (34,4,6,23,4)>>>t.index(4)1>>>t.index(4,3)4>>>t.index(6,1,4)2
count()
method can be used to count the occurrence(s) of an item in a tuple.
>>>t= (34,4,6,23,4)>>>t.count(4)2>>>t= ("it","is","it","I")>>>t.count("it")2
To create a new tuple with items in reverse order, use the built-in functionreversed()
nested in the built-intuple()
.
>>>t= ("T","C",2,4,"S")>>>new_t=tuple(reversed(t))>>>new_t('S',4,2,'C','T')>>>t('T','C',2,4,'S')
To create a new tuple with sorted items, use the built-insorted(x,[key, reverse])
function which returns a new sorted tuple.
The method optionally accepts 2 keyword (named) arguments:
key
which is used to specify a function which is called on each tuple element prior to making the comparisons, andreverse
is a boolean which specifies whether the tuple is to be sorted in descending order.
>>>t= (34,4,6,23,9)>>>tuple(sorted(t))(4,6,9,23,34)>>>tuple(sorted(t,reverse=True))(34,23,9,6,4)>>>t= ("Oh","Hi","Py","it")>>>tuple(sorted(t))('Hi','Oh','Py','it')# lowercase the words before sorting>>>tuple(sorted(t,key=str.lower))('Hi','it','Oh','Py')
A tuple of tuples is also known as a nested tuple.
Let us create a nested tuple containing the details of all vehicles available for sale.
# Make Model Year No_of_wheelst= ( ("Kia","Sonnet",2019,4), ("Toyota","Camry",2018,4), ("BMW","Z4",2015,4), ("BMW","S1000",2016,2), ("KTM","390",2019,2),)
t[i][j]
is the syntax to fetch thej+1 th
item of the tuple at indexi
of the nested tuplet
.
>>>t[2][1]'Z4'
As compared to a nested list, a nested tuple is useful in representing a dataset (such as a table fetched from a database) where it is important to ensure the sanctity of data as the code cannot modify it because tuples are immutable.
Just like lists, the+=
operation works for a tuple and adds new item(s) to it.
>>>t= ("Hi","Ed","Punk")>>>t+= (1,2)>>>t('Hi','Ed','Punk',1,2)
But tuples are immutable right?
Then how are we able to modify it.
To understand it better, let us revisit what happens when we apply+=
operator on a list.
>>>l= ["Hi","Ed","Punk"]>>>id_l=id(l)>>>l+= [1,2]>>>l['Hi','Ed','Punk',1,2]>>>id(l)==id_lTrue
In case of a list, the modified listl
still points at the same object.
Now, let us add items to a tuple.
>>>t= ("Hi","Ed","Punk")>>>id_t=id(t)>>>t+= (1,2)>>>t('Hi','Ed','Punk',1,2)>>>id(t)==id_tFalse
In case of a tuple, the modified tuple is actually a completely new tuple with contents of the original tuple and the extension.
The original tuple is not modified as it is immutable. But, ast
is no longer pointing to the original tuple, it is freed from memory.
Thus, it is recommended that instead of+=
,append()
andextend()
methods should be employed to add new items programmatically as it will raise an error in case the code is trying to modify a tuple.
>>>l= ["Hi","Ed","Punk"]>>>l.extend([1,2])>>>l['Hi','Ed','Punk',1,2]>>>t= ("Hi","Ed","Punk")>>>t.extend((1,2))Traceback (mostrecentcalllast):File"<stdin>",line1,in<module>AttributeError:'tuple'objecthasnoattribute'extend'
Python provides a mapping type collection which contains keys, and values corresponding to those keys.
This collection is known as adictionary
(typedict
) , where each key is unique and can be used to easily store or retrieve values (any data-type includingstring
,int
,float
,list
).
Dictionaries are indexed by keys (any immutable type - numbers, string, tuple) as compared to lists which are indexed by a range of numbers.
An empty dictionary can be created using a pair of braces -{}
or using the built-in functiondict()
which is also a constructor.
>>>d= {}>>> type(d)<class'dict'>>>>d=dict()>>>d{}>>> type(d)<class'dict'>
To create a new dictionary with some initial values, one can use:
A sequence ofkey: value
pairs separated by commas in braces -{}
.
>>>d= {"yr":20,"name":"Ed", ...18:True}>>>d{'yr':20,'name':'Ed',18:True}
A sequence of (key, value) tuples can be passed as an argument to thedict()
function.
>>>l= [("yr",20), ("name","Ed"), ... (18,True)]>>>d=dict(l)>>>d{'yr':20,'name':'Ed',18:True}
Keys and values can be passed as keyword arguments to thedict()
function.
>>>d=dict(yr=20,name="Ed", ...is18=True)>>>d{'yr':20,'name':'Ed','is18':True}
One of the limitations of this method is that the keys of the dictionary are only of type string and their names must be within the namespace of an identifier.
There might cases when there is a need to combine two lists where keys are in one list and the corresponding values are available in another list.
The built-inzip()
function is useful for combining two lists into tuple pairs.
The zipped iterator can be passed as an argument to thedict()
to create a new mapping collection.
>>>m= ["yr","name",18]>>>n= [20,"Ed",True]>>>list(zip(m,n))[('yr',20), ('name','Ed'), (18,True)]>>>dict(zip(m,n)){'yr':20,'name':'Ed',18:True}
dict.fromkeys(iterable[, value])
can be used to create new dictionary with items of aniterable
as keys with an optional value (default -None
) corresponding to the keys.
>>>l= ["yr","name",18]>>>dict.fromkeys(l){'yr':None,'name':None,18:None}>>>dict.fromkeys(l,0){'yr':0,'name':0,18:0}
d[x]
can be used to access the value corresponding to a keyx
in a dictionaryd
.
>>>d= {"yr":20,"name":"Ed", ...18:True}>>>d["yr"]20
get(key[, default])
method can also be used to fetch the value for a key if key is in the dictionary, else adefault
value is returned.
Ifdefault
is not provided,None
is the default value returned.
>>>d= {"yr":20,"name":"Ed", ...18:True}>>>d.get("name")'Ed'>>>d.get("version")isNoneTrue>>>d.get("version","1.0")'1.0'
This is an elegant method of fetching a value from a dictionary and it should be used in real life software codes as it never raises aKeyError
.
d[x] = y
can be used to add the valuey
corresponding to a keyx
in a dictionaryd
.
In case keyx
already exists, assignment updates or overwrites the value corresponding to it.
>>>d= {"yr":20,18:True}>>>d["name"]='Ed'>>>d{'yr':20,18:True,'name':'Ed'}>>>d["name"]='Py'>>>d{'yr':20,18:True,'name':'Py'}
update()
method can be used to add new items or update existing items in adict
.
This method accepts the following:
1. Dictionary
>>>d= {"yr":20,18:True}>>>d.update({"name":'Ed'})>>>d{'yr':20,18:True,'name':'Ed'}>>>d.update({"yr":15,18:False})>>>d{'yr':15,18:False,'name':'Ed'}
2. Sequence of (key, value) pairs
>>>d= {"yr":20,18:True}>>>d.update([("name",'Ed'), ... ('yr',15)])>>>d {'yr':15,18:True,'name':'Ed'}
3. Key-worded arguments
>>>d= {"yr":20,18:True}>>>d.update(name="Ed",yr=15)>>>d{'yr':15,18:True,'name':'Ed'}
setdefault(key[, default])
method returns a value corresponding to a key.
If the key is not present in the dictionary, it is inserted with the provideddefault
value.
If no default value is provided, the value is set asNone
.
>>>d= {"yr":20,18:True}>>>d.setdefault("name")>>>d{'yr':20,18:True,'name':None}>>>d= {"yr":20,18:True}>>>d.setdefault("name","Ed")'Ed'>>>d{'yr':20,18:True,'name':'Ed'}
del
keyword can be used to remove akey: value
pair.
>>>d= {"yr":20,"name":"Ed", ...18:True}>>>deld["name"]>>>d{'yr':20,18:True}
clear()
method can be used to clear all values of a dictionary.
>>>d= {"yr":20,"name":"Ed", ...18:True}>>>d.clear()>>>d{}
pop(key[, default])
method can be used to remove akey: value
pair.
In case the key is not present in the dictionary, the method returns the value of argumentdefault
provided by the user.
>>>d= {"yr":20,"name":"Ed", ...18:True}>>>d.pop("name")'Ed'>>>d{'yr':20,18:True}>>>d.pop("name")Traceback (mostrecentcalllast):File"<stdin>",line1,in<module>KeyError:'name'>>>d.pop("name","Punk")'Punk'
popitem()
can be used to destructively iterate over a dictionary removing and returning a(key, value)
pair in LIFO (Last Item First Out) order.
>>>d= {"yr":20,"name":"Ed", ...18:True}>>>d.popitem()(18,True)>>>d.popitem()('name','Ed')>>>d.update({"name":"py"})>>>d{'yr':20,'name':'py'}>>>d.popitem()('name','py')>>>d.popitem()('yr',20)>>>d{}
in
keyword can be used to test the presence of akey
in a dictionary.
>>>d= {"yr":20,18:True}>>>"yr"indTrue
Similarly,not in
can be used to determine the absence of akey
.
>>>"name"notindTrue
The|
(union) operator can be used to merge two dictionaries.
>>>d= {"yr":20,18:True}>>>n= {"name":"Ed"}>>>d|n{'yr':20,18:True,'name':'Ed'}
In case of common keys between the two operands (dictionaries), the values of the operand on the right takes priority.
>>>d= {"yr":20,18:True}>>>n= {"yr":15,"name":"Ed"}>>>d|n{'yr':15,18:True,'name':'Ed'}
In the above example, bothd
andn
shared a common keyyr
. The value corresponding toyr
inn
gets priority.
The|=
(union) augmented assignment operator can be used to update the dictionary with keys and values from another dictionary or an iterable of (key, value) pairs.
>>>d= {"book":"Python","year":1990}>>>dnew= {"author":"Guido"}>>>d|=dnew>>>d{'book':'Python','year':1990,'author':'Guido'}
If both the dictionaries share a common key, then the value corresponding to the key in the right operand is the updated value.
>>>d= {"book":"Python","year":1990}>>>dnew= {"author":"Guido","year":2000}>>>d|=dnew>>>d{'book':'Python','year':2000,'author':'Guido'}
In the above example, bothd
anddnew
shared a common keyyear
. The value (1990
) corresponding toyear
ind
is updated with the new value.
The|=
operator also works in case the right operand is an iterable instead of a dictionary as shown in the example below.
>>>d= {"book":"Python","year":1990}>>>inew= [("author","Guido"), ("year",2000)]>>>d|=inew>>>d{'book':'Python','year':2000,'author':'Guido'}
Compared to position based indexing oflist
ortuple
, dictionaries are indexed based on the key.
for
loop can be used to directly iterate over the keys of adict
.
d= {"yr":20,"name":"Ed",18:True}forkeyind:print(key,":",d[key])
Output
yr : 20name : Ed18 : True
Dictionary methodsitems()
,keys()
andvalues()
can be used to fetch the items, keys and values in a dictionary, respectively.
>>>d= {"yr":20,"name":"Ed", ...18:True}>>>d.keys()dict_keys(['yr','name',18])>>>d.values()dict_values([20,'Ed',True])>>>d.items()dict_items([('yr',20), ('name','Ed'), (18,True)])
dict_keys
,dict_values
anddict_items
are iterables that can be used in afor
statement to traverse the dictionary.
Code
d= {"yr":20,"name":"Ed",18:True}forkeyind.keys():print(key)
Output
yrname18
Code
d= {"yr":20,"name":"Ed",18:True}forvalueind.values():print(value)
Output
20EdTrue
Code
d= {"yr":20,"name":"Ed",18:True}forkey,valueind.items():print(key,":",value)
Output
yr : 20name : Ed18 : True
len()
can be used to fetch the number of items in the dictionary.
>>>d= {"yr":20,"name":"Ed", ...18:True}>>>len(d)3
Themin()
andmax()
built-in functions return the minimum and maximum value among the keys of a dictionary.
In case of string keys, they return the first and last occurring strings alphabetically.
>>>d= {"yr":20,"name":"Ed", ..."is18":True}>>>min(d)'is18'>>>max(d)'yr'>>>d= {2:"b",3:"c",1:"a"}>>>min(d)1>>>max(d)3
To get the list of all keys in a dictionary, use thelist()
built-in function.
>>>d= {"book":"Python","year":1990,"author":"Guido"}>>>list(d)['book','year','author']
To get a sorted list of keys, you can use thesorted()
built-in function.
>>>d= {"book":"Python","year":1990,"author":"Guido"}>>>sorted(d)['author','book','year']
A new copy of a dictionary can be made usingcopy()
method:
>>>old_d= {"a":1,"b":2}>>>new_d=old_d.copy()# Checking if both dicts are# pointing to the same object>>>new_disold_dFalse>>>new_d["c"]=3>>>new_d["b"]=4>>>new_d{'a':1,'b':4,'c':3}>>>old_d{'a':1,'b':2}
When the value in akey:value
pair of a dictionary is of typedict
, it is known as a nested dictionary.
In simple terms, it is a dictionary inside a dictionary.
Records of a table can be represented as a nested dict with primary key as the outer dict keys.
d= {1: {"name":"Amar","age":19},2: {"name":"Ria","age":20},3: {"name":"Rao","age":18}}print(d[1])print(d[2]["name"])
Output
{'name':'Amar','age':19}Ria
Write a program to count the number of times a character appears in a given string.
Code
d= {}s=input("Enter string: ")forcins:ifcind:d[c]+=1else:d[c]=1print("Frequency of characters")forkeyind:print(key,"-",d[key])
Output
Enter string: pythondataFrequency of charactersp - 1y - 1t - 2h - 1o - 1n - 1d - 1a - 2
Write a program to enter employee names and their salaries as input and store them in a dictionary.
Code
n=int(input("Enter number of employees: "))emp=dict()forcntinrange(n):name=input("Enter employee name: ")salary=int(input("Enter salary: "))emp[name]=salaryprint("Employee Name & Salary")forkeyinemp:print(key,'-',emp[key])
Output
Enter number of employees: 3Enter employee name: Anil RajEnter salary: 12000Enter employee name: Dinesh KumarEnter salary: 15000Enter employee name: Sam SinghEnter salary: 10000Employee Name & SalaryAnil Raj - 12000Dinesh Kumar - 15000Sam Singh - 10000
Python has a rich and extensiveStandard Library which gives it an edge over traditional programming languages.
Python Standard Library contains 69 built-in functions that are commonly used by programmers which saves a lot of time as they can be directly used in a program.
We have already used some functions in the previous sections likeinput()
,output()
,len()
,sum()
,min()
,max()
,list()
,dict()
, etc.
Some of the widely used functions can be categorized as follows:
- Mathematical Functions
- Type Functions
- Input/Output Functions
- Base/Unicode Functions
abs(x)
returns the absolute value or magnitude ofx
of typeint
,float
orcomplex
number.
>>>abs(-2.3)2.3>>>abs(-10)10>>>abs(2+2j)2.8284271247461903
round(x [,ndigits])
rounds offx
(float
) tondigits
precision after the decimal point.
In casendigits
is not provided, it rounds off to the nearest integer.
>>>round(2.33462398)2>>>round(2.33462398,3)2.335
sum(sequence [,start])
returns the sum of items in asequence
of typelist
,tuple
,range
orset
.
If a second argumentstart
is provided, it is added to the sum.
>>>sum([1,2,-1])2>>>sum((1,2,-1))2>>>sum([1,2,-1],10)12
min(sequence)
returns the minimum value of items in asequence
of typelist
,tuple
,range
,str
orset
.
Apart from iterables,min(arg1, arg2, ..)
also accepts multiple argumentsarg1
,arg2
.. of numeric type and returns the smallest among these arguments.
>>>min([1,2,-1])-1>>>min(1,2,-1)-1# character with minimum# ASCII value in a String>>>min("hello")'e'
max(sequence)
returns the maximum value of items in asequence
of typelist
,tuple
,range
,str
orset
.
Apart from iterables,max(arg1, arg2, ..)
also accepts multiple argumentsarg1
,arg2
.. of numeric type and returns the largest among these arguments.
>>>max([1,2,-1])2>>>max(1,2,-1)2# character with maximum# ASCII value in a String>>>max("hello")'o'
pow(base, exp [,mod])
raises abase
(int
,float
) to the powerexp
(int
,float
), i.e.base**exp
.
Ifmod
(int
) is provided(base**exp) % mod
is returned.
>>>pow(3,2,7)2>>>pow(1.4141,2)1.9996788099999998
divmod(a, b)
returns a tuple(a // b, a % b)
consisting of the quotient and remainder whena
(int
,float
) is divided byb
(int
,float
).
>>>divmod(23,3.5)(6.0,2.0)>>>divmod(-10,7)(-2,4)
Python has built-in functions to handle various aspects of datatypes such as checking and conversion.
type()
andisinstance()
builtin functions are used for checking the data type of objects.
Check outType Checking section in the chapterVariable, Objects & Data Types to learn more about it in detail.
We have previously discussed the various built-in data types available in Python.
The following functions are often used to assign a default value or create an empty collection when no arguments are provided:
Function | Default Value |
---|---|
bool() | False |
int() | 0 |
float() | 0.0 |
complex() | 0j |
str() | '' |
list() | [] |
tuple() | () |
set() | set() |
frozenset() | frozenset() |
dict() | {} |
TheType Casting section of the chapterVariable, Objects & Data Types covers these functions in detail in case any argument is passed.
Python provides the following built-in functions to handle user input and result output:
input()
function can be used to accept a user input and assign it to a variable.
When this function is encountered in the code, the python interpreter waits for the user to type a response which is read as a string.
Code
name=input("Enter name: ")print("Hello,",name)
Output
Enter name: PythonHello, Python
The built-inprint()
function can be used to display an output (value of variables, expressions, etc.) on the standard output.
For example, the below code computes the area of a rectangle and displays it:
Code
length=10breadth=5area=length*breadthprint("Area:",area)
Output
Area: 50
Both,input()
andprint()
functions are covered in detail in the chapterInput & Output.
open()
function is used to open a file and return the corresponding file object.
This function supports various modes useful for reading from a file and writing to a file.
An example usage ofopen()
function is shown below:
f=open('data.txt','r+')# f is the file object which# can perform read and write# operations on the file
This function is covered in detail in the chapterFile Handling.
Converts anint
to a binary (base-2) string prefixed with'0b'
.
>>>bin(9)'0b1001'
Converts anint
to an octal (base-8) string prefixed with'0o'
.
>>>oct(9)'0o11'
Converts anint
to a lowercase hexadecimal (base-16) string prefixed with"0x"
.
>>>hex(29)'0x1d'
Returns the integer Unicode code point for the given Unicode characterstr
.
>>>ord("a")97>>>ord("β")946
Converts the integer Unicode code point into the corresponding Unicode string.
>>>chr(97)'a'>>>chr(946)'β'
Apart from built-in functions, thePython Standard Library also contains a wide range of built-in modules which are a group of functions organized based on functionality.
Some commonly used modules are:
math
- Mathematical functionsrandom
- Generate pseudo-random numbersstatistics
- Statistical functionscopy
- Create shallow and deep copy of objects
Accessing Modules
A module can be accessed in a program using theimport
statement.
>>>importmath
The aboveimport
statement loads all the functions available in themath
module. To access any function in the module, simply type the module name followed by a period (.
), followed by the function name.
>>>importmath>>>math.pow(3,2)9.0
Instead of loading all the functions in a module, thefrom
statement can be used to access only specified functions.
>>>frommathimportpow>>>pow(3,2)9.0
math
module provides mathematical functions and constants as defined by the C standard.
The following frequently used mathematical constants are available inmath
module:
pi
:π = 3.141592…
(to available precision)e
:e = 2.718281…
(to available precision)tau
:τ = 2π = 6.283185…
(to available precision)
>>>importmath>>>math.pi3.141592653589793>>>math.e2.718281828459045>>>math.tau6.283185307179586
Following useful mathematical functions are available in themath
module:
fabs()
fabs(x)
returns the absolute value ofx
.
>>>importmath>>>math.fabs(-3.24)3.24
gcd()
gcd(a, b)
returns the greatest common divisor of integersa
andb
.
>>>importmath>>>math.gcd(54,24)6
ceil()
ceil(x)
returns the smallest integer greater than or equal tox
.
>>>importmath>>>math.ceil(2.13)3
floor()
floor(x)
returns the largest integer less than or equal tox
.
>>>importmath>>>math.floor(2.13)2
fmod()
fmod(x, y)
returns the value of the expressionx - n*y
such that the result has the same sign asx
and magnitude less than|y|
for some integern
.
This function should be preferred when working with floating point numbers as compared tox % y
that should be used when working with integers.
>>>importmath>>>math.fmod(2.14,0.5)0.14000000000000012>>>math.fmod(-2.14,0.5)-0.14000000000000012
pow()
pow(x, y)
raisesx
to the power ofy
.
>>>importmath>>>math.pow(5,2)25.0
sqrt()
sqrt(x)
returns the square root ofx
.
>>>importmath>>>math.sqrt(25)5.0
sin(), cos() & tan()
sin(x)
,cos(x)
andtan(x)
return the sine, cosine and tangent ofx
(radians) respectively.
>>>importmath>>>math.cos(math.pi/3)0.5000000000000001>>>math.sin(math.pi/2)1.0>>>math.tan(math.pi/4)0.9999999999999999
factorial()
factorial(n)
computes the factorial of a positive integern
which is the product of all positive integers less than or equal ton
.n! = n×(n-1)×(n-2)...3×2×1
,
where0! = 1
>>>importmath>>>math.factorial(5)120
random
module provides access to functions which can generate random numbers.
random()
function draws a random float in the half open interval[0.0, 1.0)
with a uniform probability.
>>>importrandom>>>random.random()0.6857133962949904
randrange(stop)
is used to randomly select an integer from a range0
tostop
(excluding).
>>>importrandom>>>random.randrange(10)5
randrange(start, stop)
is used to randomly select an integer from a rangestart
tostop
(excluding).
>>>importrandom>>>random.randrange(5,10)8
randint(a, b)
is an alias forrandrange(a, b+1)
, provides an interface to generate a random integerN
such thata <= N <= b
(includes boundaries).
>>>importrandom>>>random.randint(10,20)12
statistics
module provides functions that are useful for calculating mathematical statistics of numeric (Real-valued) data.
mean(data)
returns the arithmetic mean or average value ofdata
(list
ortuple
ofint
orfloat
).
>>>fromstatisticsimportmean>>>mean([1,2,3,6])3
median(data)
returns the median or middle value ofdata
(list
ortuple
ofint
orfloat
) using the common "mean of middle two" method.
>>>fromstatisticsimportmedian>>>median([1,2,3,6])2.5
mode(data)
returns the most commonly occurring value in thedata
(list
ortuple
ofint
,str
orfloat
).
If there are multiple modes with the same count, the first occurrence in the sequence is returned.
>>>fromstatisticsimportmode>>>mode([1,2,3,6])1>>>mode([1,2,2,3,6])2>>>mode(["a","b","a","c"])'a'
copy()
method does not recurse to create copies of the child objects, so if the child objects are mutable (example nested list) any modification in the child object will get reflected in the both the parent objects.
>>>old_list= [[1,2,3], [4,5,6], [7,8,9]]# Copying old list into a new list>>>new_list=old_list.copy()# Checking if both lists are pointing to the same object>>>id(new_list)==id(old_list)False# Checking if items of both lists are pointing to the same objects>>> [id(new_list[idx])==id(old_list[idx])foridxinrange(len(old_list))][True,True,True]# Modify new list>>>new_list[1][1]=0>>>new_list[[1,2,3], [4,0,6], [7,8,9]]>>>old_list[[1,2,3], [4,0,6], [7,8,9]]
As we can see in the output,new_list[1][1]
was modified which is reflected in bothnew_list
andold_list
.
Thecopy
module provides thedeepcopy()
function which is helpful in mitigating this issue.
Deep copy overcomes the shortcomings ofcopy()
and recursively creates copies of the child objects found in the original list. This leads to the creation of an independent copy of the original.
>>>importcopy>>>old_list= [[1,2,3], [4,5,6], [7,8,9]]# Copying old list into a new list>>>new_list=copy.deepcopy(old_list)# Checking if both lists are pointing to the same object>>>id(new_list)==id(old_list)False# Checking if items of both lists are pointing to the same objects>>> [id(new_list[idx])==id(old_list[idx])foridxinrange(len(old_list))][False,False,False]# Modify new list>>>new_list[1][1]=0>>>new_list[[1,2,3], [4,0,6], [7,8,9]]>>>old_list[[1,2,3], [4,5,6], [7,8,9]]
Note the change in theid()
equality of the children for[True, True, True]
to[False, False, False]
. As we can see in the output,new_list[1][1]
was modified which gets reflected only in thenew_list
.
So far we have written programs which accept input from the user via keyboard and display the generated output on the standard output (console).
This activity has the following disadvantages:
- Feasible for small inputs, but cumbersome in case of large inputs.
- The entire input has to be entered every time the program is executed.
- The output generated is not saved for sharing or later use.
Just like words can be written on a piece of paper, information can be stored on a computer disk as a named location or a file.
Python comes with the in-built ability to interact with files. Through file handling, Python programs can read a file to provide input data and output the results into a file for later use.
Primarily, there are two types of files:
Text File
A text file is the simplest way to store information in a human readable text format (sequence of ASCII or Unicode characters), which includes numeric values.
It is easy to view and modify the contents of a text file using any text editor like notepad or even IDLE.
Even the python script is stored as a text file given by the extension.py
..txt
is the most popular extension used for a generic text file.
Although text files do not have any structure, there are international standards defining some rules for creating specialized text files like:
.csv
, where each row is a set of comma separated values. This is the most popular data exchange format..tsv
, similar to CSV but the values are separated by tabs instead of commas..xml
and.json
are popular web data exchange formats..html
files are text files with contents written in the Hypertext Markup Language designed to display contents in a web browser.
Binary File
Although the contents of a text file is human readable, in reality this information is stored on the disk in form of machine readable bytes (1s and 0s) one character at a time. This makes the format simple to be operated by any application, but it is less efficient and consumes more memory (greater file size).
In a binary file, the contents are also stored in form of bytes, but these bytes do not directly translate into ASCII or Unicode characters.
Rather these bytes can represent anything like:
- Complex Data Structures
- Image
- Audio
- Video
and more.
As there is no simple rule to determine the basic unit of information, so opening a binary file in a text editor will display garbage values and even a single bit of change can corrupt the entire file and make it unreadable. Hence, specialized softwares are required to read and write binary files.
Python has a built-inpickle
module which has implemented protocols to read and write binary files having.dat
or.pickle
file extension.pickle
is a Python specific binary file format which can serialize any Python data structure (lists, dictionaries, etc) or code object into a binary file. This byte-content of this file can then be de-serialized and used later in any computer running Python.
A text or binary file can be opened using the built-inopen(file_name, mode='r')
function.
The parameters of the function are explained below:
file_name
file_name
is a required string parameter specifying the path of the file to be opened.
It can either be an absolute path or a relative path as shown below:
'fname.txt'
is the relative path of a text file residing in the current working directory from where the python script is being executed.'../fname.txt'
is the relative path of a text file outside the current directory where the python script is being executed.'/Users/edpunk/Documents/fname.txt'
is the absolute path of a text file which can be opened by the python script from any location as long as it is in the same system.
mode
mode
is an optional string parameter which specifies the mode in which the file has to be opened. It defaults to'r'
which means open for reading in text mode.
The available modes are:
mode | purpose |
---|---|
'r' | Opens the file for reading (default). |
't' | Opens the file in text mode (default). |
'w' | Opens the file for writing, truncating (emptying) the file if it already exists. |
'x' | Same as'w' , but it fails if the file already exists. |
'a' | Opens the file for writing, where any new data is added at the end. It creates a new file if the file does not exists. |
'b' | Opens the file in binary mode. |
'rb' | Opens the file in binary and read-only mode. |
'wb' | Opens the file for writing in binary mode, truncating (emptying) the file if it already exists. |
'+' | Allows both read and write operations on a file. |
'r+' | Opens the file in read and write mode. It throws an error in case the file does not exists. If the file already exists, new data is overwritten on top of the existing data if the position of stream is not moved to the end of the file. |
'r+b' | Opens the file in binary read and write mode. It does not truncate the file if it already exists. |
'w+' | Opens the file in read and write mode. It creates a new file or truncates the contents of the file if it already exists. |
'w+b' | Opens the file in binary read and write mode. It creates a new file or truncates the contents of the file if it already exists. |
'a+' | Opens the file in read and append mode. It creates a new file if it does not exist. If the file already exists, new data is automatically added at the end of the file after existing data. |
'a+b' | Opens the file in binary read and append mode. |
After opening a file and performing some file operations, it can be safely closed using theclose()
method.
For example, let us write a program to open a filedata.txt
in read and append mode and close the file object in the successive statement.
f=open('data.txt','a+')f.close()
Opening a file using theopen()
function creates and returns a file object on which the following activities can be performed:
The following three methods can be used on a file object to sequentially read the contents of a file:
read(size = -1)
method is used to readsize
numbers of bytes of data from a file at a time.
In casesize
is not specified, it reads and returns the entire content of the file.
Example
Let us consider a file namedinfo.txt
containing the below text:
Hello WorldThank You
Code
>>>f=open('info.txt','r')>>>print(f.read(3))Hel>>>print(f.read(4))loW>>>f.close()>>>f=open('info.txt','r')>>>print(f.read())HelloWorldThankYou>>>f.close()
readline(size = -1)
method without any arguments is used to read and return one line at a time from the given file.
Ifsize
is specified, it returns at mostsize
bytes of data or the data until it encounters the newline character.
Example
Let us consider a file namedinfo.txt
containing the below text:
Hello WorldThank You
Code
>>>f=open('info.txt','r')>>>f.readline(3)'Hel'# Remaining data (byte size < 20)# until it encounters newline>>>f.readline(20)'lo World\n'>>>f.close()>>>f=open('info.txt','r')>>>f.readline()'Hello World\n'>>>f.close()
Sincereadline()
returns one row at a time, it can be used in awhile
statement to iterate over the data row-wise. Once it reaches the end of file it returns an empty string.
Code
f=open('info.txt','r')line=f.readline()whileline:# Removes newline# character from the# end of the lineline=line.strip()print(line)# read the next lineline=f.readline()f.close()
Output
Hello WorldThank You
for
statement can also be used to traverse the file row-wise without any requirement of thereadline()
method. Simply iterating over the file object returns the data one row at a time.
Code
f=open('info.txt','r')forlineinf:# Removes newline# character from the# end of the lineline=line.strip()print(line)f.close()
Output
Hello WorldThank You
readlines()
method returns a list of strings representing the lines (along with the newline character) of text file content.
Example
Let us consider a file namedinfo.txt
containing the below text:
Hello WorldThank You
Code
>>>f=open('info.txt','r')>>>f.readlines()['Hello World\n','Thank You']
The following two methods can be used on a file object to randomly access contents of a file:
tell()
returns the current byte position (integer) of the file object in the file.
Let us consider the same fileinfo.txt
for the below example:
>>>f=open('info.txt','r')>>>f.read(4)'Hell'>>>f.tell()4
seek(offset, reference=0)
can be used to seek a location in the file object which isoffset
bytes from the providedreference
.
The default value forreference
is0
which stands for the beginning of the file. For this defaultreference
, theoffset
has to be a whole number (>=0
).
Other allowed values ofreference
are:
1
, which denotes that the offset will be calculated from the current position of the file object (offset
can be positive or negative)2
, which denotes that the offset is calculated from the end of the file (offset
is negative)
Note: In text files (those opened without ab
in the mode string), only seek relative to the beginning of the file (reference = 0
) is allowed.reference = 1 or 2
is only valid when the file is opened in binary mode.
Let us consider the same fileinfo.txt
for the below example:
>>>f.seek(6)6# Hello World\nThank You# ^# Position of file pointer# after seek.>>>f.read()'World\nThank You'>>>f.tell()21>>>f.seek(12)12# Hello World\nThank You# ^# Position of file pointer# after seek.>>>f.read()'Thank You'
The following modes support file writing:
- Writing from the beginning of a file :
r+
,r+b
,w
,x
,wb
,w+
,w+b
. - Appending at the end of an existing file :
a
,ab
,a+
,a+b
.
The following methods can be used to write data in a file:
write(s)
method is useful for writing a strings
into a file.
>>>s="Hi World\nThank You">>>f=open("info.txt","w")>>>f.write(s)21# Number of characters written>>>f.close()
Now, on openinginfo.txt
using a text editor, the following content is displayed.
Hi WorldThank You
writelines(lines)
method writes a list of strings into a file.
This method does not add the line separator (newline character\n
), so they should be present at the end of each string.
>>>lines= ["Hi World\n", ..."Thank You\n"]>>>f=open("info.txt","w")>>>f.writelines(lines)>>>f.close()
Now, on openinginfo.txt
using a text editor, the following content is displayed.
Hi WorldThank You
Note the empty line at the end which is added due to the\n
at the end of the last string.
In Python, the protocols to read and write binary files (.dat
or.pickle
file extension) have been implemented in the built-inpickle
module.
pickle
is a Python specific binary file format which can not only be used to store binary data, but also store any Python object.
This process of translating data structures (lists, dictionaries, etc.) and code objects (classes, functions, etc.) into bytes that can be stored in a binary file is known asSerialization. This binary file can be stored on disk or shared and it can be de-serialized and used later via Python.
Thedump()
function can be used to write the pickled representation of any data or object into a file.
Syntax:pickle.dump(obj,file)
where:
obj
is the object to be writtenfile
is an open file object (opened in binary writewb
or appendab
mode).
>>>importpickle>>>l= [["Anita","Maths",83], ... ["Amar","Maths",95],... ["Ani","Maths",90]]>>>f=open("marks.dat","wb")>>>pickle.dump(l,f)>>>f.close()
Theload()
function can be used to read the pickled representation of an object or data from a file.
Syntax:pickle.load(file)
wherefile
is an open file object (opened in binary read moderb
).
>>>importpickle>>>f=open("marks.dat","rb")>>>l=pickle.load(f)>>>f.close()>>>l[['Anita','Maths',83], ['Amar','Maths',95], ['Ani','Maths',90]]
Write a program which performs the following Record Keeping activities:
- Accept some student marks as input and write the data in a binary file
- read all data from the file
- add some more data to the file
- show the entire contents of the file
Code
importpickle# Open file in binary write modef=open("marks.dat","wb")# Enter student marks dataflag=1whileflag!=0:name=input("Enter student name: ")subject=input("Enter subject: ")marks=float(input("Enter marks: "))rec= [name,subject,marks]pickle.dump(rec,f)flag=int(input("Enter 1 to add more or 0 to terminate: "))f.close()# read contents of file and displayf=open("marks.dat","rb")print("File contents:")# Iteratively read all pickle objects written# in the file until the end of fie is reachedtry:whileTrue:rec=pickle.load(f)print(rec)exceptEOFError:print("End of file reached")f.close()# Add more dataf=open("marks.dat","ab")flag=1whileflag!=0:name=input("Enter student name: ")subject=input("Enter subject: ")marks=float(input("Enter marks: "))rec= [name,subject,marks]pickle.dump(rec,f)flag=int(input("Enter 1 to add more or 0 to terminate: "))f.close()# show all contentsf=open("marks.dat","rb")print("File contents after appending:")try:whileTrue:rec=pickle.load(f)print(rec)exceptEOFError:print("End of file reached")f.close()
Input/Output
Enter student name: AnitaEnter subject: MathsEnter marks: 83Enter 1 to add more or 0 to terminate: 1Enter student name: AmarEnter subject: MathsEnter marks: 95Enter 1 to add more or 0 to terminate: 0File contents:['Anita', 'Maths', 83.0]['Amar', 'Maths', 95.0]End of file reachedEnter student name: AkashEnter subject: ScienceEnter marks: 92Enter 1 to add more or 0 to terminate: 1Enter student name: IraEnter subject: ScienceEnter marks: 99Enter 1 to add more or 0 to terminate: 0File contents after appending:['Anita', 'Maths', 83.0]['Amar', 'Maths', 95.0]['Akash', 'Science', 92.0]['Ira', 'Science', 99.0]End of file reached
Comma-separated value (CSV) file format is one of the most common data serialization and exchange format where each row is a set of values separated by a delimiter.
The main rules for creating a CSV file are:
- Each field is separated by a delimiter. Comma (
,
) is the default delimiter used in CSV files, but other delimiters (;
,\t
,|
, etc.) can also be used. - If any field value contains a delimiter, it must be surrounded by a pair of quote characters (usually double-quotes character
"
). - There is an optional single header line which contains the names of the fields.
An example CSV file containing the marks data is given below:
Name,Subject,MarksAnita,Maths,83.0Amar,Maths,95.0Akash,Science,92.0Ira,Science,99.0
To demonstrate why quote characters are required, let us have a look at the below contents of a CSV file which contains the average marks of some subjects (separated by comma) for each student.
Name,Average Of,MarksAnita,"Accounting,Maths",83.0Amar,"Accounting,Maths",95.0Akash,"Physics,Chemistry,Biology",92.0Ira,"Physics,Chemistry,Biology",99.0
Python has built-incsv
module which provides utilities to read and write CSV files.
csv.reader(csvfile, delimiter=',', quotechar='"')
function is used to return the reader object which iterates over CSV file line-by-line and returns it as a list of strings.
Thecsvfile
file object should be opened withnewline=''
argument as thecsv
module has its own newline handling which correctly interprets the newlines depending on platform or in case they are embedded inside quoted fields.
Example #1
Let us write a program to read the contents ofmarks.csv
.
marks.csv
Name,Subject,MarksAnita,Maths,83.0Amar,Maths,95.0Akash,Science,92.0Ira,Science,99.0
Code
importcsvwithopen('marks.csv',newline='')asf:reader=csv.reader(f)forrowinreader:print(row)
Output
['Name', 'Subject', 'Marks']['Anita', 'Maths', '83.0']['Amar', 'Maths', '95.0']['Akash', 'Science', '92.0']['Ira', 'Science', '99.0']
Example #2
Let us write a program to read the contents ofmarks2.csv
.
Note the change in delimiter & quote character.
marks2.csv
Name|Average Of|MarksAnita|^Accounting|Maths^|83.0Amar|^Accounting|Maths^|95.0Akash|^Physics|Chemistry|Biology^|92.0Ira|^Physics|Chemistry|Biology^|99.0
Code
importcsvwithopen('marks2.csv',newline='')asf:reader=csv.reader(f,delimiter='|',quotechar='^')forrowinreader:print(row)
Output
['Name', 'Average Of', 'Marks']['Anita', 'Accounting|Maths', '83.0']['Amar', 'Accounting|Maths', '95.0']['Akash', 'Physics|Chemistry|Biology', '92.0']['Ira', 'Physics|Chemistry|Biology', '99.0']
csv.writer(csvfile, delimiter=',', quotechar='"')
function returns a writer object which converts data in form of a list into delimited strings which is passed down to the file object for writing.
All non-string data in the list are automatically converted into string before it is written.
The methodswriterow(row)
orwriterows(rows)
can be used to write a row (list of strings) or list of rows to a CSV file.
Example #3
Write a program to create a CSV file containing subject-wise marks of students delimited by semi-colon (;
).
Solution #1: Using writerow()
importcsvh= ["Name","Subject","Marks"]marks= [["Anita","Maths",83.0], ["Amar","Maths",95.0], ["Akash","Science",92.0], ["Ira","Science",99.0]]withopen('marks.csv','w',newline='')asf:writer=csv.writer(f,delimiter=';')# Writing headerwriter.writerow(h)#Writing Dataforrowinmarks:writer.writerow(row)
Solution #2: Using writerows()
importcsvh= ["Name","Subject","Marks"]marks= [["Anita","Maths",83.0], ["Amar","Maths",95.0], ["Akash","Science",92.0], ["Ira","Science",99.0]]withopen('marks.csv','w',newline='')asf:writer=csv.writer(f,delimiter=';')# Writing headerwriter.writerow(h)#Writing Datawriter.writerows(marks)
Output - marks.csv
Name;Subject;MarksAnita;Maths;83.0Amar;Maths;95.0Akash;Science;92.0Ira;Science;99.0
The built-in functions and modules of the Python Standard Library add tremendous value by performing generic operations and can be utilised by developers for any task.
But, at times a programmer needs to perform a set of operations for a specific task multiple times. In such cases instead of rewriting the statements again and again, the block of code can be wrapped into a function which can be called anytime in the scope of the program.
User-defined functions make the program more:
- Organized, as a block of code is now replaced by a single function call which performs a specific task.
- Manageable, due to reduction of code length.
- Reusable, as the user defined function can be used anywhere in the current program or it can also be imported in another program.
Let us take a deep dive into a user defined function (provided below) and understand its various components:
defadder(f,s,t=None):""" Returns the sum of f, s and t. If t is not provided, return the sum of f and s. """s=f+sift:s+=treturns
defadder(f,s,t=None):
The function header begins with the keyworddef
.
The function name follows thedef
keyword. As it is an identifier, the same nomenclature is followed while naming it.adder
is the name of the function in the above example.
The function name is followed by a pair of parenthesis( )
.
In case any parameters are required by the function, they are enclosed inside the parentheses.f, s, t = None
are the three parameters of the function.
Finally, the function header ends with a colon.
The body of the function consists of one or more Python statements which have the same amount of indentation (4 spaces) from the function header.
It is a good practice to include the documentation string at the beginning of the function body that briefly explains how it works. This docstring can either be a single-line or a multiline string literal. In the above example the docstring is:
""" Returns the sum of f, s and t. If t is not provided, return the sum of f and s. """
This docstring is followed by a series of statements which represent the set of instructions to be executed. The set of instructions in the above example are
s=f+sift:s+=t
The code block finally ends with areturn
statement which returns one or more values.
returns
In the above example the value ofs
is returned which is the sum.
A missingreturn
statement or areturn
statement returning no value implies that the function returnsNone
. These functions are known as void functions which display the result of statements viaprint()
function.
defadder(f,s,t=None):""" Returns the sum of f, s and t. If t is not provided, return the sum of f and s. """s=f+sift:s+=treturns
After defining the above function, let us now invoke or call the function:
fst=20snd=10trd=10sm1=adder(fst,snd,trd)sm2=adder(fst,snd)
f
,s
andt
are known as positional parameters as they have a defined position in the function definition.
Also there is a provision to assign a default value to a parameter using=
.
In the above examplet
has a default value ofNone
.
Arguments are the values passed to a function (or method) when it is called. In the above examplefst
,snd
andtrd
are the arguments. Sincet
has a default value, the function can be invoked with or without thetrd
argument as shown forsm2
.
Argument values are assigned to the corresponding function parameters that are available as local variables inside the function.
Thus, value offst
is assigned tof
andsnd
is assigned tos
. In case there is no third argument,t
has the default valueNone
.
The values ofsm1
andsm2
after executing the script are:
>>>sm140>>>sm230
In Python, every variable has a well defined scope, i.e., it is accessible by a part of the program and it cannot be accessed from outside its scope.
Based on their scope, there are two types of variables:
Local variables are defined inside a function or a code block and can only be accessed where they are defined.
They exist only during the execution of the function or the code block.
Code
deftriple(a):result=3*a# variable result# is accessible locallyprint("local:",result)# Outputs value of result# inside the functiontriple(3)# Throws an error as result is# not accessible outside the scope# of function triple()print("outside:",result)
Output
local: 9Traceback (most recent call last): File "<stdin>", line 1, in <module>NameError: name 'result' is not defined
Variables defined outside any function or code block are known as global variables.
They are often used to specify mathematical constants, file path or other such values and can be accessed anywhere in the source code (by functions or code blocks).
Below example demonstrates how the global variablen
can be accessed by all the functions.
Code
n=10defuno():print(n)deftres():print(n*3)uno()tres()
Output
1030
In case a variable is defined inside a function with the same name as that of a global variable, then the variable is considered as a local variable and all references made to the variable point to this local variable.
Code
n=10# local n inside the# functiondefuno():n=5print(n)# global n inside the# functiondeftres():print(n*3)uno()tres()
Output
530
Any changes made to a global variable inside a code block or a function can modify it for that session. This can have an impact on all the code blocks/functions that access and use it.
To modify the value of a global variable one can use theglobal
keyword as shown in the example below.
Code
n=10# function modifies the# value of global ndefuno():globalnn=5print(n)deftres():print(n*3)uno()tres()
Output
515
When mutable objects (list
,dict
) are provided as an argument to a function, any modification made to the corresponding parameters in the body of the function leads to the modification of the original object.
Hence, care should be taken while passing mutable objects.
Code
defrepeat(l):foriinrange(len(l)):l[i]=l[i]*2l= [1,2,3]print(l)repeat(l)# l is modifiedprint(l)
Output
[1, 2, 3][2, 4, 6]
Code
defrepeat(d):foriind.keys():d[i]=d[i]*2d= {'a':1,'b':2,'c':3}print(d)repeat(d)# d is modifiedprint(d)
Output
{'a': 1, 'b': 2, 'c': 3}{'a': 2, 'b': 4, 'c': 6}
Once a function is defined in the Python interpreter, it can be called any number of times. But, these function definitions are lost upon exiting the interpreter.
To solve this problem we can create a python script with the function definitions at the beginning of the file, followed by the rest of the code which includes statements invoking the defined functions.
But, this process is tedious and not manageable as what makes user-defined functions powerful is that the programmer can -Write once, and use many times.
Instead of repeating the function definition again and again for each new program, one can put all the function definitions in a file from which the required function can be imported and invoked either in script mode or interactive mode.
This file (.py
extension) is known as amodule
and it is the most basic form of reusable code accessible by other programs.
Let us create a new filebasics.py
containing the following functions:
defadder(f,s,t=None):""" Returns the sum of f, s and t. If t is not provided, return the sum of f and s. """s=f+sift:s+=treturnsdeftripler(a):""" Multiplies a by 3 and returns it """result=3*areturnresult
After saving thebasics.py
file, reopen IDLE and create a new filetest.py
in the same directory as basics.py.
The name of the file is the module name which is also available as the value of the global variable__name__
in the module.
Import the functions of the basics module intest.py
by executing the following statement.
importbasics
The aboveimport
statement loads all the functions available in thebasics
module. To access any function in the module, simply type the module name followed by a period (.
), followed by the function name.
Code
importbasicssa=basics.adder(20,10,10)st=basics.tripler(20)print(sa)print(st)
Output
4060
Theas
keyword can also be used to create an alias which makes it easier and more manageable to use in the program.
Code
importbasicsasbasa=ba.adder(20,10,10)st=ba.tripler(20)print(sa)print(st)
Output
4060
Instead of loading all the functions in a module, thefrom
statement can be used to access only specified functions.
Code
frombasicsimportaddersa=adder(20,10,10)print(sa)
Output
40
To import all functions and global variables in the module the*
wildcard character can be used.
Code
frombasicsimport*sa=adder(20,10,10)st=tripler(20)
Output
4060
The above method of using*
to import the contents of a module is not recommended as it can clutter the namespace and cause issues if there are conflicts between the identifiers defined by the programmer and those defined in the module/package.
Apart from containing definitions, a module can also contain block of code which is executed whenever the stand-alone script is run in script mode. The block has to be enclosed in anif
statement as shown below:
if__name__=='__main__': ...codetobeexecuted...
Using the above pattern, a file can be imported or executed directly.
Let us undertake an example module (save it asmulti.py
) which multiplies two numbers:
defmultiply(a,b):returna*bf=int(input("Enter a: "))s=int(input("Enter b: "))print(multiply(f,s))
Now when we try to load all functions from the module it automatically executes the input assignment statements and prints the output.
>>>fromtestimport*Entera:4Enterb:520>>>
Let us modify the code:
defmultiply(a,b):returna*bif__name__=='__main__':f=int(input("Enter a: "))s=int(input("Enter b: "))print(multiply(f,s))
Now, the block will execute only if the script is executed directly and not when the file is imported as a module.
>>>fromtestimport*>>>multiply(4,5)20
A collection of modules which can work together to provide a common functionality is known as apackage.
These modules are present in a folder along with the__init__.py
file which tells Python that this folder is a package.
A package can also contain subfolders (sub-packages), each containing their respective__init__.py
files.
Let us take the example of a package calledrestaurant
which consists of various modules to perform activities such as order booking, reservation, staff attendance, etc.
Here is a possible structure for the package:
restaurant/ __init__.py orders.py reservation.py employee.py inventory.py
A package is simply the directory containing sub-packages and modules, but when this package or a collection of packages are made available for others to use (eg. via PyPI) it is known as alibrary.
For example,restaurant
can be called a library if it provides reusable codes to manage a restaurant and is built using multiple packages which handle the various aspects of a restaurant like human resource management, inventory management, order fulfillment and billing, etc.
One should note that the above definition is not strict and often the terms package and library are used interchangeably.
About
📖🐍 Free & Open Source book to master Python 3. Also available: PDF & Web Interface.
Topics
Resources
License
Code of conduct
Uh oh!
There was an error while loading.Please reload this page.