10.2.fileinput — Iterate over lines from multiple input streams¶
Source code:Lib/fileinput.py
This module implements a helper class and functions to quickly write aloop over standard input or a list of files. If you just want to read orwrite one file seeopen().
The typical use is:
importfileinputforlineinfileinput.input():process(line)
This iterates over the lines of all files listed insys.argv[1:], defaultingtosys.stdin if the list is empty. If a filename is'-', it is alsoreplaced bysys.stdin. To specify an alternative list of filenames, pass itas the first argument toinput(). A single file name is also allowed.
All files are opened in text mode by default, but you can override this byspecifying themode parameter in the call toinput() orFileInput(). If an I/O error occurs during opening or reading a file,IOError is raised.
Ifsys.stdin is used more than once, the second and further use will returnno lines, except perhaps for interactive use, or if it has been explicitly reset(e.g. usingsys.stdin.seek(0)).
Empty files are opened and immediately closed; the only time their presence inthe list of filenames is noticeable at all is when the last file opened isempty.
Lines are returned with any newlines intact, which means that the last line ina file may not have one.
You can control how files are opened by providing an opening hook via theopenhook parameter tofileinput.input() orFileInput(). Thehook must be a function that takes two arguments,filename andmode, andreturns an accordingly opened file-like object. Two useful hooks are alreadyprovided by this module.
The following function is the primary interface of this module:
fileinput.input([files[,inplace[,backup[,bufsize[,mode[,openhook]]]]]])¶Create an instance of the
FileInputclass. The instance will be usedas global state for the functions of this module, and is also returned to useduring iteration. The parameters to this function will be passed along to theconstructor of theFileInputclass.Changed in version 2.5:Added themode andopenhook parameters.
Changed in version 2.7.12:Thebufsize parameter is no longer used.
The following functions use the global state created byfileinput.input();if there is no active state,RuntimeError is raised.
fileinput.filename()¶Return the name of the file currently being read. Before the first line hasbeen read, returns
None.
fileinput.fileno()¶Return the integer “file descriptor” for the current file. When no file isopened (before the first line and between files), returns
-1.New in version 2.5.
fileinput.lineno()¶Return the cumulative line number of the line that has just been read. Beforethe first line has been read, returns
0. After the last line of the lastfile has been read, returns the line number of that line.
fileinput.filelineno()¶Return the line number in the current file. Before the first line has beenread, returns
0. After the last line of the last file has been read,returns the line number of that line within the file.
fileinput.isfirstline()¶Returns true if the line just read is the first line of its file, otherwisereturns false.
fileinput.isstdin()¶Returns true if the last line was read from
sys.stdin, otherwise returnsfalse.
fileinput.nextfile()¶Close the current file so that the next iteration will read the first line fromthe next file (if any); lines not read from the file will not count towards thecumulative line count. The filename is not changed until after the first lineof the next file has been read. Before the first line has been read, thisfunction has no effect; it cannot be used to skip the first file. After thelast line of the last file has been read, this function has no effect.
fileinput.close()¶Close the sequence.
The class which implements the sequence behavior provided by the module isavailable for subclassing as well:
- class
fileinput.FileInput([files[,inplace[,backup[,bufsize[,mode[,openhook]]]]]])¶ Class
FileInputis the implementation; its methodsfilename(),fileno(),lineno(),filelineno(),isfirstline(),isstdin(),nextfile()andclose()correspond to thefunctions of the same name in the module. In addition it has areadline()method which returns the next input line,and a__getitem__()method which implements the sequence behavior.The sequence must be accessed in strictly sequential order; random accessandreadline()cannot be mixed.Withmode you can specify which file mode will be passed to
open(). Itmust be one of'r','rU','U'and'rb'.Theopenhook, when given, must be a function that takes two arguments,filename andmode, and returns an accordingly opened file-like object. Youcannot useinplace andopenhook together.
Changed in version 2.5:Added themode andopenhook parameters.
Changed in version 2.7.12:Thebufsize parameter is no longer used.
Optional in-place filtering: if the keyword argumentinplace=1 is passedtofileinput.input() or to theFileInput constructor, the file ismoved to a backup file and standard output is directed to the input file (if afile of the same name as the backup file already exists, it will be replacedsilently). This makes it possible to write a filter that rewrites its inputfile in place. If thebackup parameter is given (typically asbackup='.<someextension>'), it specifies the extension for the backup file,and the backup file remains around; by default, the extension is'.bak' andit is deleted when the output file is closed. In-place filtering is disabledwhen standard input is read.
Note
The current implementation does not work for MS-DOS 8+3 filesystems.
The two following opening hooks are provided by this module:
fileinput.hook_compressed(filename,mode)¶Transparently opens files compressed with gzip and bzip2 (recognized by theextensions
'.gz'and'.bz2') using thegzipandbz2modules. If the filename extension is not'.gz'or'.bz2', the file isopened normally (ie, usingopen()without any decompression).Usage example:
fi=fileinput.FileInput(openhook=fileinput.hook_compressed)New in version 2.5.
fileinput.hook_encoded(encoding)¶Returns a hook which opens each file with
io.open(), using the givenencoding to read the file.Usage example:
fi=fileinput.FileInput(openhook=fileinput.hook_encoded("iso-8859-1"))Note
With this hook,
FileInputmight return Unicode strings depending on thespecifiedencoding.New in version 2.5.
