NAME
PSGI - Perl Web Server Gateway Interface Specification
ABSTRACT
This document specifies a standard interface between web servers and Perl web applications or frameworks. This interface is designed to promote web application portability and reduce the duplication of effort by web application framework developers.
Please keep in mind that PSGI is not Yet Another web application framework. PSGI is a specification to decouple web server environments from web application framework code. Nor is PSGI a web application API. Web application developers (end users) will not run their web applications directly using the PSGI interface, but instead are encouraged to use frameworks that support PSGI.
TERMINOLOGY
- Web Servers
Web servers accept HTTP requests issued by web clients, dispatching those requests to web applications if configured to do so, and return HTTP responses to the request-initiating clients.
- PSGI Server
APSGI Server is a Perl program providing an environment for aPSGI application to run in.
PSGI specifying an interface for web applications and the main purpose of web applications being to be served to the Internet, aPSGI Server will most likely be either: part of a web server (like Apache mod_perl), connected to a web server (with FastCGI, SCGI), invoked by a web server (as in plain old CGI), or be a standalone web server itself, written entirely or partly in Perl.
There is, however, no requirement for aPSGI Server to actually be a web server or part of one, asPSGI only defines an interface between the server and the application, not between the server and the world.
APSGI Server is often also calledPSGI Application Container because it is similar to aJava Servlet container, which is Java process providing an environment forJava Servlets.
- Applications
Web applications accept HTTP requests and return HTTP responses.
PSGI applications are web applications conforming to the PSGI interface, prescribing they take the form of a code reference with defined input and output.
For simplicity,PSGI Applications will also be referred to asApplications for the remainder of this document.
- Middleware
Middleware is a PSGI application (a code reference)and aServer.Middleware looks like anapplication when called from aserver, and it in turn can call otherapplications. It can be thought of aplugin to extend a PSGI application.
- Framework developers
Framework developers are the authors of web application frameworks. They write adapters (or engines) which accept PSGI input, run a web application, and return a PSGI response to theserver.
- Web application developers
Web application developers are developers who write code on top of a web application framework. These developers should never have to deal with PSGI directly.
SPECIFICATION
Application
A PSGI application is a Perl code reference. It takes exactly one argument, the environment, and returns an array reference containing exactly three values.
my $app = sub { my $env = shift; return [ '200', [ 'Content-Type' => 'text/plain' ], [ "Hello World" ], # or IO::Handle-like object ];};
The Environment
The environment MUST be a hash reference that includes CGI-like headers, as detailed below. The application is free to modify the environment. The environment MUST include these keys (adopted fromPEP 333,Rack andJSGI) except when they would normally be empty.
When an environment key is described as a boolean, its value MUST conform to Perl's notion of boolean-ness. This means that an empty string or an explicit0
are both valid false values. If a boolean key is not present, an application MAY treat this as a false value.
The values for all CGI keys (named without a period) MUST be a scalar string.
See below for details.
REQUEST_METHOD
: The HTTP request method, such as "GET" or "POST". ThisMUST NOT be an empty string, and so is always required.SCRIPT_NAME
: The initial portion of the request URL'spath, corresponding to the application. This tells the application its virtual "location". This may be an empty string if the application corresponds to the server's root URI.If this key is not empty, it MUST start with a forward slash (
/
).PATH_INFO
: The remainder of the request URL'spath, designating the virtual "location" of the request's target within the application. This may be an empty string if the request URL targets the application root and does not have a trailing slash. This value should be URI decoded by servers in order to be compatible withRFC 3875.If this key is not empty, it MUST start with a forward slash (
/
).REQUEST_URI
: The undecoded, raw request URL line. It is the raw URI path and query part that appears in the HTTPGET /... HTTP/1.x
line and doesn't contain URI scheme and host names.Unlike
PATH_INFO
, this valueSHOULD NOT be decoded by servers. It is an application's responsibility to properly decode paths in order to map URLs to application handlers if they choose to use this key instead ofPATH_INFO
.QUERY_STRING
: The portion of the request URL that follows the?
, if any. This key MAY be empty, butMUST always be present, even if empty.SERVER_NAME
,SERVER_PORT
: When combined withSCRIPT_NAME
andPATH_INFO
, these keys can be used to complete the URL. Note, however, thatHTTP_HOST
, if present, should be used in preference toSERVER_NAME
for reconstructing the request URL.SERVER_NAME
andSERVER_PORT
MUST NOT be empty strings, and are always required.SERVER_PROTOCOL
: The version of the protocol the client used to send the request. Typically this will be something like "HTTP/1.0" or "HTTP/1.1" and may be used by the application to determine how to treat any HTTP request headers.CONTENT_LENGTH
: The length of the content in bytes, as an integer. The presence or absence of this key should correspond to the presence or absence of HTTP Content-Length header in the request.CONTENT_TYPE
: The request's MIME type, as specified by the client. The presence or absence of this key should correspond to the presence or absence of HTTP Content-Type header in the request.HTTP_*
Keys: These keys correspond to the client-supplied HTTP request headers. The presence or absence of these keys should correspond to the presence or absence of the appropriate HTTP header in the request.The key is obtained converting the HTTP header field name to upper case, replacing all occurrences of hyphens
-
with underscores_
and prependingHTTP_
, as inRFC 3875.If there are multiple header lines sent with the same key, the server should treat them as if they were sent in one line and combine them with
,
, as inRFC 2616.
A server should attempt to provide as many other CGI variables as are applicable. Note, however, that an application that uses any CGI variables other than the ones listed above are necessarily non-portable to web servers that do not support the relevant extensions.
In addition to the keys above, the PSGI environment MUST also include these PSGI-specific keys:
psgi.version
: An array reference [1,1] representing this version of PSGI. The first number is the major version and the second it the minor version.psgi.url_scheme
: A stringhttp
orhttps
, depending on the request URL.psgi.input
: the input stream. See below for details.psgi.errors
: the error stream. See below for details.psgi.multithread
: This is a boolean value, which MUST be true if the application may be simultaneously invoked by another thread in the same process, false otherwise.psgi.multiprocess
: This is a boolean value, which MUST be true if an equivalent application object may be simultaneously invoked by another process, false otherwise.psgi.run_once
: A boolean which is true if the server expects (but does not guarantee!) that the application will only be invoked this one time during the life of its containing process. Normally, this will only be true for a server based on CGI (or something similar).psgi.nonblocking
: A boolean which is true if the server is calling the application in an non-blocking event loop.psgi.streaming
: A boolean which is true if the server supports callback style delayed response and streaming writer object.
The server or the application can store its own data in the environment as well. These keys MUST contain at least one dot, and SHOULD be prefixed uniquely.
Thepsgi.
prefix is reserved for use with the PSGI core specification, andpsgix.
prefix is reserved for officially blessed extensions. These prefixesMUST NOT be used by other servers or application. Seepsgi-extensions for the list of officially approved extensions.
The environmentMUST NOT contain keys namedHTTP_CONTENT_TYPE
orHTTP_CONTENT_LENGTH
.
One ofSCRIPT_NAME
orPATH_INFO
MUST be set. WhenREQUEST_URI
is/
,PATH_INFO
should be/
andSCRIPT_NAME
should be empty.SCRIPT_NAME
MUST NOT be/
, but MAY be empty.
The Input Stream
The input stream inpsgi.input
is anIO::Handle-like object which streams the raw HTTP POST or PUT data. If it is a file handle then it MUST be opened in binary mode. The input streamMUST respond toread
and MAY implementseek
.
Perl's built-in filehandles orIO::Handle based objects should work as-is in a PSGI server. Application developersSHOULD NOT inspect the type or class of the stream. Instead, they SHOULD simply callread
on the object.
Application developersSHOULD NOT use Perl's built-inread
or iterator (<$fh>
) to read from the input stream. Instead, application developers should callread
as a method ($fh->read
) to allow for duck typing.
Framework developers, if they know the input stream will be used with the built-in read() in any upstream code they can't touch, SHOULD use PerlIO or a tied handle to work around with this problem.
The input stream object is expected to provide aread
method:
- read
$input->read($buf, $len [, $offset ]);
Returns the number of characters actually read, 0 at end of file, or undef if there was an error.
It may also implement an optionalseek
method. Ifpsgix.input.buffered
environment is true, it MUST implement theseek
method.
- seek
$input->seek($pos, $whence);
Returns 1 on success, 0 otherwise.
See theIO::Handle documentation for more details on exactly how these methods should work.
The Error Stream
The error stream inpsgi.errors
is anIO::Handle-like object to print errors. The error stream must implement aprint
method.
As with the input stream, Perl's built-in filehandles orIO::Handle based objects should work as-is in a PSGI server. Application developersSHOULD NOT inspect the type or class of the stream. Instead, they SHOULD simply callprint
on the object.
$errors->print($error);
Returns true if successful.
The Response
Applications MUST return a response as either a three element array reference, or a code reference for a delayed/streaming response.
The response array reference consists of the following elements:
Status
An HTTP status code. This MUST be an integer greater than or equal to 100, and SHOULD be an HTTP status code as documented inRFC 2616.
Headers
The headers MUST be an array reference (not a hash reference) of key/value pairs. This means it MUST contain an even number of elements.
The headerMUST NOT contain a key namedStatus
, nor any keys with:
or newlines in their name. ItMUST NOT contain any keys that end in-
or_
.
All keys MUST consist only of letters, digits,_
or-
. All keys MUST start with a letter. The value of the headerMUST be a scalar string and defined. The value stringMUST NOT contain characters below octal 037 i.e. chr(31).
If the same key name appears multiple times in an array ref, those header lines MUST be sent to the client separately (e.g. multipleSet-Cookie
lines).
Content-Type
There MUST be aContent-Type
except when theStatus
is 1xx, 204 or 304, in which case thereMUST NOT be a content type.
Content-Length
ThereMUST NOT be aContent-Length
header when theStatus
is 1xx, 204 or 304.
If the Status is not 1xx, 204 or 304 and there is noContent-Length
header, a PSGI server MAY calculate the content length by looking at the Body. This value can then be appended to the list of headers returned by the application.
Body
The response body MUST be returned from the application as either an array reference or a handle containing the response body as byte strings. The body MUST be encoded into appropriate encodings andMUST NOT contain wide characters (> 255).
If the body is an array reference, it is expected to contain an array of lines which make up the body.
my $body = [ "Hello\n", "World\n" ];
Note that the elements in an array reference areNOT REQUIRED to end in a newline. A server SHOULD write each elements as-is to the client, andSHOULD NOT care if the line ends with newline or not.
An array reference with a single value is valid. So
[ $html ]
is a valid response body.The body can instead be a handle, either a Perl built-in filehandle or anIO::Handle-like object.
open my $body, "</path/to/file";open my $body, "<:via(SomePerlIO)", ...;my $body = IO::File->new("/path/to/file");# mock class that implements getline() and close()my $body = SomeClass->new();
ServersSHOULD NOT check the type or class of the body. Instead, they should simply call
getline
to iterate over the body, and callclose
when done.Servers MAY check if the body is a real filehandle using
fileno
andScalar::Util::reftype
. If the body is real filehandle, the server MAY optimize using techniques likesendfile(2).The body object MAY also respond to a
path
method. This method is expected to return the path to a file accessible by the server. This allows the server to use this information instead of a file descriptor number to serve the file.Servers SHOULD set the
$/
special variable to the buffer size when reading content from$body
using thegetline
method. This is done by setting$/
with a reference to an integer ($/ = \8192
).If the body filehandle is a Perl built-in filehandleIO::Handle object, they will respect this value. Similarly, an object which provides the same API MAY also respect this special variable, but are not required to do so.
Delayed Response and Streaming Body
The PSGI interface allows applications and servers to provide a callback-style response instead of the three-element array reference. This allows for a delayed response and a streaming body (server push).
This interface SHOULD be implemented by PSGI servers, andpsgi.streaming
environment MUST be set to true in such servers.
To enable a delayed response, the application SHOULD return a callback as its response. An application MAY check if thepsgi.streaming
environment is true and falls back to the direct response if it isn't.
This callback will be called withanother subroutine reference (referred to as theresponder from now on) as its only argument. Theresponder should in turn be called with the standard three element array reference response. This is best illustrated with an example:
my $app = sub { my $env = shift; # Delays response until it fetches content from the network return sub { my $responder = shift; fetch_content_from_server(sub { my $content = shift; $responder->([ 200, $headers, [ $content ] ]); }); };};
An application MAY omit the third element (the body) when calling theresponder. If the body is omitted, theresponder MUST returnyet another object which implementswrite
andclose
methods. Again, an example illustrates this best.
my $app = sub { my $env = shift; # immediately starts the response and stream the content return sub { my $responder = shift; my $writer = $responder->( [ 200, [ 'Content-Type', 'application/json' ]]); wait_for_events(sub { my $new_event = shift; if ($new_event) { $writer->write($new_event->as_json . "\n"); } else { $writer->close; } }); };};
This delayed response and streaming API is useful if you want to implement a non-blocking I/O based server streaming or long-poll Comet push technology, but could also be used to implement unbuffered writes in a blocking server.
Middleware
Amiddleware component takes another PSGI application and runs it. From the perspective of a server, a middleware component is a PSGI application. From the perspective of the application being run by the middleware component, the middleware is the server. Generally, this will be done in order to implement some sort of pre-processing on the PSGI environment hash or post-processing on the response.
Here's a simple example that appends a special HTTP headerX-PSGI-Used to any PSGI application.
# $app is a simple PSGI applicationmy $app = sub { my $env = shift; return [ '200', [ 'Content-Type' => 'text/plain' ], [ "Hello World" ] ];};# $xheader is a piece of middleware that wraps $appmy $xheader = sub { my $env = shift; my $res = $app->($env); push @{$res->[1]}, 'X-PSGI-Used' => 1; return $res;};
Middleware MUST behave exactly like a PSGI application from the perspective of a server. Middleware MAY decide not to support the streaming interface discussed earlier, but SHOULD pass through the response types that it doesn't understand.
CHANGELOGS
1.1: 2010.02.xx
Added optional PSGI keys as extensions:
psgix.logger
andpsgix.session
.psgi.streaming
SHOULD be implemented by PSGI servers, rather thanMAY.PSGI keys
psgi.run_once
,psgi.nonblocking
andpsgi.streaming
MUST be set by PSGI servers.Removed
poll_cb
from writer methods.
ACKNOWLEDGEMENTS
Some parts of this specification are adopted from the following specifications.
PEP333 Python Web Server Gateway Interfacehttp://www.python.org/dev/peps/pep-0333
JSGI Specificationhttp://jackjs.org/jsgi-spec.html
I'd like to thank authors of these great documents.
AUTHOR
Tatsuhiko Miyagawa <miyagawa@bulknews.net>
CONTRIBUTORS
The following people have contributed to the PSGI specification and Plack implementation by commiting their code, sending patches, reporting bugs, asking questions, suggesting useful advices, nitpicking, chatting on IRC or commenting on my blog (in no particular order):
Tokuhiro MatsunoKazuhiro OsawaYuval KogmanKazuho OkuAlexis SukriehTakatoshi KitanoStevan LittleDaisuke MurasemalaPedro MeloJesse LuehrsJohn BeppuShawn M MooreMark StosbergMatt S TroutJesse VincentChia-liang KaoDave RolskyHans Dieter PearceyRandy J RayBenjamin TrottMax MaischeinSlaven RezićMarcel GrünauerMasayoshi SekimuraBrock WilcoxPiers CawleyDaisuke MakiKang-min LiuYasuhiro MatsumotoAsh BerlinArtur BergmanSimon CozensScott McWhirterJiro NishiguchiMasahiro ChibaPatrick DonelanPaul DriverFlorian Ragwitz
COPYRIGHT AND LICENSE
Copyright Tatsuhiko Miyagawa, 2009-2011.
This document is licensed under the Creative Commons license by-sa.
Module Install Instructions
To install PSGI, copy and paste the appropriate command in to your terminal.
cpanm PSGI
perl -MCPAN -e shellinstall PSGI
For more information on module installation, please visitthe detailed CPAN module installation guide.