Understanding HTTP Headers and the PHP header() Function
A tutorial by NicholasSolutions
Introduction
Many beginning and intermediate PHP programmers seem to think the header() function is some kind of strange voodoo.
They work from examples and code snippets and are able to get things done with it, but they don't know quite how it works.
That was certainly the way I regarded header() the first time I saw it.
In reality, it's quite simple. In this tutorial, I'll explain a little about how HTTP headers work, how they relate to PHP,
and a little about their meta tag equivalents.
Hopefully by the time you're done reading this, you'll feel more confident about how to use the header() function,
and even have some new ideas about how it can help you. We'll also cover some other important topics related to HTTP
headers and PHP. Before we talk about any programming at all, though, we need to quickly (and incompletely) go over
how HTTP (HyperText Transfer Protocol) works in general.
HTTP Overview
Headers: words in a conversation
HTTP is the protocol (the set of 'rules') for transferring data (e.g. HTML in web pages, pictures, files) between
web servers and client browsers, and usually takes place on port 80. This is where the 'http://' in website URLs comes from.
The first time most people make a web page, they write the HTML on their computer, view it locally in a browser,
upload it to their server, and view it on the web. It might seem like viewing a page locally and viewing it on the server
is exactly the same, and that the only data going back and forth between the server and the
browser is the HTML and any images included in the page. But there is actually a lot of other information that you do not
see when you view a file on the web — the headers.
Headers can be separated into two broad types: Request headers that your browser sends to the server when you request a file, and
Response headers that the server sends to the browser when it serves the file. Think of these headers as the words in a
conversation taking place between the browser and the server. I like to imagine the server as a librarian, and the browser as a
researcher asking for a library resource. The browser walks up to the server at the main desk (port 80) and says
something like, "Hi, my name is Mozilla, and I'm looking for the resource with the call number 'http://www.zjjv.com/'.
Can you get it for me?" The server listens, and responds "Yes, I found it, let me send it to you. The data in the item
is HTML text, and it says '<html>...'" The browser reads through, comes to an image tag, and asks the server for the
item at the location given in the src attribute. The server looks, finds the file and says "This file is a PNG image,
and the data is...." You get the idea.
Another conversation might go like this:
Browser: Hi, I'm Mozilla, can I have the file at 'http://www.zjjv.com/moved.html'?
Server: That file is no longer there, it is at 'http://www.zjjv.com/newloc.html'.
Browser: Hi, I'm Mozilla, can I have the file at 'http://www.zjjv.com/newloc.html'?
Server: I found the file. Look at it for 10 seconds and then ask me again. It's HTML text and it reads....
...10 seconds...
Browser: Hi, I'm Mozilla, can I have the file at 'http://www.zjjv.com/newloc.html'?
Server: I found the file. Look at it for 10 seconds and then ask me again. It's HTML text and it reads....
...10 seconds...
Browser: Hi, I'm Mozilla, can I have the file at 'http://www.zjjv.com/newloc.html'?
Server: I found the file. Look at it for 10 seconds and then ask me again. It's HTML text and it reads....
....and so on, until the browser is redirected by the user....
As you can see, there is a lot going on that headers control. Using the header() function, you can make the
server send any headers that you need, which allows you to do some really cool things beyond just sending plain old HTML.
Seeing the whole conversation
Before moving ahead, let's get a better idea of how HTTP headers work by viewing a webpage without a browser, so we can
see the conversation in its entirety. Start by opening a command prompt (in Windows, go to Start->Run, type cmd, and
click "OK"...if you're using Linux you probably already know). At the prompt, type:
telnet expertsrt.com 80
and press Enter. This will connect you to expertsrt.com on port 80. Next, copy and paste just the text below:
GET / HTTP/1.1
Host: expertsrt.com
Don't worry if, when you type or paste the text, it does not show up in your command window and all you see is the
cursor -- it is indeed being sent to the server. The first line says you are using the GET request method to get the
resource / (i.e. the file in the base directory of the host), and that you are using HTTP version 1.1. The second tells
the server which host you want to connect to. When you finish typing 'expertsrt.com', hit Enter twice (and twice only).
You should almost immediately get a response that looks like:
HTTP/1.1 301 Moved Permanently
Date: Wed, 08 Feb 2006 07:44:07 GMT
Server: Apache/2.0.54 (Debian GNU/Linux) mod_auth_pgsql/2.0.2b1 mod_ssl/2.0.54 OpenSSL/0.9.7e
Location: http://www.zjjv.com/
Content-Length: 233
Content-Type: text/html; charset=iso-8859-1
<!DOCTYPE HTML PUBLIC "-//IETF//DTD HTML 2.0//EN">
<html><head>
<title>301 Moved Permanently</title>
</head><body>
<h1>Moved Permanently</h1>
<p>The document has moved <a href="http://www.zjjv.com/">here</a>.</p>
</body></html>
Whoops! Looks like we requested a resource that wasn't there; it's been permanently moved to the new Location
http://www.zjjv.com/. If you were using a browser, you'd only see the HTML: everything before the first blank
line is the headers. In fact, modern browsers are even smarter than that. When they see the Location header on the
fourth line, they automatically go there so you don't have to type in a new URL. Let's go to the new URL. By this point, you
probably got disconnected while you were reading this. If so, just press your up arrow on the keyboard to get your telnet
command back, and press Enter to reconnect. If you're still connected, you can just go ahead and type the following:
GET / HTTP/1.1
Host: www.zjjv.com
and press Enter twice after the second line. You'll get another similar response telling you that the page is actually at
http://www.zjjv.com/index.php. The server is particular, isn't it? ;-) Repeat the above, but this time type
GET /index.php HTTP/1.1
Host: www.zjjv.com
Notice that the name of the file we want is in the first line. This time we get flooded with text: the HTML from ERT's homepage.
The headers look like
HTTP/1.1 200 OK
Date: Wed, 08 Feb 2006 08:20:07 GMT
Server: Apache/2.0.54 (Debian GNU/Linux) mod_auth_pgsql/2.0.2b1 mod_ssl/2.0.54 OpenSSL/0.9.7e
X-Powered-By: PHP/4.4.0
Transfer-Encoding: chunked
Content-Type: text/html
Simple, no? Let's move forward and see how this relates to your programming.
Don't worry if you didn't understand every single thing
that we just did. The important thing is to have a general feel for how the browser and server talk to each other,
and to realize that there is nothing magic about it. The take home points are:
- The browser and the server talk to each other using headers.
- Headers are sent before the main content, and are separated from the main content by a blank line (a double CRLF/newline).
- In the header section, there is one header per line. The name of the header comes first, followed by a colon and a space, followed by the content/value of the header:
  Header-Name: header-value
- Headers can contain many types of information and instructions that the server and browser use to help each other know what to do next.
Note: If you're the type who likes to really dig into the details, you can look at
RFC 2616 for the complete HTTP/1.1 specification in all its glory (it has since been superseded by RFC 9110).
In particular, Section 14 of RFC 2616 offers a complete definition for each header field.
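To make the take home points concrete, here is a minimal sketch that splits a raw response like the ones we saw over telnet into its status line, headers, and body. The function name parse_http_response() is my own invention for illustration, not something PHP provides, and the sketch ignores details a real client would have to handle (repeated headers, chunked bodies).

```php
<?php
// Sketch only: parse_http_response() is a hypothetical helper, not a PHP built-in.
function parse_http_response(string $raw): array
{
    // The headers end at the first blank line (CRLF CRLF).
    $parts = explode("\r\n\r\n", $raw, 2);
    $body  = $parts[1] ?? '';

    $lines       = explode("\r\n", $parts[0]);
    $status_line = array_shift($lines); // e.g. "HTTP/1.1 200 OK"

    $headers = [];
    foreach ($lines as $line) {
        // Each header is "Name: value"; split on the first colon only.
        $pieces = explode(':', $line, 2);
        if (count($pieces) === 2) {
            // A repeated header name simply overwrites the earlier value here.
            $headers[trim($pieces[0])] = trim($pieces[1]);
        }
    }
    return ['status' => $status_line, 'headers' => $headers, 'body' => $body];
}

// A shortened version of the 200 OK response shown earlier.
$raw = "HTTP/1.1 200 OK\r\n"
     . "X-Powered-By: PHP/4.4.0\r\n"
     . "Content-Type: text/html\r\n"
     . "\r\n"
     . "<html>...</html>";

$response = parse_http_response($raw);
echo $response['headers']['Content-Type'], "\n"; // prints "text/html"
```

Notice that nothing here is specific to PHP or to browsers: the "conversation" is just text with a blank line in the middle.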
PHP header(): The Basics
Notice the response headers X-Powered-By: PHP/4.4.0 and Content-Type: text/html that were returned when we
finally got to the homepage. PHP was designed from the beginning to output HTML (the 'H' in PHP stands for 'Hypertext'), and
the first time a script generates output (e.g. by using echo), PHP automatically includes those headers for you. This is
very convenient, but also contributes to the confusion many PHP beginners have regarding headers. In more 'bare bones'
languages like Perl that were not originally designed for the web, sending output without including your own headers produces
the dreaded '500 Internal Server Error', so Perl web programmers have no choice but to learn about headers immediately.
The header() function sends HTTP response headers; nothing more, nothing less.
Using this function, you can make your scripts send headers of your choosing to the browser, and create some very useful
and dynamic results. However, the first thing you need to know about the header() function is that you have to use it
before PHP has sent any output (and therefore its default headers).
I doubt there is a PHP programmer in the world who has never seen an error that looks like
Warning: Cannot modify header information - headers already sent by.....
As we said above, the response headers are separated from the content by a blank line. This means you can only send them once,
and if your script has any output (even a blank line or space before your opening <?php tag), PHP sends the headers at that
point without asking you. For example, consider the script below, which seems logical enough:
Welcome to my website!<br />
<?php
    if ($test) {
        echo "You're in!";
    } else {
        // placeholder URL
        header('Location: http://www.example.com/login.php');
    }
?>
What this script is trying to do is redirect the visitor using the Location header if $test is not true.
Do you see the problem? The 'Welcome...' text gets sent no matter what, so the headers are automatically sent.
By the time header() is called, it's already too late: instead of getting redirected,
the user will just see an error message (or if you have error reporting off, nothing but the 'Welcome...' text).
There are basically two solutions to this. The first is to rewrite the code:
<?php
    if ($test) {
        echo "Welcome to my website!<br />You're in!";
    } else {
        // placeholder URL
        header('Location: http://www.example.com/login.php');
    }
?>
The second is output buffering, which can be somewhat more elegant and easy to use.
In our example above, rewriting the code wasn't much trouble, but imagine if there had been quite a bit of
HTML to move around: it could be pretty cumbersome, and it might make our code harder to follow. While our first example
caused an error, the logic of the program was fine. Output buffering allows you to hold on to ('buffer') output (even HTML
outside of PHP code tags) and send it to the browser only when you explicitly say to do so. This way you can program however
you would like to, and explicitly send the output after you've specified any headers you need to. The two relevant functions
are ob_start(), which turns output buffering on, and ob_flush(), which sends the content that has accumulated in the buffer:
<?php
    ob_start(); // begin buffering the output
?>
Welcome to my website!
<?php
    if (true) {
        echo "You're in!";
    } else {
        // placeholder URL
        header('Location: http://www.example.com/login.php');
    }
    ob_flush(); // output the data in the buffer
?>
I encourage you to read more about all of the output buffering functions, which can be quite useful. You should flush the output
buffer as soon as possible, especially if you have quite a bit of content to send. Otherwise, your page will appear to load
slower, because the content will be sent only after it has been entirely assembled, rather than as it is available.
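Two of those other functions are ob_get_clean(), which hands you the buffered text and turns buffering off, and ob_end_clean(), which throws the buffer away. Here is a small sketch of how they might be combined so that a page is either returned in full or discarded before a redirect. The function render_greeting() and its $logged_in flag are hypothetical names I made up for the example.

```php
<?php
// Sketch: render_greeting() and $logged_in are hypothetical, for illustration only.
function render_greeting(bool $logged_in): ?string
{
    ob_start();                    // start capturing output
    echo "Welcome to my website!<br />\n";
    if ($logged_in) {
        echo "You're in!";
        return ob_get_clean();     // return the buffered text and stop buffering
    }
    ob_end_clean();                // discard the buffer; nothing has been sent
    return null;                   // the caller can still call header() safely
}

$page = render_greeting(true);
if ($page !== null) {
    echo $page;                    // send the finished page in one go
}
```

When the function returns null, no output has reached the browser, so a Location header can still be sent.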
Note: The 2nd argument. If you call header() more than once for the same header field, the value for that header will
be the one included in the last call you made. For example,
<?php
    header('Some-Header: Value-1');
    header('Some-Header: Value-2');
?>
would produce the header Some-Header: Value-2. You can cause both headers to be sent by using the second argument
for header(), replace, which is true by default. If you set this to false, the second header value will not replace the first,
and both will be sent. So the code
<?php
    header('Some-Header: Value-1');
    header('Some-Header: Value-2', false); // don't replace the first value
?>
will send both values, which HTTP treats as equivalent to the single header Some-Header: Value-1, Value-2.
You will rarely need this, but it is good to know.
Armed with a good understanding of how HTTP headers and PHP work together, let's look at some specific examples of using this
functionality.
PHP header(): Some Examples
Note: The code snippets appearing below are just that: snippets from
complete working code. When you include them in your own programs, remember to define all your variables,
assign default values, and adhere to other good programming practices.
Redirecting with the Location header
We've seen this one a couple times above: it redirects the browser.
<?php
    header('Location: http://www.zjjv.com/new_location.html');
?>
While you can sometimes get away with supplying a relative URL for the value, according to the HTTP/1.1 specification
(RFC 2616) you should really use an absolute URL. (The later RFC 7231 relaxed this and permits relative references.)
One mistake that is easy to make with the Location header is not calling exit directly afterwards (you may not always
want to do this, but usually you do). The reason this is a mistake is that the PHP code of the page continues to execute even
though the user has gone to a new location. In the best case, this uses system resources unnecessarily. In the worst case,
you may perform tasks that you never meant to. Consider the code below:
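<?php
    // Redirect users with access level below 4
    // ($access_level and the URL are placeholders)
    if ($access_level < 4) {
        header('Location: http://www.example.com/no_access.php');
    }
    // Mail users with higher access level the secret code
    // (send_secret_code() and $user_email are hypothetical)
    send_secret_code($user_email);
    echo 'The secret email is on its way!';
?>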
Unauthorized users are indeed redirected, but in fact, they too will receive the email, because the script continues to run.
To avoid this, the part for authorized users could be wrapped in an else{} statement, but it is cleaner and easier
to call exit immediately after the header() call to end the execution of the script:
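<?php
    // Redirect users with access level below 4
    // ($access_level and the URL are placeholders)
    if ($access_level < 4) {
        header('Location: http://www.example.com/no_access.php');
        exit; // stop script execution
    }
    // Mail users with higher access level the secret code
    // (send_secret_code() and $user_email are hypothetical)
    send_secret_code($user_email);
    echo 'The secret email is on its way!';
?>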
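Since "send a Location header, then exit" is something you will do again and again, it can be worth wrapping in a helper. The sketch below is one way to do it, under my own assumptions: the names location_header() and redirect_to() are hypothetical, and the URL check is deliberately simple (absolute http or https URLs only, no whitespace or line breaks).

```php
<?php
// Sketch: location_header() and redirect_to() are hypothetical helpers, not PHP built-ins.
function location_header(string $url): string
{
    // Accept only absolute http(s) URLs with no whitespace or line breaks,
    // so a stray newline cannot smuggle an extra header into the response.
    if (!preg_match('#^https?://\S+\z#i', $url)) {
        throw new InvalidArgumentException('Expected an absolute http(s) URL');
    }
    return 'Location: ' . $url;
}

function redirect_to(string $url): void
{
    if (headers_sent($file, $line)) {
        // Output has already started, so a redirect is no longer possible.
        throw new RuntimeException("Output already started at $file:$line");
    }
    header(location_header($url));
    exit; // stop here so nothing below the redirect runs
}

// Usage (the URL is a placeholder):
// redirect_to('http://www.example.com/no_access.php');
```

Using headers_sent() this way turns the familiar "headers already sent" warning into an error you can catch and that tells you exactly where the output began.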
Redirecting with the Refresh header
The Refresh header redirects users like the Location header does, but you can add a delay before the user
is redirected. For example, the following code would redirect the user to a new page after displaying the current one for 10
seconds:
<?php
    header('Refresh: 10; url=http://www.zjjv.com/otherpage.php');
    echo 'You will be redirected in 10 seconds';
?>
Another common application is to force a page to update repeatedly by 'redirecting' to the current page (see the second
'conversation' above). For example, here is a simple page that will 'count' down from 10, with a 3 second
pause between numbers:
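<?php
    // 'n' is the query-string parameter used here to carry the counter
    if (!isset($_GET['n'])) {
        $_GET['n'] = 10;
    }
    $n = (int) $_GET['n'];
    if ($n > 0) {
        header('Refresh: 3; url=' . $_SERVER['PHP_SELF'] . '?n=' . ($n - 1));
        echo $n;
    } else {
        echo 'BLAST OFF!';
    }
?>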
Note: If the refresh time is set to 0, then the Refresh header is effectively the same as the Location header.
Serving different types of files and generating dynamic content using the Content-Type header
The Content-Type header tells the browser what type of data the server is about to send. Using this header, you can
have your PHP scripts output anything from plain text files to images or zip files. The table below lists frequently-used
MIME types:
You can do several interesting things with this. For example, perhaps you want to send the user a pre-formatted text file
rather than HTML:
<?php
    header('Content-Type: text/plain');
    echo $plain_text_content;
?>
Or perhaps you'd like to prompt the user to download the file, rather than viewing it in the browser. With the help of the
Content-Disposition header, it's easy to do, and you can even suggest a file name for the user to use:
<?php
    header('Content-Type: application/octet-stream');
    header('Content-Disposition: attachment; '
        . 'filename="plain_text_file.txt"');
    echo $plain_text_content;
?>
Maybe you need to serve a file for download, but you'd like to obscure its true location and name, and only serve it to users
who are logged in:
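<?php
    if ($b_is_logged_in) {
        header('Content-Type: application/octet-stream');
        header('Content-Disposition: attachment; '
            . 'filename="download.pdf"');          // suggested name (placeholder)
        readfile('../private/hidden_file.pdf');    // real location (placeholder path)
    } else {
        echo 'You are not authorized to view this file';
    }
?>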
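If the file to serve depends on something the user sends you, it is safer to look it up in a list you control than to build a path from the request. The sketch below shows one way that might look. Everything in it is assumed for illustration: the $downloads map, its IDs and paths, and the helpers disposition_header() and send_download().

```php
<?php
// Sketch: $downloads, disposition_header() and send_download() are hypothetical.
// Public IDs map to the real (hidden) path and the name suggested to the user.
$downloads = [
    'report' => ['path' => '/var/private/files/a83f.pdf', 'name' => 'report.pdf'],
];

function disposition_header(string $filename): string
{
    // Drop any directory part and keep only characters that are safe
    // inside a quoted file name.
    $safe = preg_replace('/[^A-Za-z0-9._-]/', '_', basename($filename));
    return 'Content-Disposition: attachment; filename="' . $safe . '"';
}

function send_download(array $downloads, string $id, bool $is_logged_in): bool
{
    if (!$is_logged_in || !isset($downloads[$id]) || !is_readable($downloads[$id]['path'])) {
        return false; // the caller decides how to report the refusal
    }
    $file = $downloads[$id];
    header('Content-Type: application/octet-stream');
    header(disposition_header($file['name']));
    header('Content-Length: ' . filesize($file['path']));
    readfile($file['path']); // stream the file to the client
    return true;
}
```

The user only ever sees the ID and the suggested name, so the true location of the file never appears in a URL.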
Frequently-used MIME types:
MIME type                     Description
text/html                     HTML (PHP default)
text/plain                    Plain Text
image/gif                     GIF Image
image/jpeg                    JPEG Image
image/png                     PNG Image
video/mpeg                    MPEG Video
audio/wav                     WAV Audio
audio/mpeg                    MP3 Audio
video/mov, video/quicktime    Quicktime Video
video/x-ms-wmv                Windows WMV video
audio/x-ms-wma                Windows WMA audio
audio/x-realaudio             RealPlayer Audio/Video (.rm)
audio/x-pn-realaudio          RealPlayer Audio/Video (.ram)
video/x-msvideo, video/avi    AVI Video
application/pdf               PDF Document
application/msword            MS Word .doc file
application/zip               Zip File
application/octet-stream      Misc. data. Use to force download or open with application.*
x-foo/x-bar                   Misc. data. Use to force download or open with application.*
Perhaps you've dynamically generated an image using PHP's image functions and you want to display it to the user. You could
create a file build_image.php like this:
<?php
    // build the image above
    // ($image is assumed to hold the image created by that code; PNG is an arbitrary choice)
    header('Content-Type: image/png');
    imagepng($image);
?>
Note: Beware of magic_quotes!
PHP's automatic escaping of special characters with a backslash may seem like a good idea at first, but most good programmers
generally agree that it (a) encourages sloppy programming that does not validate input and (b) causes
annoyances in well-written code that would not occur if "magic quoting" were turned off. One such annoyance is
the corruption of binary data. In the file download example above, if magic_quotes_runtime is on, the data that readfile()
outputs may have backslashes added to it, thus corrupting the file that is sent to the user. Ideally, you should turn
magic_quotes_runtime off in your php.ini file to avoid this, but if you do not have access to the configuration file,
you can also use the set_magic_quotes_runtime() function (pass it the integer 0) to turn the setting off.
Happily, the minutes of a recent PHP developer meeting show that they have decided to abandon magic quotes in future
versions (6+) of PHP. (As it turned out, magic quotes were deprecated in PHP 5.3 and removed in PHP 5.4.) Until
everyone upgrades, however, keeping the problems this feature can cause in mind can save you quite a bit of
trouble and frustration.
You might pass the parameters necessary to generate the image via the URL so you can access them in the $_GET array.
Then in another page, you might include this image using an img tag:
<img src="build_image.php">
The possibilities are more or less endless. The more PHP programming you do, the more you will find that the Content-Type
header truly is your friend.
Note: The way that browsers are supposed to handle content of various MIME types, and the way they actually do,
may not always be consistent (especially with Internet Explorer), so you're well-advised to test your pages in the browsers
you need to support to make sure they behave as expected. The PHP Manual has many helpful tips in the
user-contributed comments on the header() page.
Preventing Page Caching
PHP pages often generate very dynamic content, and to prevent users from missing updates by viewing cached pages, it is
often helpful to be able to tell browsers not to cache certain pages. The following snippet works quite well on the
browsers that are likely to visit your site:
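<?php
    header('Cache-Control: no-cache, no-store, must-revalidate'); // HTTP/1.1
    header('Expires: Thu, 01 Jan 1970 00:00:00 GMT');
    header('Pragma: no-cache'); // HTTP/1.0
?>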
The Expires header can be any date in the past. As with MIME types, browsers (especially older ones) may not
always listen properly to your caching instructions (although most modern ones will).
Other Applications
There are other ways you can use headers as well, such as setting the HTTP Response Code, or performing
HTTP Authentication (if you are running PHP as an Apache module).
Now that you understand how header() works and how to use it, you'll be able to do all sorts of things you
might not have thought of before.
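As a small taste of the first of these, a response code is set by sending the status line itself through header(). The sketch below builds such a line; status_line() is a hypothetical helper of mine, and the header() call is left commented out because it only makes sense at the top of a real page, before any output.

```php
<?php
// Sketch: status_line() is a hypothetical helper, not a PHP built-in.
function status_line(int $code, string $reason, string $protocol = 'HTTP/1.1'): string
{
    return $protocol . ' ' . $code . ' ' . $reason;
}

// Prefer the protocol the client actually used, when the server reports it.
$protocol = $_SERVER['SERVER_PROTOCOL'] ?? 'HTTP/1.1';
$line = status_line(404, 'Not Found', $protocol);

// In a real page this would run before any output:
// header($line);
// PHP 5.4 and later can also set just the code with http_response_code(404).
echo $line, "\n";
```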
Request Headers in PHP
We've covered some of the things you can do with response headers above. We can also get a great deal of information
from the request headers received by the server from the browser. There are two ways to access these. First, many of the
values in the $_SERVER array are determined from the request headers. Second, if PHP is installed as an Apache module, then
apache_request_headers() will return an array of all request headers (even those not in $_SERVER).
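If apache_request_headers() is not available on your setup, you can recover most request headers from $_SERVER yourself, because PHP stores them under keys that start with HTTP_. Here is a rough sketch; request_headers_from_server() is a name I made up, and the sample array stands in for a real $_SERVER.

```php
<?php
// Sketch: request_headers_from_server() is a hypothetical fallback helper.
function request_headers_from_server(array $server): array
{
    $headers = [];
    foreach ($server as $key => $value) {
        if (strpos($key, 'HTTP_') !== 0) {
            continue; // not derived from a request header
        }
        // HTTP_ACCEPT_LANGUAGE -> "accept language" -> "Accept-Language"
        $name = str_replace('_', ' ', strtolower(substr($key, 5)));
        $headers[str_replace(' ', '-', ucwords($name))] = $value;
    }
    return $headers;
}

// Sample input standing in for the real $_SERVER array.
$sample = [
    'HTTP_HOST'            => 'www.example.com',
    'HTTP_USER_AGENT'      => 'Mozilla/5.0',
    'HTTP_ACCEPT_LANGUAGE' => 'en-US,en;q=0.8',
    'REQUEST_METHOD'       => 'GET',
];
print_r(request_headers_from_server($sample));
```

In a real script you would pass $_SERVER itself instead of $sample.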
Security first: don't trust request headers
Since request headers are set by the browser, which is controlled by the client, you must never trust request
headers for information that is important to the security of your site. A good example is the $_SERVER['HTTP_REFERER']
variable, which should hold the URL of the page that referred the user to the current one. A common mistake among
beginners is to think that they can use this to make sure that users only access pages through a certain path, and that
they therefore do not need to worry about server side data validation. For example, consider this code, which attempts to
make sure that data has been submitted from a specific page, rather than a custom form on another website:
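<?php
    // insert_data() and the field names are hypothetical; the 5 second delay is arbitrary
    if (!isset($_SERVER['HTTP_REFERER'])
        || $_SERVER['HTTP_REFERER'] != 'http://www.zjjv.com/myform.html') {
        header('Refresh: 5; url=http://www.zjjv.com/myform.html');
        echo 'You must use the form on my site...redirecting now.';
    } else {
        insert_data($_POST['var1'], $_POST['var2']);
    }
?>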
This might work to deter an unsophisticated hacker who is using his web browser to submit data through a custom form, but someone
who is a little more savvy could easily submit data via a telnet session like we did above, including the request header
Referer: http://www.zjjv.com/myform.html
and easily defeat this 'protection'. The moral of the story is: use HTTP request headers to gather statistics and to help make
the user experience more pleasant. Most request headers you receive will be supplied by standard browsers and will be
entirely truthful. But do not rely on request headers for any issues pertaining to security.
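What you can rely on is checking the submitted data itself on the server, no matter where it claims to come from. Here is a minimal sketch using PHP's filter_var(). The function validate_form() and the field names 'email' and 'age' are assumptions of mine, chosen only to have something to validate.

```php
<?php
// Sketch: validate_form() and the fields 'email' and 'age' are hypothetical.
function validate_form(array $post): array
{
    $errors = [];

    $email = $post['email'] ?? '';
    if (filter_var($email, FILTER_VALIDATE_EMAIL) === false) {
        $errors[] = 'email';
    }

    $age = filter_var(
        $post['age'] ?? null,
        FILTER_VALIDATE_INT,
        ['options' => ['min_range' => 0, 'max_range' => 150]]
    );
    if ($age === false) {
        $errors[] = 'age';
    }

    return $errors; // an empty array means the input passed every check
}

// Usage: only act on the data when validate_form($_POST) returns an empty array.
```

A forged Referer header gets an attacker nowhere here, because the data is judged on its own merits.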
Using HTTP request headers
There are several things you can do with these. Using $_SERVER['HTTP_USER_AGENT'] you can detect the type of browser
the user says it has. You might check the $_SERVER['HTTP_ACCEPT_LANGUAGE'] (perhaps along with
$_SERVER['HTTP_ACCEPT_CHARSET'] and some IP address geolocation) to help determine the
best language in which to serve your pages to a given user.
Although $_SERVER['HTTP_REFERER'] is not reliable for security
purposes, it could be useful as an aid for building statistics about your website traffic or customizing content to
match the path the user took to reach a given page. If for some reason you want to manipulate the raw query string used when
the page was accessed, you can look in $_SERVER['QUERY_STRING']. Looking in $_SERVER['REQUEST_METHOD'] will
tell you whether your page was accessed via GET or POST. There's quite a bit of information there for
you to find creative uses for.
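As an example of the language idea, the sketch below picks the best match between an Accept-Language value and the languages a site offers. It is a simplified reading of the header (it honours the q weights but matches on the primary language tag only), and pick_language(), the supported list, and the default are all my own assumptions.

```php
<?php
// Sketch: pick_language() is a hypothetical helper; $supported and $default
// are assumptions about what the site offers.
function pick_language(string $accept_language, array $supported, string $default): string
{
    $preferences = [];
    foreach (explode(',', $accept_language) as $item) {
        // Each item looks like "fr-CH" or "en;q=0.8"; a missing q means 1.0.
        $parts = explode(';', trim($item));
        $tag   = strtolower(trim($parts[0]));
        $q     = 1.0;
        if (isset($parts[1]) && preg_match('/^\s*q=([0-9.]+)\s*$/i', $parts[1], $m)) {
            $q = (float) $m[1];
        }
        if ($tag !== '' && $q > 0) {
            $preferences[$tag] = $q;
        }
    }
    arsort($preferences); // highest preference first

    foreach ($preferences as $tag => $q) {
        $primary = explode('-', $tag)[0]; // "fr-ch" -> "fr"
        if (in_array($primary, $supported, true)) {
            return $primary;
        }
    }
    return $default;
}

// A browser that prefers French but will accept English, on a site offering en and de:
echo pick_language('fr-CH, fr;q=0.9, en;q=0.8, de;q=0.7', ['en', 'de'], 'en'), "\n"; // prints "en"
```

In a real page the first argument would be $_SERVER['HTTP_ACCEPT_LANGUAGE'], after checking that it is set. Remember that this is a hint from the client, so it is fine for convenience but should never gate anything important.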
HTML Meta Tag HTTP Header Equivalents
Chances are, before reading this article, you have seen or used the HTML meta tag below to redirect a user:
<meta http-equiv="refresh" content="0;url=http://www.zjjv.com/somepage.html" />
Look familiar? The 'http-equiv' meta tags are 'equivalent' to HTTP response headers, and were introduced so that people
writing HTML pages without server side programming would have access to the powerful functionality described above. Using these
meta tags is simple: they can be placed anywhere in the <head> of the document, and their http-equiv
attribute contains the header name, while the content attribute contains the value for the header.
I've found that these, like the HTTP headers in general, often produce confusion, but now they should seem quite simple to you.
Although I usually prefer to use the PHP header() function, these meta tag HTTP header equivalents are often very handy
for things like specifying the character set. For example, I often use this in my HTML pages (and sometimes my PHP ones):
<meta http-equiv="Content-Type" content="text/html; charset=utf-8" />
Note: Support for HTTP headers as equivalent meta tags is not uniform across browsers, so it is usually safer and
faster to use the headers themselves if you can. Also, it should be obvious that some headers and values will not work as meta
equivalents: you cannot set the Content-Type to image/png when the real headers have been sent and the
browser is already reading the HTML ;-)
Conclusion
Now that you are done with this article, you should have a pretty firm grasp of how HTTP works, how request and response headers
are used, and how you can employ this functionality in your programming. This reasonably detailed knowledge should also enable
you to start thinking more critically about your web application efficiency and security. I hope that as you move forward with your
programming, you will find that you've become quite comfortable working with HTTP headers, and that you are able to exploit them to
make your job easier and your pages better.
As a parting thought, remember that headers are like words: they convey information and ask for certain actions to be performed,
but by themselves they don't force anything to happen. 99.9% of the time, cooperative browsers are talking to cooperative servers,
and everything happens smoothly. But you have to remember that, as in life, every once in a while you'll run across a jerk
(a hacker), or someone who's got his own way of doing things (Internet Explorer). Web development is very much a job of customer
service, so you've got to do your best to keep the crooks out, and accommodate the customers with 'special needs.' ;-)
I hope you found this tutorial helpful. Feel free to send questions, comments, or flames (but please, no spam) to info [_at_] nicholassolutions (dot) com (you may also use the contact form on our website,
nicholassolutions.com).
This tutorial contains copyright-protected content, which may be distributed under the Creative Commons Attribution NonCommercial NoDerivs 2.5 License