The Parts of a URL
- http://websearch.about.com/od/dailywebsearchtips/qt/dnt0526.htm is our example.
- http://: "HTTP" stands for Hypertext Transfer (or Transport) Protocol, and is the identification technology used to communicate between Web servers and the users who access their information. In some URLs, you might see "HTTPS", which stands for "Hyper Text Transfer Protocol", a secure technology developed to keep sensitive transactions and information safe online. This first part of the URL indicates what protocol the URL will be using in order to deliver the information from the server to the user, and is in front of the domain name, separated by the "://".
- domain name: The domain name is the textual representation of the IP address, used to identify a specific Web page or pages, and comes after the "://". For example, one of the IP addresses assigned to Facebook is 184.108.40.206; conversely, the domain name for Facebook's IP address is facebook.com. Every domain name has a top level domain. These are simple suffixes attached to the end of the domain name that indicates its place in the domain name hierarchy. Common top level domains are .com, org, .net, and .gov.
- forward slash,then file name: path or directory on the computer to this file; which in our case is "od/dailywebsearchtips/qt". Think of a file cabinet, with folders assigned to specific topics, projects, or information. This identifying information is simply locating where these files can be found on the Web server, then displaying them for the end user.
- name of file: name of file, usually ending in .html or .htm. The file name for this file is "dnt0526"; moving backwards through the URL we can deduce that the file lies in sub-directories called "qt", "dailywebsearchtips", and "od", is on the domain name "websearch.about.com", and is being delivered to us by the HTTP protocol.
History of the URL
In addition to creating HTML, the standard language used to create the vast majority of pages on the World Wide Web, and hyperlinks, the linked text connections between one Web page to the other, Tim Berners-Lee also originated the idea of Uniform Resource Locators, an organizational system that gave every Web page its own unique locational address online (you can read more about the Web's early history at How the Web Got Started).
Different Kinds of URLs
There are a wide variety of different kinds of URLs, as well as different terms to describe what a URL looks like. For example:
Messy: This is a URL with a lot of garbled numbers and letters on it that makes little organizational sense, i.e., "http://www.example.com/woeiruwoei909305820580". Typically these URLs are computer-generated from programs creating thousands of Web pages on the same domain name.
Dynamic: These are what the previous explanation of "messy URLs" really come from. Dynamic URLs are the end result of database queries that provide content output based on the result of that query. The URL ends up looking quite garbled, aka "messy", and often includes the following characters: ?, &, %, +, =, $. Dynamic URLs are often found as part of consumer-driven websites: shopping, travel, or anything that requires changing answers for many different user queries.
Static: A static URL is the opposite of a dynamic URL. The URL is "hard-wired" into the Web page's HTML coding and will not change depending on what the user requests.
Obfuscated: Obfuscated, or hidden, URLs are primarily used in phishing scams. Basically, a familiar URL is distorted in some way to make it seem legitimate. The user clicks on the obfuscated URL and is redirected to a malicious website.
Clues That Web Addresses Can Give You
There's a lot of information that you can glean from a simple URL, including:
- what kind of server the Web page is hosted on
- what kind of organization the Web page belongs to
- where the Web page is located in the world
- the names of the directories on the website
By carefully looking at the different parts of any Web address, you can quickly determine quite a bit of useful information. In addition, by simply deleting parts of the URL, you can learn more about the website than what might be actually publicly accessible. For example:
- http://www.widget.com/blog/music/: This points to a resource online, and the URL tells you that yes, indeed, it does point to a online resource. Let's go further back.
- http://www.widget.com/blog/: By moving backwards in the URL from right to left, we we can see that we're now at the blog section of this publication.
- http://www.widget.com: The home page of the website.
Of course, this is a very simple example. However, by dissecting complex URLs one step at a time, quite a bit of information can be uncovered.