|
Home > Getting Started > Featured Article Time Traveling on the InternetCarl Heine and Dennis O'Connor
Occasionally, however, those who time travel without knowing it, encounter problems. These problems stem from not recognizing that information can be live and/or archived. For example, the three links below show the BBC Home Web page. The first link leads to the live BBC news page; the other two links display archived copies of earlier editions. Depending on the precise moment these links are clicked, the information retrieved may look nearly identical or be considerably different.
The archived information block at the top of the page should tip the user off that they are viewing news from the "past." Compare the url of the live BBC page: http://www.bbc.co.uk/
The longer URLs are saved copies that will not change until the search engine crawlers revisit the BBC home page to copy updated information and refresh the search engine databases. Google & Ask crawl the BBC page on a different schedule yielding different results in each system's cache. Most fast-changing pages, like the BBC's, are crawled frequently but not necessarily daily. It is possible to find cached pages that have not been crawled for weeks. Tripped up by Time Travel
Searching Google (or any other search engine) is really about retrieving the most current copy of a page. The most current copy is called the cache. The results snippet, which is an abstract of the cached page, is from the Internet past. Everything about the snippet is cached. The only way to the present is to click the title link. This takes the search from the past into the present. You leave the cache and jump to the current edition of the live page webpage. This also explains why clicking a link from a search engine can hand you a PAGE NOT FOUND error message. This just means the url has changed since the last time the search engine made a copy. (Hint: Truncate the URL and find the local site search tools and you may find what you were looking for!)
Everything in this snippet goes back to the cache: the only way to the present is to click the first link! |
|||||||||||||
| There is only one way to search live pages: browsing. Browsing requires knowing what keywords may lead in the direction of the information needed. Browsing also depends heavily on luck, which makes it a more difficult search method than using a search engine. | |
| There are three ways to search a cache: search engine, subject directory and browsing. Each has advantages in certain situations. | |





