UTeach CS Principles

Unit 6: Innovative Technologies

World Wide Web

WWW

How many times have you seen www. at the start of a URL? It is so ubiquitous that many web browsers and web sites will insert it into the URL even if you do not type it. But www. is a part of a domain’s address that, by convention, indicates a server hosting content designed to meet the standards of the World Wide Web. And almost every online service you likely use is a part of the World Wide Web.

In fact, the World Wide Web is one of those things that most of us use on a regular basis without ever thinking about how it works or what problems it was originally created to solve. But, since its inception in the early 1990s, the Web has proven to be one of the most revolutionary and empowering inventions in history.

Not to be confused with the broader concept of the Internet, the World Wide Web itself is a content-oriented ecosystem that has been built atop the globally networked infrastructure of the Internet. It was designed primarily as an open platform that could give users from all over the world a standard and accessible means of communicating and sharing information online.

Origins and Growth of the Web

"In those days, there was different information on different computers, but you had to log on to different computers to get at it. Also, sometimes you had to learn a different program on each computer. Often it was just easier to go and ask people when they were having coffee..."
- Tim Berners-Lee

While working at CERN near Geneva, Switzerland, British computer scientist Tim Berners-Lee recognized the potential of the Internet as a communications and computational medium and proposed the development of a platform that might help to overcome some of its limitations.

As the Internet became more established, the world’s many computers, servers, routers, and other computational devices gradually became networked together into a worldwide, interconnected ecosystem. However, much the same way that the introduction of air travel in the early 1900s suddenly brought together people from far-off lands who spoke different languages, shared different customs, and adhered to different laws, the Internet also exposed similar differences and incompatibilities between the world’s various computing systems.

Berners-Lee proposed that a standardized set of protocols and tools be developed that might help to ease the integration of these disparate computing systems and to facilitate improved communications between them. In short, he wanted to employ the ideas of abstraction to design a more generalized means of sharing information across the Internet that was independent of any particular hardware or software that a user might be using. It is also important to keep in mind that abstractions can be combined. Lower-level abstractions can be blended to make higher-level abstractions, such as short message service (SMS) or email messages, images, audio files, and videos.

As a result of his efforts, Berners-Lee created the set of fundamental tools and technologies that make up what we now more familiarly know as the World Wide Web*.

Web Applications:

Web browser: Client application that runs on an end-user’s computer and is used to request and view web pages.
Web server: Program that runs on a remote computer and that serves up web pages.

Web Technologies:

HTML (Hypertext Markup Language): A standardized set of formatting instructions that dictate how the content of a web page should be arranged and displayed by the client application (i.e., web browser).
URI (Uniform Resource Identifier): A unique address that identifies each resource on the Web. The most familiar kind of URI is the URL (Uniform Resource Locator).
HTTP (Hypertext Transfer Protocol): Standards for requesting and receiving linked resources from across the Web.
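To make the idea of a URL more concrete, here is a minimal Python sketch that splits a URL into its main parts using the standard library's `urllib.parse` module. The helper name `describe_url` is hypothetical and only for illustration; it is not part of the course materials.

```python
from urllib.parse import urlparse


def describe_url(url):
    """Split a URL into the protocol, the server name, and the resource path."""
    parts = urlparse(url)
    return {
        "scheme": parts.scheme,    # the protocol to use, e.g. "http"
        "host": parts.hostname,    # the server to contact, e.g. "www.google.com"
        "path": parts.path or "/", # the resource on that server ("/" if none given)
    }


if __name__ == "__main__":
    print(describe_url("http://www.google.com"))
```

Notice that the `www.` prefix is simply the first part of the host name; the scheme (`http`) is what tells the browser which protocol to speak.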

On August 6, 1991, Berners-Lee brought the world’s first web site online. It ran on a NeXT cube computer located in his lab at CERN. A sticker displayed prominently on the front of the machine read, “This machine is a server. DO NOT POWER IT DOWN!!”

<img src="../images/640px-NeXTcube_first_webserver.jpeg" width="300"> <img src="../images/2251820987_343559b717_z.jpg" width="300">

*Interestingly enough, “World Wide Web” was not the only name that Berners-Lee considered when choosing a name for his creation. He almost named it one of his other ideas: Information Mesh, Mine of Information, or Information Mine. Consider how the Mine might look today with URLs like moi.google.com or moi.facebook.com instead of our familiar www. prefix.

Hyperlinks

One of the key features that Berners-Lee incorporated into his invention is the use of hyperlinks to connect documents with one another in a non-linear way. While the pages of a book are arranged linearly in sequence (e.g., page 1, page 2, page 3, etc.), there is no such sequencing of documents in the World Wide Web. Instead, like the multiply connected computers of the Internet, the Web consists of a collection of massively interconnected pages of content.

Each web page is effectively a single, text-based document that has been “marked up” with embedded formatting instructions known as HTML (Hypertext Markup Language) tags. Each of these electronic documents is stored on a computer running a web server. The location of the file within the computer’s file system corresponds to the document’s URL (i.e., the address of the web page).
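The relationship between a URL and a file on the server can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the document root `/var/www/html`, the default file name `index.html`, and the helper name `url_to_file_path` are all hypothetical, and real web servers can be configured to map URLs to files in many other ways.

```python
from pathlib import PurePosixPath
from urllib.parse import unquote, urlparse


def url_to_file_path(url, document_root="/var/www/html"):
    """Map a URL to the file a simple web server might read (assumed layout)."""
    path = unquote(urlparse(url).path)
    # Assumption: a URL that names a folder is answered with its index.html file.
    if path == "" or path.endswith("/"):
        path += "/index.html"
    # Drop empty and "." segments left over from extra slashes.
    parts = [p for p in path.split("/") if p and p != "."]
    # Refuse ".." so a request cannot climb out of the document root.
    if ".." in parts:
        raise ValueError("path escapes the document root")
    return str(PurePosixPath(document_root, *parts))


if __name__ == "__main__":
    print(url_to_file_path("http://www.example.com/courses/unit6/web.html"))
```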

A hyperlink is a clickable bit of text, image, or other on-screen element within an HTML document that a user can select to request another, related document. Each link is designed to enable the user to selectively seek out, or browse, from one document to the next, following whatever sequence they choose. This non-linear approach to organizing and connecting information has created an unlimited number of new ways that people can find, learn, and consume information.

Consider the following bit of HTML:

You can <a href="http://www.google.com">search</a> for
something, <a href="http://www.twitter.com">tweet</a> a
comment, or <a href="http://www.facebook.com">like</a> a
friend’s post at these popular sites.

The above example would produce the following hyperlinked text within a web page:

You can search for something, tweet a comment, or like a friend’s post at these popular sites.

Here, you can see that “search,” “tweet,” and “like” have each been formatted to act as hyperlinks (linking to Google, Twitter, and Facebook, respectively). Each hyperlink is denoted with the use of an anchor (<a>...</a>) tag that frames the text being linked (e.g., “search,” “tweet,” and “like”). Each anchor tag includes the URL of the other page or site that the hyperlink is referencing (e.g., href="...").
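A program can find the same hyperlinks that a browser does by looking for anchor tags. The following is a minimal sketch using Python's built-in `html.parser` module; the class name `LinkCollector`, the function `extract_links`, and the constant `SAMPLE` are hypothetical names chosen for this illustration.

```python
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collect (URL, linked text) pairs from the anchor tags in an HTML document."""

    def __init__(self):
        super().__init__()
        self.links = []    # finished (href, text) pairs
        self._href = None  # href of the anchor we are currently inside, if any
        self._text = []    # pieces of text seen inside that anchor

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.links.append((self._href, "".join(self._text)))
            self._href = None


def extract_links(html):
    collector = LinkCollector()
    collector.feed(html)
    collector.close()
    return collector.links


SAMPLE = (
    'You can <a href="http://www.google.com">search</a> for '
    'something, <a href="http://www.twitter.com">tweet</a> a '
    'comment, or <a href="http://www.facebook.com">like</a> a '
    "friend's post at these popular sites."
)

if __name__ == "__main__":
    for url, text in extract_links(SAMPLE):
        print(text, "->", url)
```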

When a user clicks on any of these links, the web browser sends a request to the corresponding web server for the specified page (as referenced in the href attribute).
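The request that the browser sends is itself just text that follows the rules of HTTP. As a rough sketch, the Python function below builds the text of a simple HTTP/1.1 GET request for a URL. The function name `build_get_request` is hypothetical, and a real browser sends many more header lines than the two shown here.

```python
from urllib.parse import urlparse


def build_get_request(url):
    """Return the text of a minimal HTTP/1.1 GET request for the given URL."""
    parts = urlparse(url)
    path = parts.path or "/"  # ask for the site's root page if no path is given
    return (
        f"GET {path} HTTP/1.1\r\n"   # request line: method, resource, protocol version
        f"Host: {parts.hostname}\r\n"  # which site on the server we want
        "Connection: close\r\n"      # ask the server to close the connection afterward
        "\r\n"                       # a blank line marks the end of the request headers
    )


if __name__ == "__main__":
    print(build_get_request("http://www.google.com"))
```

The web server reads this text, finds the requested resource, and sends back a response containing the HTML of the page.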

© 2018 The University of Texas at Austin

Exercise #1: Map the Web

Build a map of the World Wide Web. OK, maybe not all of it (it is rather large, after all). In this exercise, you will begin mapping out the interconnectedness of a very small portion of the Web.

1. Using your preferred search engine (Google, Bing, DuckDuckGo, etc.), conduct a search for your own name.
2. Record the URL of the first link that your search returns.
3. Visit that URL and count and record the total number of different links that you can find on that page.
4. Also record the URLs of up to three more of the hyperlinks on that page.
5. Repeat this process, counting and recording hyperlinks for each URL you record, for at least two more levels.

Using your findings, estimate the total number of different pages that could be reached if you were to start at the URL found from your original “vanity search” (i.e., searching for your own name) and followed a series of five clicks. What about 10 clicks? 20 clicks?
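One rough way to make this estimate is to assume that every page contains the same number of links and that no two links ever lead to the same page. Both assumptions are simplifications (real pages vary widely and often link back to each other), so the result is an upper bound rather than an exact count. The function names below are hypothetical.

```python
def pages_at_depth(links_per_page, clicks):
    """Pages reachable in exactly `clicks` clicks, assuming no page is repeated."""
    return links_per_page ** clicks


def pages_within(links_per_page, clicks):
    """Pages reachable in `clicks` clicks or fewer, counting the starting page."""
    return sum(links_per_page ** k for k in range(clicks + 1))


if __name__ == "__main__":
    # Assumption: each page has 3 distinct links, as recorded in step 4.
    for clicks in (5, 10, 20):
        print(clicks, "clicks:", pages_at_depth(3, clicks), "pages")
```

Try replacing 3 with the average number of links you actually counted in step 3 to see how quickly the estimate grows.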

Exercise #2: Wikipedia Race

Your teacher will select a random topic for you to look up on Wikipedia. This will be your starting point for the race. Your teacher will then name a second topic. This will be your target. Your goal is to browse through Wikipedia to reach your target topic by only clicking on hyperlinks within the body of a Wikipedia article. What is the shortest path that you can find to get from the starting topic to the ending topic (i.e., following the fewest number of links)?
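If the links between articles were already known, a computer could find the shortest path with a technique called breadth-first search, which explores all pages one click away, then all pages two clicks away, and so on. The sketch below assumes a tiny, made-up set of links stored in the dictionary `LINKS`; it does not reflect the real links on Wikipedia, and the function name `shortest_link_path` is hypothetical.

```python
from collections import deque


def shortest_link_path(links, start, target):
    """Return a path with the fewest clicks from start to target, or None."""
    if start == target:
        return [start]
    previous = {start: None}  # for each visited page, the page we arrived from
    queue = deque([start])
    while queue:
        page = queue.popleft()
        for neighbor in links.get(page, []):
            if neighbor in previous:
                continue  # already reached by a path that is at least as short
            previous[neighbor] = page
            if neighbor == target:
                # Walk backward from the target to rebuild the path.
                path = [neighbor]
                while previous[path[-1]] is not None:
                    path.append(previous[path[-1]])
                return path[::-1]
            queue.append(neighbor)
    return None


# Made-up example links (assumption: not taken from Wikipedia).
LINKS = {
    "Coffee": ["CERN", "Geneva"],
    "Geneva": ["Switzerland", "CERN"],
    "CERN": ["Tim Berners-Lee", "Geneva"],
    "Tim Berners-Lee": ["World Wide Web"],
    "Switzerland": ["Geneva"],
}

if __name__ == "__main__":
    print(shortest_link_path(LINKS, "Coffee", "World Wide Web"))
```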

UTeach Computer Science—http://uteachcs.org