25-08-2017, 09:32 PM
Internet searching
ABSTRACT:
Introduction - steps in searching - crawling: Googlebot discussed in detail, Gopher, Archie, meta tags - indexing: weights for words, data structures in indexing (hashing) - search query processing: multiple words with Boolean operators, page rank, spelling correction - future searches: natural-language queries
INTRODUCTION:
On the World Wide Web, there are hundreds of millions of pages available, waiting to present information on an amazing variety of topics. When you need to know about a particular subject, you visit an Internet search engine.
Internet search engines are special sites on the Web designed to help people find information stored on other sites. There are differences in the ways various search engines work, but they all perform three basic tasks:
• They search the Internet -- or select pieces of the Internet -- based on important words.
• They keep an index of the words they find, and where they find them.
• They allow users to look for words or combinations of words found in that index.
WEB CRAWLER:
Early programs with names like Gopher and Archie kept indexes of files stored on servers connected to the Internet, reducing the amount of time required to find programs and documents.
To find information on the hundreds of millions of Web pages that exist, a search engine employs special software robots, called spiders, to build lists of the words found on Web sites. When a spider is building its lists, the process is called Web crawling. In order to build and maintain a useful list of words, a search engine's spiders have to look at a lot of pages.
Google built its initial system to use multiple spiders, usually three at a time. Each spider could keep about 300 connections to Web pages open at once. At peak performance, using four spiders, the system could crawl roughly 100 pages per second, generating around 600 KB of data each second.
Keeping everything running quickly meant building a system to feed the necessary information to the spiders. The early Google system had a server dedicated to providing URLs to the spiders. Rather than depending on an Internet service provider (ISP) for the Domain Name Server (DNS) that translates a server's name into an address, Google ran its own DNS to keep delays to a minimum.
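The core crawling step described above can be sketched in a few lines: given a page's HTML, a spider extracts the words to index and the outgoing links to crawl next. This is a minimal illustration using Python's standard library; the sample page and its URLs are hypothetical, and a real spider would also fetch pages over the network and manage a queue of URLs.

```python
# Minimal sketch of one step of a web crawler ("spider"):
# parse a page, extract its words and outgoing links.
from html.parser import HTMLParser

class SpiderParser(HTMLParser):
    """Collects visible words and href links from one HTML page."""
    def __init__(self):
        super().__init__()
        self.words = []   # words to hand to the indexer
        self.links = []   # URLs to add to the crawl frontier
        self._skip = False  # True while inside <script>/<style>

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip = True
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)

    def handle_endtag(self, tag):
        if tag in ("script", "style"):
            self._skip = False

    def handle_data(self, data):
        if not self._skip:
            self.words.extend(data.lower().split())

# Hypothetical page content for illustration.
page = "<html><title>Fast cars</title><body>Cars go <a href='/fast'>fast</a></body></html>"
parser = SpiderParser()
parser.feed(page)
print(parser.words)
print(parser.links)
```

Each link found this way would be handed back to the URL server, which feeds it to a spider on a later pass.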
Metatags:
Meta tags allow the owner of a page to specify key words and concepts under which the page will be indexed. This can be helpful, especially in cases in which the words on the page might have double or triple meanings -- the meta tags can guide the search engine in choosing which of the several possible meanings for these words is correct.
Defect: A careless or unscrupulous page owner might add meta tags that fit very popular topics but have nothing to do with the actual contents of the page. Prevention: To protect against this, spiders correlate meta tags with page content, rejecting meta tags that don't match the words on the page.
If webmasters wish to restrict the information on their site available to Googlebot, or another well-behaved spider, they can do so with the appropriate directives in a robots.txt file, or by adding the meta tag <meta name="Googlebot" content="nofollow" /> to the web page.
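A robots.txt file with such directives might look like the fragment below. The site layout (/private/) is hypothetical; the directives simply tell a well-behaved crawler which paths it may not fetch.

```
# Hypothetical robots.txt: keep Googlebot out of /private/,
# let all other crawlers index the whole site.
User-agent: Googlebot
Disallow: /private/

User-agent: *
Disallow:
```

Well-behaved spiders check this file at the site root before crawling; the file cannot actually prevent access, it only requests it.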
Indexing:
Once the spiders have completed the task of finding information on Web pages, the search engine must store the information in a way that makes it useful. There are two key components involved in making the gathered data accessible to users:
• The information stored with the data
• The method by which the information is indexed
An engine might store the number of times a word appears on a page. The engine might assign a weight to each entry, with increasing values assigned to words as they appear near the top of the document, in sub-headings, in links, in the meta tags or in the title of the page. Each commercial search engine has a different formula for assigning weight to the words in its index.
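The weighting idea above can be sketched with a tiny inverted index: each word maps (via a hash table, the data structure commonly used for this) to the pages it appears on, with a weight per page. The weight values and the Boolean-AND query helper here are illustrative assumptions, not any engine's real formula.

```python
# Sketch of an inverted index with simple per-word weights.
from collections import defaultdict

TITLE_WEIGHT = 3  # assumed: title words count more than body words
BODY_WEIGHT = 1

# word -> {page_url: accumulated weight}; a Python dict is a hash table.
index = defaultdict(dict)

def add_page(url, title, body):
    """Index one page, weighting title words more heavily."""
    for word in title.lower().split():
        index[word][url] = index[word].get(url, 0) + TITLE_WEIGHT
    for word in body.lower().split():
        index[word][url] = index[word].get(url, 0) + BODY_WEIGHT

def search_and(*terms):
    """Boolean AND: pages containing every term, ranked by total weight."""
    postings = [index.get(t.lower(), {}) for t in terms]
    common = set.intersection(*(set(p) for p in postings))
    return sorted(common, key=lambda url: -sum(p[url] for p in postings))

# Hypothetical pages for illustration.
add_page("a.com", "fast cars", "cars are fast and fun")
add_page("b.com", "slow boats", "boats are not cars")
print(search_and("fast", "cars"))  # only a.com contains both terms
```

A query with multiple words and the AND operator, as mentioned in the abstract, reduces to intersecting the posting sets of each word and ranking the survivors by accumulated weight.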
FUTURE SEARCHES:
Many groups are working to improve both results and performance of this type of search engine. Others have moved on to another area of research, called natural-language queries.
The idea behind natural-language queries is that you can type a question the same way you would ask it of a human sitting beside you -- no need to keep track of Boolean operators or complex query structures. The most popular natural-language query site today is AskJeeves.com, which parses the query for keywords that it then applies to the index of sites it has built. It only works with simple queries, but competition is heavy to develop a natural-language query engine that can accept a query of great complexity.