How do search engine works??


search engine databases are selected and built by the computer robot programs called  spider
A search engine maintains the following processes in near real time:
->Web crawling
->Indexing
->Searching

1, Crawling   -  Scanning the site and getting a complete list of everything including the page title, text,images, keywords it contains and any other pages it links to  abare minimum. Modern crawlers may cache a copy of the whole page, and look for some additional information such as the page layout. Web pages are crawled by a web spider . It visits each page, just like you or I would, only very quickly and it is added indexed page list. When a page contains hyperlink (link to another page) it is automatically added to discovered page list by spider in crawling
.
2. indexing Indexing is the process of taking all of that data  that you have from a crawling and placing it in system database. Imagine  to  make a list of all the books  their author and the number of pages. Going through each book is the crawl and writing the list is the index All of this data is stored in huge  data-centres with thousands of petabytes of drives.Hence in indexing  it is analyzed and stored in system databases .

3. searching- In this step user query fetches a list of relevant pages.When you search anything in the browser search bar  and the search engine attempts to display the most relevant information or document it finds that match your query. It is  only the area in which search engines differentiate themselves. Example some work with keywords, some allow you to ask a question, and some include advanced features like keyword proximity .


In advance search engine find the pages for the potential inclusion by the following the links in the pages that already have in their database during crawling.Search engine spider only find them if web page is linked to any other page.We can access brand of new pages which are never linked to the other page by submitting the pages to the search engine.In Indexing  identifies text ,links to other page and stores in the search engine database.Hence database can be searched by the keyword during searching.
Some types of the pages and links are excluded from most search engine policy and other are excluded because search engine spider cannot find them.Such excluded pages are referred as Invisible Web.We can't see invisible web on normal search engine results. The Invisible Web is estimated to be two or three times bigger than the visible web.
If you want to get more information click here and comment below.