Free Republic
Browse · Search
General/Chat
Topics · Post Article

Skip to comments.

10 Search Engines to Explore the Invisible Web
MakeUseOf.com ^ | Mar. 14th, 2010 | Saikat Basu

Posted on 03/23/2010 3:15:34 AM PDT by Daffynition

No, it’s not Spiderman’s latest web slinging tool but something that’s more real world. Like the World Wide Web.

The Invisible Web refers to the part of the WWW that’s not indexed by the search engines. Most of us think that that search powerhouses like Google and Bing are like the Great Oracle…they see everything. Unfortunately, they can’t because they aren’t divine at all; they are just web spiders who index pages by following one hyperlink after the other.

But there are some places where a spider cannot enter. Take library databases which need a password for access. Or even pages that belong to private networks of organizations. Dynamically generated web pages in response to a query are often left un-indexed by search engine spiders.


Search engine technology has progressed by leaps and bounds. Today, we have real time search and the capability to index Flash based and PDF content. Even then, there remain large swathes of the web which a general search engine cannot penetrate. The term, Deep Net, Deep Web or Invisible Web lingers on.

To get a more precise idea of the nature of this ‘Dark Continent’ involving the invisible and web search engines, read what Wikipedia has to say about the Deep Web. The figures are attention grabbers – the size of the open web is 167 terabytes. The Invisible Web is estimated at 91,000 terabytes. Check this out – the Library of Congress, in 1997, was figured to have close to 3,000 terabytes!

How do we get to this mother load of information?

That’s what this post is all about. Let’s get to know a few resources which will be our deep diving vessel for the Invisible Web. Some of these are invisible web search engines with specifically indexed information.

Infomine

invisible web search engines

Infomine has been built by a pool of libraries in the United States. Some of them are University of California, Wake Forest University, California State University, and the University of Detroit. Infomine ‘mines’ information from databases, electronic journals, electronic books, bulletin boards, mailing lists, online library card catalogs, articles, directories of researchers, and many other resources.

You can search by subject category and further tweak your search using the search options. Infomine is not only a standalone search engine for the Deep Web but also a staging point for a lot of other reference information. Check out its Other Search Tools and General Reference links at the bottom.

The WWW Virtual Library

invisible web search engines

This is considered to be the oldest catalog on the web and was started by started by Tim Berners-Lee, the creator of the web. So, isn’t it strange that it finds a place in the list of Invisible Web resources? Maybe, but the WWW Virtual Library lists quite a lot of relevant resources on quite a lot of subjects. You can go vertically into the categories or use the search bar. The screenshot shows the alphabetical arrangement of subjects covered at the site.

Intute

invisible web search engines

Intute is UK centric, but it has some of the most esteemed universities of the region providing the resources for study and research. You can browse by subject or do a keyword search for academic topics like agriculture to veterinary medicine. The online service has subject specialists who review and index other websites that cater to the topics for study and research.

Intute also provides free of cost over 60 free online tutorials to learn effective internet research skills. Tutorials are step by step guides and are arranged around specific subjects.

Complete Planet

search invisible web

Complete Planet calls itself the ‘front door to the Deep Web’. This free and well designed directory resource makes it easy to access the mass of dynamic databases that are cloaked from a general purpose search. The databases indexed by Complete Planet number around 70,000 and range from Agriculture to Weather. Also thrown in are databases like Food & Drink and Military.

For a really effective Deep Web search, try out the Advanced Search options where among other things, you can set a date range.

Infoplease

search invisible web

Infoplease is an information portal with a host of features. Using the site, you can tap into a good number of encyclopedias, almanacs, an atlas, and biographies. Infoplease also has a few nice offshoots like Factmonster.com for kids and Biosearch, a search engine just for biographies.

DeepPeep

search invisible web

DeepPeep aims to enter the Invisible Web through forms that query databases and web services for information. Typed queries open up dynamic but short lived results which cannot be indexed by normal search engines. By indexing databases, DeepPeep hopes to track 45,000 forms across 7 domains.

The domains covered by DeepPeep (Beta) are Auto, Airfare, Biology, Book, Hotel, Job, and Rental. Being a beta service, there are occasional glitches as some results don’t load in the browser.

IncyWincy

how to use the invisible web

IncyWincy is an Invisible Web search engine and it behaves as a meta-search engine by tapping into other search engines and filtering the results. It searches the web, directory, forms, and images. With a free registration, you can track search results with alerts.

DeepWebTech

how to use the invisible web

DeepWebTech gives you five search engines (and browser plugins) for specific topics. The search engines cover science, medicine, and business. Using these topic specific search engines, you can query the underlying databases in the Deep Web.

Scirus

how to use the invisible web

Scirus has a pure scientific focus. It is a far reaching research engine that can scour journals, scientists’ homepages, courseware, pre-print server material, patents and institutional intranets.

TechXtra

TechXtra concentrates on engineering, mathematics and computing. It gives you industry news, job announcements, technical reports, technical data, full text eprints, teaching and learning resources along with articles and relevant website information.

Just like general web search, searching the Invisible Web is also about looking for the needle in the haystack. Only here, the haystack is much bigger. The Invisible Web is definitely not for the casual searcher. It is a deep but not dark because if you know what you are searching for, enlightenment is a few keywords away.

Do you venture into the Invisible Web? Which is your preferred search tool?

Image credit: MarcelGermain


TOPICS: Computers/Internet; Education; History
KEYWORDS: deepnet; deepweb; internet; invisibleweb; pl; searchengine; searchengines
Navigation: use the links below to view more comments.
first previous 1-2021-4041-6061-62 next last
To: dangerdoc
Remember, that figure (20TB) was from 1997, 13 years ago.

The web's content has grown by literally orders of magnitude since then.

BTW, being something of a proper-English nut, I hate it when people use the terms "literally" and "order of magnitude" improperly. But the above is indeed true (at least 10^2), so I'll take the grammar risk.

If the google quote is right, 5,000,000,000 TB, then even my statement falls short by many orders of magnitude.

41 posted on 03/23/2010 10:49:36 AM PDT by dayglored (Listen, strange women lying in ponds distributing swords is no basis for a system of government!)
[ Post Reply | Private Reply | To 39 | View Replies]

To: dayglored; dangerdoc
oh crap, I can't cut and paste any more.

> If the google quote is right, 5,000,000,000 TB

should have been

If the google quote is right, 5,000,000 TB

42 posted on 03/23/2010 10:52:03 AM PDT by dayglored (Listen, strange women lying in ponds distributing swords is no basis for a system of government!)
[ Post Reply | Private Reply | To 41 | View Replies]

To: Daffynition

Maybe I missed something, but how do these search engines access private networks?


43 posted on 03/23/2010 10:54:46 AM PDT by Straight Vermonter (Congratulations UVM Catamounts - Going to the Big Dance!)
[ Post Reply | Private Reply | To 1 | View Replies]

To: dayglored

“If the google quote is right, 5,000,000 TB”

How much of that is porn, pirated movies and left wing blather? Makes you wonder if this whole interweb thing was a good idea.


44 posted on 03/23/2010 11:27:41 AM PDT by dangerdoc
[ Post Reply | Private Reply | To 42 | View Replies]

To: Daffynition

For later.


45 posted on 03/23/2010 11:29:28 AM PDT by Lurker (The avalanche has begun. The pebbles no longer have a vote.)
[ Post Reply | Private Reply | To 1 | View Replies]

To: Daffynition
The deep Web is several orders of magnitude larger than the surface Web.[2]

from Wekipedia....

46 posted on 03/23/2010 1:27:46 PM PDT by GOPJ (http://hisz.rsoe.hu/alertmap/index2.php?area=dam&lang=eng)
[ Post Reply | Private Reply | To 1 | View Replies]

To: Daffynition

Great post - thanks.


47 posted on 03/23/2010 1:38:05 PM PDT by GOPJ (http://hisz.rsoe.hu/alertmap/index2.php?area=dam&lang=eng)
[ Post Reply | Private Reply | To 1 | View Replies]

To: Puddleglum

It’s going to be fun exploring some of these links....or an enormous waste of time!


48 posted on 03/23/2010 3:39:31 PM PDT by Daffynition (What's all this about hellfire and Dalmatians?)
[ Post Reply | Private Reply | To 5 | View Replies]

To: Daffynition

Interesting, but the phrase is ‘mother lode’, not ‘load’.

/Pedantic mode


49 posted on 03/23/2010 3:50:48 PM PDT by Riley (The Fourth Estate is the Fifth Column.)
[ Post Reply | Private Reply | To 1 | View Replies]

To: Daffynition

Bookmarked...


50 posted on 03/23/2010 6:26:49 PM PDT by BlessedBeGod (New Wizard of Oz: Pelosi as the Wicked Witch of the West & Michelle as the Wicked Witch of the East.)
[ Post Reply | Private Reply | To 1 | View Replies]

To: dangerdoc
> How much of that is porn, pirated movies and left wing blather? Makes you wonder if this whole interweb thing was a good idea.

My own estimate:

And then consider that in the case of the porn and movies and music, those figures represent hundreds if not thousands of duplicate copies of the same content, just in different places.

With regard to left-wing blather, it's actually not that bad. Right-wing blather has it beat hands down. :) The left may own the broadcast/cable networks, but the right owns the blogosphere and the forums.

There's no political site in the world that can touch FreeRepublic.com. I love the web.

51 posted on 03/23/2010 9:04:03 PM PDT by dayglored (Listen, strange women lying in ponds distributing swords is no basis for a system of government!)
[ Post Reply | Private Reply | To 44 | View Replies]

To: Daffynition

It wouldn’t surprise me if most of the Deep Net turned out to be LOLcats stored on private drives.


52 posted on 03/23/2010 9:08:37 PM PDT by Yardstick
[ Post Reply | Private Reply | To 1 | View Replies]

To: ShadowAce

bumpo


53 posted on 03/23/2010 10:40:32 PM PDT by bitt ("WE THE PEOPLE" http://www.youtube.com/watch?v=JVAhr4hZDJE)
[ Post Reply | Private Reply | To 31 | View Replies]

To: Swordmaker

Pinging you and your ping list


54 posted on 03/23/2010 10:41:04 PM PDT by Darnright (There can never be a complete confidence in a power which is excessive. - Tacitus)
[ Post Reply | Private Reply | To 1 | View Replies]

To: Daffynition

Put up a thread after you have checked them all out.

I checked all ten and there is only one I would look twice at.


55 posted on 03/23/2010 11:33:15 PM PDT by Balata
[ Post Reply | Private Reply | To 1 | View Replies]

To: Daffynition

Excellent thread, thank you.


56 posted on 03/23/2010 11:55:46 PM PDT by nw_arizona_granny ( garden/survival/cooking/storage- http://www.freerepublic.com/focus/chat/2299939/posts?page=5555)
[ Post Reply | Private Reply | To 1 | View Replies]

To: Daffynition

bumperoonie


57 posted on 03/24/2010 12:06:49 AM PDT by moehoward
[ Post Reply | Private Reply | To 1 | View Replies]

To: Daffynition

I tried some of these - they’re wonderful. Great post.


58 posted on 03/26/2010 7:50:22 AM PDT by GOPJ (http://hisz.rsoe.hu/alertmap/index2.php?area=dam&lang=eng)
[ Post Reply | Private Reply | To 1 | View Replies]

To: All

For internet searches of persons.

http://www.spokeo.com

http://www.zabasearch.com


59 posted on 04/11/2010 6:55:17 PM PDT by Jet Jaguar (*)
[ Post Reply | Private Reply | To 58 | View Replies]

To: Cindy; icwhatudo

post 59 if interested.


60 posted on 04/11/2010 6:56:46 PM PDT by Jet Jaguar (*)
[ Post Reply | Private Reply | To 59 | View Replies]


Navigation: use the links below to view more comments.
first previous 1-2021-4041-6061-62 next last

Disclaimer: Opinions posted on Free Republic are those of the individual posters and do not necessarily represent the opinion of Free Republic or its management. All materials posted herein are protected by copyright law and the exemption for fair use of copyrighted works.

Free Republic
Browse · Search
General/Chat
Topics · Post Article

FreeRepublic, LLC, PO BOX 9771, FRESNO, CA 93794
FreeRepublic.com is powered by software copyright 2000-2008 John Robinson