Deep Web vs Dark Web Flashcards
(3 cards)
What are web crawlers?
Search engines rely on automated programs called web crawlers to traverse the Internet,
discovering and indexing publicly available web pages. These crawlers follow hyperlinks and links connecting websites to discover new content and add it to their search engines database.
Websites can create a robot.txt file, instructing web crawlers not to index specific pages or directories for the public. Many websites require users to log in before accessing content.
Since crawlers cannot authenticate, they cannot access the information behind these logins.
Some websites generate content on the fly based on user input, making it difficult for static search engines to access this information.
What is the deep web?
the deep web refers to the part of the Internet, not indexed by standard search engines.
It includes legitimate password-protected content like medical records, financial data, and academic databases. Unlike the readily accessible surface web, the deep web contains information that is deliberately excluded from reach. Deep web content can be accessed with traditional web browsers, but not traditional search engines. Consequently, the deep web encompasses diverse content that could be legal or illegal, including private data,
including personal information like medical records, financial data, and legal documents
stored on secure servers. Paywalled content, like academic databases, subscription-based news websites, and exclusive online forums that often required paid memberships or subscriptions. This makes them inaccessible to standard search engines. While public profiles might be indexed, private profiles on social media platforms like Facebook or Instagram are part of the deep web. Sometimes data files containing sensitive personal information or
company information can be found on the deep web.
What is the dark web?
the dark web, which is mostly associated with criminal activities. It is a small subset of the deep web. Unlike the vast and diverse deep web, the dark web comprises a collection of websites deliberately hidden from standard search engines, and it’s only accessible through specialized software.
While the anonymity of the dark web can serve legitimate purposes like protecting journalists or whistleblowers, it has unfortunately facilitated the proliferation of illegal activities on the dark web, including hidden black market places that act as platforms for selling illegal goods and services.
The dark web provides a haven for cyber criminals to engage in activities like malware distribution, hacking tool sales, and stolen identity trading. Furthermore, the anonymous nature of the dark web allows for the proliferation of harmful content, including hate speech and violent extremism. When a company’s database is breached, the stolen data often ends up on the dark web. This could include sensitive information such as credit card numbers, social security numbers, and login credentials.