Web technologies Flashcards
(29 cards)
What is the main function of a crawler?
To discover web pages by following links from one webpage to another, systematically visiting pages on the web
How does a crawler work?
- It starts with a set of seed URLs and visits other pages linked from those URLs
- It follows rules and guidelines established by website owners
- Once a crawler reaches a webpage, it fetches the HTML content of the page
- The crawler examines the HTML structure and retrieves information, such as text content and headings
- The HTML that was retrieved is broken down into individual components
- This process involves identifying elements, tags and attributes that hold valuable information
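The fetch-and-parse steps above can be sketched in Python — a minimal illustration using the standard library's html.parser; the sample page and its links are invented for the example, and a real crawler would fetch the HTML over HTTP from a seed URL:

```python
from html.parser import HTMLParser

# Minimal parser that collects links and headings from fetched HTML,
# mirroring the crawler steps described above.
class PageParser(HTMLParser):
    def __init__(self):
        super().__init__()
        self.links = []        # URLs found in <a href="..."> attributes
        self.headings = []     # text content of <h1>-<h6> elements
        self._in_heading = False

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)
        elif tag in {"h1", "h2", "h3", "h4", "h5", "h6"}:
            self._in_heading = True

    def handle_endtag(self, tag):
        if tag in {"h1", "h2", "h3", "h4", "h5", "h6"}:
            self._in_heading = False

    def handle_data(self, data):
        if self._in_heading and data.strip():
            self.headings.append(data.strip())

# Stands in for HTML fetched from a seed URL.
sample_html = '<h1>Example</h1><p>See <a href="/about">about</a> and <a href="/news">news</a>.</p>'
parser = PageParser()
parser.feed(sample_html)
print(parser.links)     # → ['/about', '/news']  (the pages the crawler would visit next)
print(parser.headings)  # → ['Example']
```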
What is indexing?
- The data extracted by the crawler is indexed. This involves storing the data in a structured manner in the search engine's database
- The index allows for quick retrieval and ranking of relevant web pages in response to user queries
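A toy inverted index illustrates both points — a minimal sketch in which each word maps to the set of pages containing it, so a query is answered by set lookups rather than a scan of every page (the example documents are invented):

```python
from collections import defaultdict

# Build a toy inverted index: each word maps to the set of page IDs
# containing it, enabling quick retrieval at query time.
pages = {
    "page1": "web crawlers discover pages",
    "page2": "search engines rank pages",
}

index = defaultdict(set)
for page_id, text in pages.items():
    for word in text.lower().split():
        index[word].add(page_id)

# Query time: intersect the sets of pages for each query word.
def search(query):
    results = [index.get(word, set()) for word in query.lower().split()]
    return set.intersection(*results) if results else set()

print(search("rank pages"))  # → {'page2'}
```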
How are web pages ranked?
When a user enters a query, the search engine searches the index for matching pages and returns the results it believes are the highest quality and most relevant to the user's query
What are the benefits of crawling and indexing?
- Improved search results: more relevant and up-to-date results
- Efficient retrieval: can search the index to produce results quickly
- Ranking & relevance: rankings are determined by various algorithms
- Freshness & updates: crawlers periodically revisit indexed pages to update the index
What does PageRank do?
Web pages are evaluated and ranked by the PageRank algorithm based on their perceived relevance and importance
What are the key elements of the PageRank algorithm?
- Link analysis
- Link weight distribution
- Iterative calculation
- Damping factor
What is link analysis in the PageRank algorithm?
- The PageRank algorithm analyses the structure of links between pages on the web
- Web pages are given importance by the algorithm, which considers the quantity and quality of inbound links from other pages
- Each link acts as a “vote” for the target page, with the voting weight determined by the quality of the linking page
- Webpages that have more “high-quality” links pointing towards them are deemed to be more important and hence ranked higher
What is the link weight distribution in the PageRank algorithm?
- The importance of a webpage is determined by the PageRank algorithm, which takes into account the total number of votes it has
- The algorithm distributes a page's weight by sharing a portion of its importance with each outgoing link
- Hence, pages of higher quality (many pages linking to them and few pages they link to) are given greater importance
What is the iterative calculation in the PageRank algorithm?
- The PageRank algorithm uses an iterative calculation process. At the beginning, every webpage is given the same value
- In subsequent iterations, the significance of every webpage is re-evaluated by considering the weighted significance of inbound links
- The process repeats until the rankings become stable
What is the damping factor in the PageRank algorithm?
- The damping factor is a value between 0 and 1 (usually 0.85)
- It represents the probability that a user keeps following links; the remaining probability (usually 0.15) is the chance of jumping to a random page instead
- It makes the model more realistic
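The elements above — link analysis, weight distribution, iteration and damping — can be combined in a short sketch. This is a simplified illustration on a made-up three-page link graph, not the production algorithm:

```python
# Simplified PageRank power iteration on a tiny invented link graph.
# links maps each page to the pages it links out to.
links = {
    "A": ["B", "C"],
    "B": ["C"],
    "C": ["A"],
}

damping = 0.85
pages = list(links)
n = len(pages)
rank = {p: 1.0 / n for p in pages}   # every page starts with the same value

for _ in range(50):                  # iterate until the rankings stabilise
    new_rank = {}
    for page in pages:
        # Each page linking here "votes" by splitting its own rank
        # evenly across its outgoing links (link weight distribution).
        incoming = sum(rank[q] / len(links[q]) for q in pages if page in links[q])
        # The damping factor models a user who keeps clicking links with
        # probability 0.85 and jumps to a random page otherwise.
        new_rank[page] = (1 - damping) / n + damping * incoming
    rank = new_rank

print({p: round(r, 3) for p, r in rank.items()})
```

C ends up ranked highest because it receives links from both other pages, matching the "votes" intuition above.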
What factors influence the PageRank?
- Relevance
- User engagement
- Authority and trust
- Content freshness
- Mobile-friendliness
Limitations and evolving nature of the PageRank algorithm
- Although the PageRank algorithm is important in search engines, it is not the only factor that determines webpage rankings
- Search engines combine it with many other algorithms and signals to provide varied, high-quality results
What is server side processing?
- Involves running code and carrying out operations on the server instead of on the client's device or browser
- Web development often utilises server side programming languages such as PHP, Python or Java to handle incoming requests and process data
What is PHP?
- PHP (PHP: Hypertext Preprocessor) is a server side scripting language specifically designed for web development
- PHP focuses mainly on completing tasks on the server
Examples of server side processing with PHP
- Data retrieval and manipulation: PHP is capable of interacting with databases, processing data and generating dynamic content
- Server operations: completing tasks that are not accessible by the client, e.g. retrieving and displaying content from a database
- Form processing: form submissions and processing submitted data
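Form processing of the kind described can be sketched server-side — shown here in Python rather than PHP, using only the standard library; the form fields and validation rules are invented for the example:

```python
from urllib.parse import parse_qs

# Server-side handling of a form submission: the raw body of a POST
# request (application/x-www-form-urlencoded) is parsed and validated
# on the server, without trusting anything the client sent.
def process_form(body: str) -> dict:
    fields = parse_qs(body)
    username = fields.get("username", [""])[0].strip()
    email = fields.get("email", [""])[0].strip()
    if not username or "@" not in email:
        return {"ok": False, "error": "invalid submission"}
    # In a real application the validated data would now be stored in a
    # database and a response page generated dynamically.
    return {"ok": True, "username": username}

print(process_form("username=alice&email=alice%40example.com"))  # → {'ok': True, 'username': 'alice'}
print(process_form("username=&email=nope"))                      # → {'ok': False, 'error': 'invalid submission'}
```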
Benefits of server side processing
- Improved security measures can be implemented, ensuring secure management of sensitive data
- Uses the resources of the server to perform advanced calculations
- Consistent behaviour across different devices and browsers
- Can be easily scaled by adding more servers
Drawbacks of server side processing
- Multiple simultaneous requests can slow overall server processing, due to increased server load
- Latency due to client communicating with server, leading to increased response times
- Relies on availability and reliability of the server
- Limits real-time interactivity and responsiveness
- May require more complex development and setup
What is client side processing?
- Involves running code or processing tasks on the user's device, usually within the browser, instead of on the server
- Enables users to have an interactive and dynamic experience without constantly requesting data from the server
- Primarily done using JavaScript
Client side processing with JavaScript
- JavaScript allows developers to modify web content and manage user interactions without requiring server requests
Examples of client side processing using JavaScript
- Form validation: allows validation of user input in real time, which means users can receive instant feedback without the need for a server roundtrip
- Developers can modify the DOM (Document Object Model) to make dynamic changes to a webpage's content and structure
- Communication with the server happens in the background, allowing content to be updated dynamically without a full page refresh
Benefits of client side processing
- Enhanced user experience: eliminates the need for frequent server requests and page reloads
- Server load is reduced - improved scalability
- Inputs can be instantly validated and feedback can be provided in real time
- Webpage content is updated dynamically: more engaging browsing experiences
- Offline functionality
Drawbacks of client side processing
- Potential security risk, as code and data are visible to users
- Compatibility across devices and browsers may vary
- Can hurt page load time, due to webpages requiring substantial processing power on the client
- Heavily dependent on JavaScript: if the user's device does not support JavaScript, the webpage won't work
- Intellectual property is at risk
Client side processing vs server side processing
- Client side processing is better for tasks that require immediate feedback, real-time interactions and dynamic user interfaces within the browser
- Server side processing is better for tasks that involve accessing databases, handling sensitive data and complex business logic
What are the main HTML tags?
- <h1> to <h6>: heading tags in decreasing importance. <h1> is the most important and hence displayed the biggest; <h6> is displayed the smallest
- <p>: used for a paragraph of text. Each paragraph is separated by a line
- <ol>: ordered list. Displays a numbered list
- <ul>: unordered list. Displays a bulletpointed list
- <li>: used to define each value in a list
- <html>: the root element that contains the whole page
- <head>: contains information about the page, such as its title
- <title>: sets the title of the page shown in the browser tab
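The tags fit together in a minimal page, sketched below (illustrative only; the `<body>` element, which wraps the visible content, is included for completeness even though it is not listed above):

```html
<html>
  <head>
    <title>Example page</title>
  </head>
  <body>
    <h1>Main heading</h1>
    <p>A paragraph of text.</p>
    <ol>
      <li>First numbered item</li>
      <li>Second numbered item</li>
    </ol>
    <ul>
      <li>A bulleted item</li>
    </ul>
  </body>
</html>
```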