02 Introduction to retrieval system Flashcards

1
Q

what is information retrieval

A

science of search engine
- effectively get the right information to user
- efficiently get information to user
- relevance to see if the query and document is about the same topic

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

general definition of IR

A

retrieval of relevant information from data sources which were not originally intended for access

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

documents vs database records**

A

database records store in tuples, but how to match with textual records
retrieving:
- structured data (DB)
- free text (IR)

queries
- formally defined (DB)
- vague, imprecise (IR)

results
- exact, always correct (DB)
- sometime relevant (IR)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

imprecision in IR

A

most algorithms in computer science have ‘right’ answers
IR techniques are essentially heuristic as we do not know the right answers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

users classification

A

naive to expert

  1. professionals
    - ‘accommodation in glasgow’
    - direct term search
  2. general users
    - how do i get to …
  3. laymen
    - doctor said i have… where do i get more information
    - long texts
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

information pyramid

A

data
information = data in context
knowledge = basis for making decisions
wisdom

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

what makes a document relevant

A
  • does it contain all query terms
  • contain many times
  • fresh
  • authoritative (have many links)
  • doesnt contain too many ads
  • doesnt contain spam
  • has been clicked the most
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

search engines

A

web search process is not a one search approach
- question answer assessment
- user engagement

How well did you know this?
1
Not at all
2
3
4
5
Perfectly