QA over KGs Flashcards
(11 cards)
What are the main stages of the KGQA pipeline?
1) Natural Language Question
2) Query Formulation
3) Structured Query (SPARQL)
4) Query Execution
5) Knowledge Graph Execution
6) Result Processing
7) Answer Generation
What occurs in the ‘Query Formulation’ stage of KGQA?
Translating the natural language question into a formal structured query (e.g., SPARQL) by recognizing entities, relations, and constructing the logical form.
What is involved in ‘Query Execution’ and ‘Knowledge Graph Execution’?
Running the SPARQL query against the knowledge graph endpoint to retrieve matching triples or subgraphs that answer the query.
What does ‘Result Processing’ entail in the KGQA pipeline?
Aggregating raw query results, filtering or deduplicating, ranking candidate answers, and formatting the data for the final generation step.
Describe the ‘Answer Generation’ stage in KGQA.
Using the processed results (and optionally retrieved text via RAG) to generate a coherent, natural-language answer for the user through templates or an LLM.
Where and how does Retrieval-Augmented Generation (RAG) integrate into the pipeline?
Between Result Processing and Answer Generation: format retrieved KG data as context, optionally augment with text or embeddings, then feed to an LLM to produce the final answer.
Name the four key challenges highlighted on the KGQA pipeline slide.
1) Entity Recognition
2) Relation Mapping
3) Query Construction
4) Answer Ranking
What issues arise in ‘Entity Recognition’ within KGQA?
Identifying and disambiguating entity mentions in user questions, handling synonyms, polysemy, and acronyms.
What is the ‘Relation Mapping’ challenge in KGQA?
Mapping natural-language predicates to the correct KG properties, dealing with varied phrasing and implicit relations.
What does the ‘Query Construction’ challenge refer to?
Assembling the recognized entities and relations into a valid SPARQL query, capturing the intended logic and filters.
Explain the ‘Answer Ranking’ challenge in KGQA.
Ordering multiple candidate answers by relevance or correctness, selecting the best for presentation to the user.