316 MT Flashcards

(10 cards)

1
Q

translation divergences

A
  • differences between languages
  • typology-how different languages have different affixes to them
    *structural- the structures in which sentences are set..SVO SOV
  • lexical divergences- difference in ambiguity and grammar
  • lexical gap- words that doesnt have a meaning in another language..rivere
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

ambiguity resolution

A

words that are spelt the same in one language but has two separate words in another language..he plays the guitar..he plays soccer

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

direct translation

A

morphological analysis,lexical transfer,local reordering, morphological generation.
* word to word translations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

syntactic transfer
semantic transfer

A

analysis,shallow analysis
transfer(lexica,syntactic),transfer phase
generation,synthesis phase
requires both syntactic and direct translation
one makes possible parse trees then finds the most likely parse tree.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

a good translation should be:

A

faithful-correctly convery information and tone
fluent- grammatically well structured and readable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

phrase based translation

A

translation probability-
translation model-tells us what a setnence/phrase in a source language most likely translates to(faithfulness)

distortion probability- tells how likely a given sentence/phrase is(language model)(fluency)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

encoder and decoder

A

encoder is used for language understanding in source languae
decode is used for language generation in target language

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

seq2seq encoder

A

the encoder usually uses an LSTm to generate context vectro from the output
*input tokes are read:
-one at time
-in reverse order
* the encoder conosists of stacked LSTMS
* column-timestep
row- single layer

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

human rater evaluations

A
  • fluency
    -asking raters to rate output interms of readability,clarity and naturalness
  • cloze test:
    if theres a space within the output and someone can accurately depict the word that showed be there..then the machine is doing well
  • fidelity:
    -adequacy-measures whether a translation contains the info that existed in the source
    -informativeness-text based evaluation if theres sufficient info in the output
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

bleu

A

the closer machine translation is to a professional human translation,the better it is
-if results are needed urgently
* two constraints
a gram in the reference translation cannot be matched more than once
brevity penalty-very small sentences that would achieve a 1.0 precision is penalised

How well did you know this?
1
Not at all
2
3
4
5
Perfectly