02a - Data Engineering Flashcards

1
Q

Wie ist die Data Engineering Pipeline aufgebaut?

A

F2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Was muss gemacht werden, damit Daten im Machine Learning verwendet werden können?

A

F2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Was wird im Pipeline Schritt Wrangling gemacht?

A

F3

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Was wird im Pipeline Schritt Cleaning gemacht?

A

F3

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Was wird im Pipeline Schritt Preprocessing gemacht?

A

F3

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Welche 3 Faktoren zeichnen die Datenqualität hauptsächlich aus?

A

F4

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Wie geht man vor, wenn Datenwerte fehlen?

A

F5

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Wie geht man vor, wenn man fehlende Datenwerte ersetzen möchte?

A

F5

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Was ist Noise bei Daten?

A

F6

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Wie geht man mit Noise um?

A

F6

How well did you know this?
1
Not at all
2
3
4
5
Perfectly