L11: Tracking Flashcards

1
Q

Meanshift: What is the intial track that needs to be loaded?

A

ROI (Region-of-interest). This can be hand chosen or by a detection algorithm

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Meanshift: Does it handle scale and orientation changes?

A

No, use Camshift instead

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Meanshift: What is Histogram backprojection?

A

You take the original ROI/image’s histogram and backproject from a new frame. The black and white areas indicate how similar they are.

High values = regions that are similar
Low values = regions that are dissimilar

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Meanshift: When is it applied and what do it do?

A

After the histogram backprojection. It shift the ROI to the new location that encapsulate most of the points, hence the mean of the points.

Hence where the name comes from

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Camshift: How does it work compared to Meanshift?

A

Camshift (Continuously Adaptive Meanshift) essentially follow the same procedure. It does Meanshift. Afterwards it tries to rotate and fit and elipsoid to the distribution of over points. If the elipsoid increases with some significant size it will update the ROI.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

SORT: What are the 4 stages of it?

A
  1. Localization of track identities using an external detector.
  2. Kalman filtering for prediction
  3. Bounding box assignment
  4. Dynamic creation/deletion of track identities
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

SORT: what detector can be used?

A

Essentially all that generate a bounding box and sufficient data for the Kalman filter.

The original used detector is the Faster R-CNN (Deep Neural Network) trained to locate pedestrians / people

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

SORT: What is the parameterization of the bounding box?

A

It is: x=[u,v,r,u^dot,v^dot,s_dot]^t

(u,v) = center of bounding box
r = the aspect ratio
s = the area

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

SORT: why do we need the r term? The aspect ratio term

A

Because the method relies on constant aspect ratio between two consecutive frames. This also makes the prediction trask for the filter slightly easier.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

SORT: What is the assignment problem?

A

At each frame our detector will generate bounding boxes (N number) aswell as our estimate from the kalman filter (M number).

The assignment problem is then, how do we assign an object to the right bounding box?

We use an cost matrix N x M and compute the IoU score.

Essentially, meaning how well does a detector bounding box overlap with all of the estimated bounding boxes and vice versa.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

SORT: what if the square matrix (NxM) is unequal?

A

Simply add a dummy rows/columns with zero entries

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

SORT: Which algorithm performs optimal assignment? and how?

A

The hungarian algorithm: It does so by permutes the columns/rows of the cost matrix until the trace becomes minimal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

SORT: What is the drawbacks?

A
  • Complexity
  • The runtime is cubic in the number of rows
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

SORT: When does a new track get created or an old deleted?

A

If a new object has been seen for a couple of frames it will be given a new track.

Vice-versa for deletion, if an object has left the scene for some couple of frames the track will be removed.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

SORT: What is the kalman filter step used for?

A

The Kalman filter is used to predict the expected position and velocity of each existing track in the current frame based on its previous state.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

SORT: What is the IoU score?

A

IoU = Area of overlap / area of union

High overlap = good
Low overlap = bad

See picture from slide for better explanation