Misc Flashcards

1
Q

What is ASCII character set ?
(can be described as character encoding or since Unicode - codepage)

A

Computer stores 8 bit bytes. The ASCII Character Set is an encoding scheme that attributes a number to a letter.
for e.g. ‘a’ = 1000001 (binary) = 65 (decimal)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

ASCII History
How many characters ?
What language ?
what were first 32 characters ?

A

128 (0-127)
English - since it started in US
non-printable control characters - now obsolete , used to control devices

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What was the Extended ASCII character set ?

A

Since ASCII only used 7 bits, the 8th bit was unused.
Non-English users, took advantage of this to set the 8th bit.
If the 8th bit was set, it was possible to define an additional set of characters with that character on. This was not official and resulted in clashes. Different languages used different Extended character sets

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What was the Unicode consortium ?
different countries/companies

A

An attempt to unify encodeings around the globe

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is a Code Point ?

A

65 is the decimal code point of the character ‘a’
1000001 is the binary code point of the same character.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is a code point encoding ?

A

Code points can be encoding in more than one way…

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the defacto standard for encoding code points in Web Applications ?

A

UTF-8

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

How many bytes does UTF-32 use to encode each codepoint ?

A

4 bytes… so it wasn’t adopted because it wasted too much space

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is UTF-16 ? how many bytes does it use ?

A

It depends on the value of the codepoint. Either 2 bytes or 4 bytes. Not backward compatible with ASCII

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is Big Endian and Little Endian ?

A

In UTF-16 encoding, the order of the bytes can be one of two ways.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is a Byte Order Mark ?

A

It indicates whether the following UTF-16 is Little Endian style or Big Endian Style

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is UTF-8 ?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly