Big Idea 2 - Data Flashcards
Abstraction
The process of simplifying complex systems by focusing on the essential details and hiding unnecessary complexities.
Analog Data
Refers to continuous, real-world information that is represented by a range of values. It can take on any value within a given range and is often used to describe physical quantities like temperature or sound.
Binary Numbers
A base-2 number system that uses only two digits, 0 and 1, to represent all values. Each digit in a binary number is called a bit.
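One way to see where the bits come from is repeated division by 2. The sketch below is illustrative only; the function name `to_binary` is an assumption, not part of any standard library.

```python
def to_binary(n):
    """Return the base-2 digits of a non-negative integer as a string."""
    if n == 0:
        return "0"
    bits = []
    while n > 0:
        bits.append(str(n % 2))  # the remainder is the next bit, least significant first
        n //= 2                  # integer-divide to move on to the next bit
    return "".join(reversed(bits))  # reverse so the most significant bit comes first


print(to_binary(13))  # 1101
```

Python's built-in `bin(13)` gives the same digits with a `0b` prefix.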
Byte
A unit of digital information that consists of 8 bits. One byte can take 256 (2^8) distinct values, enough to represent a single ASCII character or a small integer.
Cleaning Data
The process of identifying and correcting or removing errors, inconsistencies, and inaccuracies in a dataset. It involves tasks like removing duplicate entries, handling missing values, and standardizing formats to ensure data quality.
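A minimal sketch of the three tasks named above (standardizing formats, handling missing values, removing duplicates). The sample rows and the `clean` helper are made up for illustration.

```python
raw_rows = [
    {"name": " Alice ", "age": "30"},
    {"name": "alice", "age": "30"},  # a duplicate once the name is standardized
    {"name": "Bob", "age": ""},      # missing age
]


def clean(rows):
    seen = set()
    cleaned = []
    for row in rows:
        name = row["name"].strip().title()             # standardize the format
        age = int(row["age"]) if row["age"] else None  # mark missing values as None
        key = (name, age)
        if key in seen:                                # skip duplicate entries
            continue
        seen.add(key)
        cleaned.append({"name": name, "age": age})
    return cleaned


print(clean(raw_rows))
```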
Data Compression
The process of reducing the size of data files while maintaining as much information as possible. This allows for more efficient storage and transmission of data.
Data Filtering
The process of selectively extracting or removing specific pieces of data from a larger dataset based on certain criteria or conditions. It allows you to focus on relevant information while excluding irrelevant or unwanted data.
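As a small illustration, filtering can be written as a condition applied to every item. The temperature readings and the 90-degree cutoff are invented for this example.

```python
temperatures = [72, 65, 90, 58, 101]

# Keep only the readings that meet the criterion; the original list is unchanged.
hot_days = [t for t in temperatures if t >= 90]

print(hot_days)  # [90, 101]
```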
Data Transformation
Refers to the process of converting data from one format or structure to another. It involves modifying, reorganizing, or manipulating data to make it more suitable for analysis or other purposes.
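For instance, comma-separated text lines can be transformed into dictionaries that are easier to analyze. The record layout and the `to_dicts` helper are assumptions made for this sketch.

```python
records = ["alice,90", "bob,85"]


def to_dicts(lines):
    """Convert 'name,score' strings into dictionaries with a numeric score."""
    result = []
    for line in lines:
        name, score = line.split(",")
        result.append({"name": name, "score": int(score)})  # text becomes a number
    return result


print(to_dicts(records))
```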
Digital Data
Refers to information that is represented using discrete, binary values (0s and 1s). It is commonly used in computers and other digital devices because it can be easily stored, processed, and transmitted.
Hexadecimal
A number system that uses base-16 instead of base-10. It uses digits from 0 to 9 and letters from A to F to represent values from 0 to 15.
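A quick sketch using Python built-ins to move between decimal, hexadecimal, and binary. Each hexadecimal digit stands for exactly four bits.

```python
value = 255
hex_text = format(value, "X")         # decimal -> hexadecimal text
back = int("FF", 16)                  # hexadecimal text -> decimal
from_bits = format(0b10101111, "X")   # 1010 -> A, 1111 -> F

print(hex_text)   # FF
print(back)       # 255
print(from_bits)  # AF
```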
Lossless Compression Algorithms
Methods used to compress data files without losing any information. The compressed file can be fully restored to its original form without any loss of data.
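Run-length encoding is one simple lossless technique, shown here as a sketch rather than as the method any particular file format uses. The function names are illustrative.

```python
def rle_encode(text):
    """Collapse runs of repeated characters into (character, count) pairs."""
    runs = []
    for ch in text:
        if runs and runs[-1][0] == ch:
            runs[-1] = (ch, runs[-1][1] + 1)  # extend the current run
        else:
            runs.append((ch, 1))              # start a new run
    return runs


def rle_decode(runs):
    """Rebuild the original text exactly from the (character, count) pairs."""
    return "".join(ch * count for ch, count in runs)


encoded = rle_encode("AAAABBBCCD")
print(encoded)              # [('A', 4), ('B', 3), ('C', 2), ('D', 1)]
print(rle_decode(encoded))  # AAAABBBCCD
```

Decoding returns the exact input, which is what makes the scheme lossless.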
Lossy Compression Algorithms
Methods used to reduce the size of a file by permanently removing some data. These algorithms typically achieve higher compression ratios than lossless methods, but the original file cannot be fully restored, so some quality or detail is lost.
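A toy sketch of the idea, assuming 8-bit brightness values: dropping the low 4 bits of each value leaves fewer distinct values to store, but the discarded detail cannot be recovered. Real lossy formats are far more elaborate than this.

```python
pixels = [200, 203, 198, 57]

# Keep only the high 4 bits of each 8-bit value (0xF0 is 11110000 in binary).
quantized = [p & 0xF0 for p in pixels]

print(quantized)  # [192, 192, 192, 48]
```

Three different input values became 192, so there is no way to tell them apart afterwards.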
Metadata
Refers to descriptive information about data, such as its format, location, authorship, and creation date. It provides context and additional details that help organize and manage data effectively.
Overflow Error
Occurs when a computer program or system tries to store a value larger than the maximum its data type or fixed number of bits can represent. This can lead to unexpected and incorrect results, such as a number that wraps around to a small value.
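Python integers grow as needed and do not overflow, so the sketch below simulates an 8-bit unsigned register by keeping only the lowest 8 bits. Wrapping around is one common outcome; other systems may report an error instead. `store_unsigned` is a hypothetical helper.

```python
BITS = 8
MAX_VALUE = 2 ** BITS - 1  # 255, the largest value 8 bits can hold


def store_unsigned(value, bits=BITS):
    """Keep only the lowest `bits` bits, as a wrapping fixed-width register would."""
    return value & ((1 << bits) - 1)


print(store_unsigned(MAX_VALUE))      # 255
print(store_unsigned(MAX_VALUE + 1))  # 0
```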
Rounding Error
Occurs when a number is approximated or rounded to a certain decimal place, resulting in a small discrepancy between the rounded value and the actual value.
ASCII code
Stands for American Standard Code for Information Interchange. It is a character encoding standard that assigns unique numeric values to represent characters such as letters, numbers, and symbols in computer systems. Standard ASCII uses 7 bits per character, which gives 128 possible codes.
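Python's built-in `ord` and `chr` show the number behind each character, as a quick illustration:

```python
print(ord("A"))  # 65
print(chr(97))   # a

# Encoding text as ASCII yields one numeric code per character.
codes = list("Hi".encode("ascii"))
print(codes)     # [72, 105]
```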
Data
Refers to information that is collected, stored, and processed by computers. It can be in the form of numbers, text, images, or any other type of digital content.
Constant Value
A fixed value that does not change during the execution of a program.
Lists
Lists are ordered collections of items in computer programming. They allow you to store multiple values under one variable name and access them using their position or index.
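A short sketch of storing several values under one name and reaching them by position. Note that Python counts positions from 0; other languages or pseudocode conventions may start at 1. The `scores` data is made up.

```python
scores = [88, 92, 75]

first = scores[0]    # position 0 is the first item in Python
last = scores[-1]    # a negative index counts from the end
scores.append(100)   # lists can grow while the program runs

print(first, last, len(scores))  # 88 75 4
```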
Machine Code
A low-level programming language that consists of binary instructions directly understood by the computer’s hardware. It represents the most basic level of instructions that a computer can execute.
Number Bases
Also known as numeral systems. These are methods of representing numbers using a specific set of symbols or digits. Each digit is multiplied by the base raised to a power determined by the digit's position, and the results are added to give the number's value.
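The positional rule can be sketched as a sum of digit times base-to-a-power. The helper name `positional_value` is an assumption for illustration; digits are given as numeric values, most significant first.

```python
def positional_value(digits, base):
    """Add up digit * base**position, counting positions from the right at 0."""
    total = 0
    for power, digit in enumerate(reversed(digits)):
        total += digit * base ** power
    return total


print(positional_value([1, 1, 0, 1], 2))  # 13  (8 + 4 + 0 + 1)
print(positional_value([2, 15], 16))      # 47  (hexadecimal 2F)
print(positional_value([4, 7], 10))       # 47
```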
Rounding Errors
Inaccuracies that occur when representing real numbers with finite precision. These errors happen because some numbers cannot be precisely represented using binary floating-point representation.
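A well-known illustration in Python: 0.1 and 0.2 have no exact binary floating-point form, so their sum is not exactly 0.3. Comparing with a tolerance is one common workaround.

```python
import math

total = 0.1 + 0.2

print(total)                     # 0.30000000000000004
print(total == 0.3)              # False
print(math.isclose(total, 0.3))  # True
```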
Unicode system
An international character encoding standard that assigns unique numeric values to represent characters from various languages and scripts around the world. It allows computers to handle text in multiple languages more effectively than previous standards like ASCII.
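A small sketch of a character that Unicode covers but ASCII does not. UTF-8 is used here as one common way to store Unicode code points as bytes.

```python
e_acute = "\u00e9"  # the character é

print(ord(e_acute))               # 233, its Unicode code point
utf8_bytes = e_acute.encode("utf-8")
print(len(utf8_bytes))            # 2 bytes in UTF-8

try:
    e_acute.encode("ascii")       # ASCII has no code for this character
    fits_in_ascii = True
except UnicodeEncodeError:
    fits_in_ascii = False

print(fits_in_ascii)              # False
```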
Variables
Containers that hold values or data in a computer program. They can store different types of information such as numbers, text, or boolean values.