Primary areas of fintech
Big Data
Big Data refers to all the potentially useful information that is generated in the economy, including data from traditional sources and alternative data.
Corporate exhaust
Businesses that generate potentially useful information such as bank records and retail scanner data
Internet of Things
Sensors, such as radio frequency identification chips, that are embedded in numerous devices such as smart phones and smart buildings
Characteristics of Big Data
Volume, velocity, and variety
Data processing method include
Neural Networks
Example of artificial intelligence in that they are programmed to process information in a way similar to the human brain
Machine Learning
This refers to programming that gives a computer system the ability to improve its performance of a task over time. The machine learning process typically requires vast amounts of data.
ML can produce models that overfit or underfit the data.
Fintech Application
Distributed ledger
A distributed ledger is a database that is shared on a network so that each participant has an identical copy. A distributed ledger must have a consensus mechanism to validate new entries into the ledger. Distributed ledger technology uses cryptography to ensure only authorized network participants can use the data.
Distributed ledgers can take the form of permissionless or permissioned networks.
Blockchain
A blockchain is a distributed ledger that records transactions sequentially in blocks and links these blocks in a chain. Each block has a cryptographically secured “hash” that links it to the previous block. The consensus mechanism in a blockchain requires some of the computers on the network to solve a cryptographic problem. These computers are referred to as miners.
Financial applications of distributed ledger technology
Post-trade clearing and settlement is an area of finance to which distributed ledger technology might be productively applied. Distributed ledgers could automate many of the processes currently carried out by custodians and other third parties. On the other hand, the inability to alter past transactions on a distributed ledger is problematic when canceling a trade is required.
Covariance
Correlation coefficient (r)
Spurious Correlation
Spurious correlation refers to the appearance of a causal linear relationship when, in fact, there is no relation.
Simple Linear Regression
The purpose of simple linear regression is to explain the variation in a dependent variable in terms of the variation in a single independent variable.
Dependent vs. independent variable
Linear Regression Assumptions
Slope Coefficient
slope coefficient for the regression line describes the change in Y for a one unit change in X.
Sum of squared errors (SSE)
The sum of the squared vertical distances between the estimated and actual Y-values is referred to as the sum of squared errors (SSE).
Simple linear regression is frequently referred to as ordinary least squares (OLS) regression, and the values estimated by the estimated regression equation, Yi, are called least squares estimates.
Standard error of estimate (SEE)
Coefficient of determination (R2)
Analysis of variance (ANOVA)
a statistical procedure for analyzing the total variability of the dependent variable.
Total sum of squares (SST)
Regression sum of squares (RSS)
Sum of squared errors (SSE)