Deep Learning and Machine Learning for Stock Predictions

Description: This repository is for learning, studying, researching, and analyzing stocks with deep learning (DL) and machine learning (ML). It predicts stock prices with different types of ML and DL algorithms, and experiments with stock data to see how a method works and why it does or does not work. It also applies different stock strategies, and uses technical analysis or fundamental analysis as inputs, to predict future stock prices over the long term or the short term.

Machine learning is a subset of artificial intelligence concerned with creating algorithms that can adjust themselves without human intervention, producing an output by learning from structured data. Deep learning is a subset of machine learning in which the algorithms are built and function in a similar way, but they are arranged in many layers, and each layer gives a different interpretation of the data it is fed. This network of algorithms is called an artificial neural network, and it is loosely modeled on the neural connections that exist in the human brain.

Languages and Tools:

Three main types of data: Categorical, Discrete, and Continuous variables

Categorical variable (Qualitative): Label data or distinct groups.
Example: location, gender, material type, payment, highest level of education
Discrete variable (Class Data): Numeric variables that have a countable number of values between any two values.
Example: customer complaints, number of flaws or defects, children per household, age (number of years)
Continuous variable (Quantitative): Numeric variables that have an infinite number of values between any two values.
Example: length of a part, the date and time a payment is received, running distance, age (infinitely accurate, using an infinite number of decimal places)
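A minimal sketch of how the three variable types might be told apart in Python. The helper `variable_kind` and the sample columns are hypothetical, and the type-based rule is only a heuristic (an integer code can still stand for a category).

```python
def variable_kind(values):
    """Guess a variable's kind from the Python types of its values (heuristic)."""
    if all(isinstance(v, str) for v in values):
        return "categorical"  # labels or distinct groups
    if all(isinstance(v, int) and not isinstance(v, bool) for v in values):
        return "discrete"     # countable whole numbers
    if all(isinstance(v, (int, float)) and not isinstance(v, bool) for v in values):
        return "continuous"   # any real value within a range
    return "unknown"


# Hypothetical sample columns, one per variable type.
columns = {
    "payment": ["card", "cash", "card"],
    "children_per_household": [0, 2, 1],
    "running_distance_km": [5.2, 10.0, 3.75],
}

kinds = {name: variable_kind(values) for name, values in columns.items()}
for name, kind in kinds.items():
    print(name, "->", kind)
```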

Data Use

'Quantitative data' can be used with all three centre measures (mean, median, and mode) and all spread measures.
'Class data' can be used with median and mode.
'Qualitative data' can be used only with mode.
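The rules above can be sketched with Python's standard `statistics` module. The sample values are made up for illustration; only the choice of measures per data type follows the text.

```python
import statistics

# Hypothetical samples, one per data type.
running_distance_km = [5.2, 10.0, 3.75, 5.2]   # quantitative (continuous)
children_per_household = [0, 2, 1, 2, 3]       # class data (discrete)
payment = ["card", "cash", "card"]             # qualitative (categorical)

# Quantitative data: mean, median, mode, and a spread measure all apply.
quantitative_summary = {
    "mean": statistics.mean(running_distance_km),
    "median": statistics.median(running_distance_km),
    "mode": statistics.mode(running_distance_km),
    "stdev": statistics.stdev(running_distance_km),
}

# Class data: median and mode.
class_summary = {
    "median": statistics.median(children_per_household),
    "mode": statistics.mode(children_per_household),
}

# Qualitative data: mode only (a mean of labels has no meaning).
qualitative_summary = {"mode": statistics.mode(payment)}

print(quantitative_summary)
print(class_summary)
print(qualitative_summary)
```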

Two types of problems:

Classification (predict label)
Regression (predict values)
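As a rough illustration of the two problem types, the same price series can be turned into either kind of target. The closing prices below are invented, and the names `next_close` and `direction` are hypothetical.

```python
# Hypothetical daily closing prices.
closes = [100.0, 101.5, 101.0, 103.0, 102.0]

# Regression target: the next day's closing price (a value).
next_close = closes[1:]

# Classification target: 1 if the next close is higher than today's, else 0 (a label).
direction = [1 if nxt > cur else 0 for cur, nxt in zip(closes[:-1], closes[1:])]

print("regression targets:    ", next_close)
print("classification targets:", direction)
```

The last day has no "next day", so both targets are one element shorter than the price series.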

Bias-Variance Tradeoff

Bias

Bias is the difference between the actual values and the model's average predicted values.
Bias comes from the simplifying assumptions a model makes about the data in order to predict new data.
These assumptions make the target function easier to learn, but a model with high bias is too simple and tends to underfit.

Variance

Variance is the counterpart of bias: reducing one tends to increase the other.
Variance is the variability of the model's prediction for a given data point, a value that tells us how spread out the predictions are.
If you train a model on one set of training data and obtain a very low error, then change the data, train the same model again, and obtain a high error, the model has high variance.
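A small simulation can make bias and variance concrete. This is only a sketch under stated assumptions: the true function `x * x`, the noise level, and the two toy models (predict the training mean, predict the nearest training point) are all chosen for illustration and do not come from the notebooks.

```python
import random
import statistics


def true_f(x):
    return x * x


def sample_training_set(rng, n=20, noise=0.2):
    """Draw n noisy observations of true_f on [0, 1]."""
    xs = [rng.random() for _ in range(n)]
    ys = [true_f(x) + rng.gauss(0.0, noise) for x in xs]
    return xs, ys


def predict_mean(xs, ys, x0):
    """Very simple model: always predict the average training target."""
    return statistics.mean(ys)


def predict_nearest(xs, ys, x0):
    """Very flexible model: copy the target of the nearest training point."""
    nearest = min(range(len(xs)), key=lambda k: abs(xs[k] - x0))
    return ys[nearest]


def bias_sq_and_variance(predict, x0, trials=500, seed=0):
    """Retrain on many fresh training sets and summarise the predictions at x0."""
    rng = random.Random(seed)
    preds = []
    for _ in range(trials):
        xs, ys = sample_training_set(rng)
        preds.append(predict(xs, ys, x0))
    bias_sq = (statistics.mean(preds) - true_f(x0)) ** 2
    variance = statistics.pvariance(preds)
    return bias_sq, variance


mean_bias_sq, mean_var = bias_sq_and_variance(predict_mean, x0=0.9)
nn_bias_sq, nn_var = bias_sq_and_variance(predict_nearest, x0=0.9)
print("mean model:    bias^2=%.4f variance=%.4f" % (mean_bias_sq, mean_var))
print("nearest point: bias^2=%.4f variance=%.4f" % (nn_bias_sq, nn_var))
```

In this setup the mean model should show the larger squared bias and the nearest-point model the larger variance, which is the tradeoff described above.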

Overfitting, Underfitting, and the bias-variance tradeoff

Overfitting is when the model memorizes the noise and fits too closely to the training set. A good fit is a model that learns the training dataset and generalizes well to the hold-out dataset. Underfitting is when the model cannot establish the dominant trend within the data, which results in training errors and poor performance of the model.

Overfitting:

An overfitted model looks good on the training data because it fits, or at least comes near, each observation; however, the model misses the point and captures random noise. The model has low training error and high CV error, low in-sample error and high out-of-sample error, and high variance.

High Train Accuracy
Low Test Accuracy

Avoiding Overfitting:

Early stopping - stop the training before the model starts learning the noise within the data.
Training with more data - adding more data can increase the accuracy of the model and help algorithms detect the signal better.
Data augmentation - add clean and relevant data into the training data.
Feature selection - use the important features within the data and remove the irrelevant ones.
Regularization - penalize model complexity with methods such as L1 (Lasso) regularization, L2 (Ridge) regularization, and dropout.
Ensemble methods - combine predictions from multiple separate models, for example with bagging and boosting.
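Of the techniques listed above, early stopping is easy to sketch without any library. The function `early_stopping` and its `patience` parameter are hypothetical names, and the validation losses are invented; deep learning frameworks ship their own versions of this idea.

```python
def early_stopping(val_losses, patience=2):
    """Return (best_epoch, stop_epoch) for a sequence of validation losses.

    Training stops once the loss has failed to improve for `patience`
    consecutive epochs; the weights from `best_epoch` would be kept.
    """
    best_epoch, best_loss, waited = 0, float("inf"), 0
    for epoch, loss in enumerate(val_losses):
        if loss < best_loss:
            best_epoch, best_loss, waited = epoch, loss, 0
        else:
            waited += 1
            if waited >= patience:
                return best_epoch, epoch
    return best_epoch, len(val_losses) - 1


# Hypothetical validation losses: they improve, then start rising as noise is learned.
val_losses = [0.90, 0.70, 0.60, 0.65, 0.70, 0.50]
best_epoch, stop_epoch = early_stopping(val_losses, patience=2)
print("best epoch:", best_epoch, "stopped at epoch:", stop_epoch)
```

With `patience=2`, training stops at epoch 4 and never sees the later improvement at epoch 5; a larger patience trades extra training time for a lower risk of stopping too soon.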

Good fit:

High Train Accuracy
High Test Accuracy

Underfitting:

An underfitted model is too simple, so it does not capture the underlying logic of the data. Therefore, the model has low accuracy and does not have strong predictive power. The model has large training set error, large in-sample error, and high bias.

Low Train Accuracy
Low Test Accuracy
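The train/test accuracy patterns for the three cases can be summarised in a small helper. This is a sketch only: `diagnose_fit` is a hypothetical function, and the thresholds `good` and `max_gap` are arbitrary assumptions that would need tuning for a real problem.

```python
def diagnose_fit(train_acc, test_acc, good=0.80, max_gap=0.10):
    """Label a model from its train and test accuracy (thresholds are assumptions)."""
    if train_acc < good:
        return "underfitting"   # low train accuracy (and usually low test accuracy)
    if train_acc - test_acc > max_gap:
        return "overfitting"    # high train accuracy, low test accuracy
    return "good fit"           # high train accuracy, high test accuracy


# Hypothetical accuracy pairs (train, test).
for train_acc, test_acc in [(0.99, 0.62), (0.91, 0.88), (0.55, 0.53)]:
    print(train_acc, test_acc, "->", diagnose_fit(train_acc, test_acc))
```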

Avoiding Underfitting:

Decrease regularization - regularization reduces the variance of a model by applying a penalty to the input parameters with the larger coefficients (for example L1 regularization, L2 regularization, or dropout); if the penalty is too strong the model underfits, so reduce it.
Increase the duration of training - stopping the training too early can leave the model underfitted, so train for longer.
Feature selection - if not enough predictive features are present, adding more features or features with greater importance would improve the model.
Increase the number of features - perform feature engineering.
Remove noise from the data
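To illustrate the first item, here is a minimal sketch of a one-feature model `y ~ w * x` with an L2 penalty. The data and the penalty values are made up; the point is only that a very strong penalty shrinks the weight toward zero and raises the training error, and that decreasing the penalty lowers it again.

```python
def ridge_weight(xs, ys, lam):
    """Closed-form weight minimising sum((y - w*x)**2) + lam * w**2."""
    return sum(x * y for x, y in zip(xs, ys)) / (sum(x * x for x in xs) + lam)


def mse(xs, ys, w):
    """Mean squared training error of the model y ~ w * x."""
    return sum((y - w * x) ** 2 for x, y in zip(xs, ys)) / len(xs)


# Hypothetical data that roughly follows y = 2 * x.
xs = [1.0, 2.0, 3.0, 4.0]
ys = [2.1, 3.9, 6.2, 7.8]

# Decreasing penalty strengths, from heavily regularized to unregularized.
penalties = [100.0, 10.0, 0.0]
weights = [ridge_weight(xs, ys, lam) for lam in penalties]
errors = [mse(xs, ys, w) for w in weights]

for lam, w, err in zip(penalties, weights, errors):
    print("lambda=%6.1f  weight=%.3f  training MSE=%.3f" % (lam, w, err))
```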

Python Reviews

Steps 1 through 8 are a review of Python.
After step 8 comes everything you need to know that relates to data analysis, data engineering, data science, machine learning, and deep learning.

List of Machine Learning Algorithms for Stock Trading

Most Common Regression Algorithms

Linear Regression Model
Logistic Regression
Lasso Regression
Support Vector Machines
Polynomial Regression
Stepwise Regression
Ridge Regression
Multivariate Regression Algorithm
Multiple Regression Algorithm
K Means Clustering Algorithm
Naïve Bayes Classifier Algorithm
Random Forests
Decision Trees
Nearest Neighbours
ElasticNet Regression
Reinforcement Learning
Artificial Intelligence
MultiModal Network
Biologic Intelligence

Different Types of Machine Learning Algorithms and Models

An algorithm is a process, or set of instructions, for solving a class of problems. Algorithms perform computations such as calculations, data processing, automated reasoning, and other tasks. A machine learning algorithm is a method that gives a system the ability to learn and improve automatically from experience without being explicitly programmed.
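As a small example of "learning from experience", the sketch below fits a straight line to data by ordinary least squares: the slope and intercept are not written by hand but computed from the observations. The function `fit_line` and the toy series are hypothetical and stand in for the fuller notebooks.

```python
def fit_line(xs, ys):
    """Ordinary least squares fit of y ~ intercept + slope * x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical toy series: day index and a value that rises by 2 per day.
days = [0, 1, 2, 3]
values = [1.0, 3.0, 5.0, 7.0]

slope, intercept = fit_line(days, values)
prediction_day_4 = intercept + slope * 4
print("slope:", slope, "intercept:", intercept, "prediction for day 4:", prediction_day_4)
```

Real price series are noisy and non-stationary, so a straight line fitted this way is a teaching device rather than a trading signal.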

Prerequisites

Python 3.5+
Jupyter Notebook Python 3

🔲 Add more algorithms and different types of algorithms

Authors

* Tin Hang

Disclaimer

🔻 Do not use this code for investing or trading in the stock market. However, if you are interested in the stock market, you should read 📚 books that relate to the stock market, investment, or finance. If you are into quant or machine learning, read books about 📘 machine trading, algorithmic trading, and quantitative trading. You should read 📗 about machine learning and deep learning to understand the concepts, theory, and mathematics. You should also read academic papers and do research online about machine learning and deep learning on 💻

Folders and files

Stock_Algorithms
Stock_Apps
001_Pandas.ipynb
002_Numpy.ipynb
003_Matrix.ipynb
004_Data_PreProcessing.ipynb
005_Pre_Proccessing (Part_2).ipynb
006_Data_Visualization.ipynb
007_Understand_Data.ipynb
008_Basic_Statistics.ipynb
ANOVA_F_value.ipynb
Array_Selection_Numpy.ipynb
Basic_Machine_Learning_Predicts.ipynb
Categorical_Continuous.ipynb
Chi_Squared.ipynb
Column_Selection_Pandas.ipynb
DL_Title.PNG
Data_Cleaning_for_Machine_Learning.ipynb
Descriptive_Statistics.ipynb
Drop_Highly_Correlated_Features.ipynb
Feature_Importance_Classification.ipynb
Feature_Importance_Continuous.ipynb
Features_Analysis.ipynb
Features_Extraction.ipynb
Features_Extraction_with_PCA.ipynb
Features_Rank.ipynb
Features_Scores.ipynb
Features_Selections.ipynb
Features_Transformation.ipynb
In_Sample_Out_Sample.ipynb
LICENSE
Linear_Regression_Stock.ipynb
Logistic_Regression_Stock.ipynb
Metric.ipynb
Nested_Cross-Validation_Part2.ipynb
NetworkX.ipynb
Poisson_Regression.ipynb
Principal_Component_Analysis_(PCA).ipynb
Principal_Component_Analysis_(PCA)_Stock.ipynb
Probabilities.ipynb
README.md
Scaling_and_Transformations.ipynb
Split_Data.ipynb
Stationary_Check.ipynb
Stationary_Check_Part_2.ipynb
Tensorflow_Basics.ipynb
Title.PNG
Train_Test_Split.ipynb
Train_Validate_Test.ipynb
Underfitting_Overfitting_Check_Regression.ipynb
Understand_Data.ipynb
Variance_Inflation_Factor.ipynb


tianbaoluo/Deep-Learning-Machine-Learning-Stock
