Implementation of Technical Indicators into a Machine Learning framework for Quantitative Trading

The extrapolation terms are designed to increase in machine learning technical analysis value as days proceed, reflecting prices deviations and increasing uncertainties. Simultaneously, their influences are alleviated as time passes a certain point thus, some of the terms will vanish at the end of their prediction window. The main finding of this research is that the application of advanced machine learning-based analytical techniques can provide significant benefits to investors, both in reducing risk and achieving more optimal returns.

A Machine Learning Platform for Stock Investment Recommendation Systems

The body of the candle denotes “Open” and “Close” prices and the wicks represent the day’s “Low” and “High”. The date format has been standardized, and the index is now in chronological order. The closing price plot shows the trend of the stock price over the entire period covered by the dataset. Feature learning is motivated by the fact that machine learning tasks such as classification often require input that is mathematically and computationally convenient to process. However, real-world data such as images, video, and sensory data has not yielded attempts to algorithmically define specific features.

  • Such systems “learn” to perform tasks by considering examples, generally without being programmed with any task-specific rules.
  • Table 1 shows that the study selected 31 enterprises and all are currently large enterprises, mainly in the VN-30 group.
  • A correlation matrix is a useful tool for this purpose as it quantifies the strength of linear relationships between each feature pair and the target.
  • Artificial neural networks (ANNs), or connectionist systems, are computing systems vaguely inspired by the biological neural networks that constitute animal brains.
  • For stock price prediction, LSTM network performance has been greatly appreciated when combined with NLP, which uses news text data as input to predict price trends.
  • The data undergoes a standard train, fit, predit process with the training window being 60% of the data, and the testing window being 40% of the data.

Approaches

Thus, to make our model even more sophisticated, we will create different ML models for each cluster.

Relationships to other fields

Decision tree learning uses a decision tree as a predictive model to go from observations about an item (represented in the branches) to conclusions about the item’s target value (represented in the leaves). It is one of the predictive modelling approaches used in statistics, data mining, and machine learning. Tree models where the target variable can take a discrete set of values are called classification trees; in these tree structures, leaves represent class labels, and branches represent conjunctions of features that lead to those class labels. Decision trees where the target variable can take continuous values (typically real numbers) are called regression trees. In decision analysis, a decision tree can be used to visually and explicitly represent decisions and decision making.

  • However, the extended research direction of this topic can consider combining many machine learning algorithms to improve the predictive performance of the model.
  • It is one of the predictive modelling approaches used in statistics, data mining, and machine learning.
  • We now use the Guassian Mixture clustering algorithm to assign the companies to 17 clusters based on their returns.
  • We explore the dynamics of the stock market and prominent classical methods and deep learning-based approaches that are used to forecast prices and market trends.
  • These two hypotheses establish that there is no means of accurately predicting stock prices.

Comparative study of ten machine learning algorithms for short-term forecasting in gas warning systems

However, as mentioned before, day-to-day prediction is not suitable for a practical analysis. Despite the remarkable advantages of LSTM, its abuse will lead to misleading and suboptimal results. While such structure can seemingly achieve accuracy of up to 97% which is calculated using Eq.

With fundamental analysis, it can then be gauged if the security’s market price is overvalued or undervalued. In case of undervalued prices, investors can expect a rise in the price, whether it happens in the following days or even the upcoming years (Graham 1949). Therefore, it is arguably regarded as the most sophisticated method of investment. Combining various machine learning algorithms can complement and enhance the predictive performance of the model. This study only applies a single machine learning algorithm (that is, the LSTM algorithm).

A simple way of predicting would be to assume that all the companies would follow the same ML model and create this one global model to predict returns for all companies. However, it is possible that different companies/industries react differently to a set of Technical Indicator. One way to solve this problem is to create different ML model for each cluster of companies that are expected to behave similarly perhaps belonging to the same industry, where the “behavior” is captured in their returns. Now that we have formed all the variables that will be used in predicting the stock movement, we need to define the prediction variable. Gaussian processes are popular surrogate models in Bayesian optimisation used to do hyperparameter optimisation.

Table 3 reveals significant variation across the stock prices and trading volumes, with a minimum price of 20.59 and a maximum of 19,100. This wide range suggests that the dataset covers stocks with varying levels of market activity and volatility, supporting a robust analysis of different market conditions. Additionally, the volume statistics reflect diverse trading activities, with an average trading volume of approximately 16.2 million shares and a peak volume of over 380 million shares.

However, the extended research direction of this topic can consider combining many machine learning algorithms to improve the predictive performance of the model. The prediction accuracy of the LSTM model will be compared with the baseline value of 93%. According to the trading regulations on the Ho Chi Minh City Stock Exchange, Vietnam, the maximum fluctuation range of stock prices in one trading session is 7%. Thus, if making a stock price forecast by the simplest method that today’s price will be equal to yesterday’s price (i.e., there is no change in stock price), then the degree of error is 7%. Table 1 shows that the study selected 31 enterprises and all are currently large enterprises, mainly in the VN-30 group. On the Vietnamese stock market, VN-30 is a group of 30 securities with large market capitalization and high liquidity, and is a typical representative of the stock market (here representing VN-Index).

Financial incentives

AR and MA terms, represented by the parameters p and q respectively (refer to Figure 9). For users who would like to explore combinations of a random 5 indicators, they can use the I’m feeling lucky button. To make things clear, let me show an example of how we can trade our top prediction, BIIB, in real life.

This discrepancy stems from the network’s structure, which is a common issue in the literature, often producing misleading results; surprisingly, such studies are published by prestigious journals. In this paper we demonstrate why day-to-day price prediction cannot be used adequately to train neural networks. Meanwhile, we evaluate such models and compare them to a proposed alternative, which is more realistic and aligns more closely with analytical methods used by human experts. Our model does not attempt to predict price directly; rather, it focuses on predicting upcoming trends in the market, which is a more practical and feasible objective. Considering the long-term pattern of each stock and the relative independence of each period, we use a 100-day historical period as the input for our model instead of relying on day-to-day input.

Data collection and research methodology

Biased models may result in detrimental outcomes, thereby furthering the negative impacts on society or objectives. Algorithmic bias is a potential result of data not being fully prepared for training. Machine learning ethics is becoming a field of study and notably, becoming integrated within machine learning engineering teams. The capability of CNN in pattern recognition has been demonstrated in numerous image classification models such as ResNet-50 and YOLO. Given that chart analysists rely on looking for pattern in historical data, CNN can serve as a useful tool for trend forecasting. The same hyperparameters used in the transformer-based model, including optimizer settings and batch size, are also applied to the proposed CNN-based model.

In addition, Gülmez (2023) believed that the LSTM model is suitable for time series data on financial markets in the context of stock prices established on supply and demand relationships. Researching on the Down Jones stock index, which is a market for stocks, bonds and other securities in USA, the authors also did the stock forecasts for the period 2019 to 2023. Another research by Usmani Shamsi (2023) on Pakistan stock market research on general market, industry and stock related news categories and its influence on stock price forecast. This confirms that the LSTM model is being used more widely in stock price forecasting recently. In this paper, we investigated the capability of medium-sized neural networks and their capability for learning the trends of the stock market and forecasting prices. We demonstrated why prior works utilizing LSTM are misleading and impractical for real-world trading environments.

Leave a Comment

Your email address will not be published. Required fields are marked *

Scroll to Top