Editors' picksOPEN-SOURCE SCRIPT

Updated

The SuperTrend AI indicator is a novel take on bridging the gap between the K-means clustering machine learning method & technical indicators. In this case, we apply K-Means clustering to the famous SuperTrend indicator.

🔶**USAGE**

Users can interpret the SuperTrend AI trailing stop similarly to the regular SuperTrend indicator. Using higher minimum/maximum factors will return longer-term signals.

The displayed performance metrics displayed on each signal allow for a deeper interpretation of the indicator. Whereas higher values could indicate a higher potential for the market to be heading in the direction of the trend when compared to signals with lower values such as 1 or 0 potentially indicating retracements.

In the image above, we can notice more clear examples of the performance metrics on signals indicating trends, however, these performance metrics cannot perform or predict every signal reliably.

We can see in the image above that the trailing stop and its adaptive moving average can also act as support & resistance. Using higher values of the*performance memory* setting allows users to obtain a longer-term adaptive moving average of the returned trailing stop.

🔶**DETAILS**

🔹**K-Means Clustering**

When observing data points within a specific space, we can sometimes observe that some are closer to each other, forming groups, or "Clusters". At first sight, identifying those clusters and finding their associated data points can seem easy but doing so mathematically can be more challenging. This is where cluster analysis comes into play, where we seek to group data points into various clusters such that data points within one cluster are closer to each other. This is a common branch of AI/machine learning.

Various methods exist to find clusters within data, with the one used in this script being**K-Means Clustering**, a simple iterative unsupervised clustering method that finds a user-set amount of clusters.

A naive form of the K-Means algorithm would perform the following steps in order to find*K* clusters:

To explain how K-Means works graphically let's take the example of a one-dimensional dataset (which is the dimension used in our script) with two apparent clusters:

This is of course a simple scenario, as*K* will generally be higher, as well the amount of data points. Do note that this method can be very sensitive to the initialization of the centroids, this is why it is generally run multiple times, keeping the run returning the best centroids.

🔹**Adaptive SuperTrend Factor Using K-Means**

The proposed indicator rationale is based on the following hypothesis:

Performing the calculation of the indicator using the best setting at time*t* would return an indicator whose characteristics adapt based on its performance. However, what if the setting of the best-performing instance and second best-performing instance of the indicator have a high degree of disparity without a high difference in performance?

Even though this specific case is rare its however not uncommon to see that performance can be similar for a group of specific settings (this could be observed in a parameter optimization heatmap), then filtering out desirable settings to only use the best-performing one can seem too strict. We can as such reformulate our first hypothesis:

Finding this group of best-performing instances could be done using the previously described K-Means clustering method, assuming three groups of interest (K = 3) defined as worst performing, average performing, and best performing.

We first obtain an analog of performance*P(t, factor)* described as:

where 1 >*α* > 0, which is the performance memory determining the degree to which older inputs affect the current output. *C(t)* is the closing price, and *S(t, factor)* is the SuperTrend signal generating function with multiplicative factor *factor*.

We run this performance function for multiple*factor* settings and perform K-Means clustering on the multiple obtained performances to obtain the best-performing cluster. We initiate our centroids using quartiles of the obtained performances for faster centroids convergence.

The average of the factors associated with the best-performing cluster is then used to obtain the final factor setting, which is used to compute the final SuperTrend output.

Do note that we give the liberty for the user to get the final factor from the best, average, or worst cluster for experimental purposes.

🔶**SETTINGS**

🔹**Optimization**

This group of settings affects the runtime performances of the script.

🔶

Users can interpret the SuperTrend AI trailing stop similarly to the regular SuperTrend indicator. Using higher minimum/maximum factors will return longer-term signals.

The displayed performance metrics displayed on each signal allow for a deeper interpretation of the indicator. Whereas higher values could indicate a higher potential for the market to be heading in the direction of the trend when compared to signals with lower values such as 1 or 0 potentially indicating retracements.

In the image above, we can notice more clear examples of the performance metrics on signals indicating trends, however, these performance metrics cannot perform or predict every signal reliably.

We can see in the image above that the trailing stop and its adaptive moving average can also act as support & resistance. Using higher values of the

🔶

🔹

When observing data points within a specific space, we can sometimes observe that some are closer to each other, forming groups, or "Clusters". At first sight, identifying those clusters and finding their associated data points can seem easy but doing so mathematically can be more challenging. This is where cluster analysis comes into play, where we seek to group data points into various clusters such that data points within one cluster are closer to each other. This is a common branch of AI/machine learning.

Various methods exist to find clusters within data, with the one used in this script being

A naive form of the K-Means algorithm would perform the following steps in order to find

- (1) Determine the amount (K) of clusters to detect.
- (2) Initiate our K centroids (cluster centers) with random values.
- (3) Loop over the data points, and determine which is the closest centroid from each data point, then associate that data point with the centroid.
- (4) Update centroids by taking the average of the data points associated with a specific centroid.
- Repeat steps 3 to 4 until convergence, that is until the centroids no longer change.

To explain how K-Means works graphically let's take the example of a one-dimensional dataset (which is the dimension used in our script) with two apparent clusters:

This is of course a simple scenario, as

🔹

The proposed indicator rationale is based on the following hypothesis:

Given multiple instances of an indicator using different settings, the optimal setting choice at timetis given by the best-performing instance with settings(t).

Performing the calculation of the indicator using the best setting at time

Even though this specific case is rare its however not uncommon to see that performance can be similar for a group of specific settings (this could be observed in a parameter optimization heatmap), then filtering out desirable settings to only use the best-performing one can seem too strict. We can as such reformulate our first hypothesis:

Given multiple instances of an indicator using different settings, an optimal setting choice at timetis given by the average of the best-performing instances with settingss(t).

Finding this group of best-performing instances could be done using the previously described K-Means clustering method, assuming three groups of interest (K = 3) defined as worst performing, average performing, and best performing.

We first obtain an analog of performance

where 1 >

We run this performance function for multiple

The average of the factors associated with the best-performing cluster is then used to obtain the final factor setting, which is used to compute the final SuperTrend output.

Do note that we give the liberty for the user to get the final factor from the best, average, or worst cluster for experimental purposes.

🔶

- ATR Length: ATR period used for the calculation of the SuperTrends.
- Factor Range: Determine the minimum and maximum factor values for the calculation of the SuperTrends.
- Step: Increments of the factor range.
- Performance Memory: Determine the degree to which older inputs affect the current output, with higher values returning longer-term performance measurements.
- From Cluster: Determine which cluster is used to obtain the final factor.

🔹

This group of settings affects the runtime performances of the script.

- Maximum Iteration Steps: Maximum number of iterations allowed for finding centroids. Excessively low values can return a better script load time but poor clustering.
- Historical Bars Calculation: Calculation window of the script (in bars).

Release Notes

- Updated - Minor changes

Get access to our exclusive tools: luxalgo.com

Join our 150k+ community: discord.gg/lux

All content provided by LuxAlgo is for informational & educational purposes only. Past performance does not guarantee future results.

Join our 150k+ community: discord.gg/lux

All content provided by LuxAlgo is for informational & educational purposes only. Past performance does not guarantee future results.

In true TradingView spirit, the author of this script has published it open-source, so traders can understand and verify it. Cheers to the author! You may use it for free, but reuse of this code in publication is governed by House rules. You can favorite it to use it on a chart.

The information and publications are not meant to be, and do not constitute, financial, investment, trading, or other types of advice or recommendations supplied or endorsed by TradingView. Read more in the Terms of Use.