Posts

Showing posts from September, 2023

Machine Learning and Predictive Scaling

Machine Learning is a branch of artificial intelligence (AI) that focuses on the use of data and algorithms to provide predictive analytics, which can gradually self-improve its predicting accuracy over the learning course. Applying Machine Learning in a wide variety of use cases has become a fast growing phenomenon in recent years. This includes auto scaling in cloud computing.  Auto scaling, in cloud computing, refers to the capability of scaling the compute resources as per the demand at the time – when the compute needed is higher than the system is currently running, the system scales out (adds more) compute resources to better share the load; when the needed compute is lower, the system scales in (reduces the capacity of) the compute for the benefits both financially and environmentally. Some metrics are used to indicate the level of needed compute. For example, CPU utilisation is one of the common metrics used. Let’s use Amazon EC2 compute service as an example. Say there are cu