Skip to main content

Simple Linear Regression

 Simple Linear Regression

 

Simple linear regression is used to find out the best relationship between a single input variable (predictor, independent variable, input feature, input parameter) & output variable (predicted, dependent variable, output feature, output parameter) provided that both variables are continuous in nature. This relationship represents how an input variable is related to the output variable and how it is represented by a straight line.


After looking at scatter plot we can understand:

  1. The direction
  2. The strength
  3. The linearity

The above characteristics are between variable Y and variable X. The above scatter plot shows us that variable Y and variable X possess a strong positive linear relationship. Hence, we can project a straight line which can define the data in the most accurate way possible.

If the relationship between variable X and variable Y is strong and linear, then we conclude that particular independent variable X is the effective input variable to predict dependent variable Y.

To check the collinearity between variable X and variable Y, we have correlation coefficient (r), which will give you numerical value of correlation between two variables. You can have strong, moderate or weak correlation between two variables. Higher the value of “r”, higher the preference given for particular input variable X for predicting output variable Y. Few properties of “r” are listed as follows:

  1. Range of r: -1 to +1
  2. Perfect positive relationship: +1
  3. Perfect negative relationship: -1
  4. No Linear relationship: 0
  5. Strong correlation: r > 0.85 (depends on business scenario)

Command used for calculation “r” in RStudio is:

> cor(X, Y)

where, X: independent variable & Y: dependent variable Now, if the result of the above command is greater than 0.85 then choose simple linear regression.

If r < 0.85 then use transformation of data to increase the value of “r” and then build a simple linear regression model on transformed data.

Steps to Implement Simple Linear Regression:

  1. Analyze data (analyze scatter plot for linearity)
  2. Get sample data for model building
  3. Then design a model that explains the data
  4. And use the same developed model on the whole population to make predictions.

Comments

Popular posts from this blog

Data science: A Blend of These Data Components

Data Science Course ExcelR offers Data Science course, the most comprehensive Data Science course in the market, covering the complete Data Science lifecycle concepts from Data Collection, Data Extraction, Data Cleansing, Data Exploration, Data Transformation, Feature Engineering, Data Integration, Data Mining, building Prediction models, Data Visualization and deploying the solution to the customer. Skills and tools ranging from  Statistical Analysis , Text Mining, Regression Modelling, Hypothesis Testing, Predictive Analytics,  Machine Learning ,   Deep Learning, Neural Networks, Natural Language Processing, Predictive Modelling, R Studio, XLMiner,  Tableau , Spark, Hadoop, Minitab, programming languages like R programming ,  Python are covered extensively as part of this Data Science training. ExcelR is considered as the best Data Science training institute which offers services from training to placement as part of the Data Science training program wi...

Machine Learning Crash Course

Information science requires mastering in various fields like machine learning, R programming, Python, deep studying and plenty of extra. Machine Learning Engineer is categorized as one of the refined profiles due to the work profile in addition to the wage packages that the candidates draw from the companies. Many reinforcement studying algorithms use dynamic programming methods. This implies, through the use of a selected kind of algorithm, machines are given the facility of understanding the command and studying from there with out giving any further instructions or without programming again and again. It has been an important studying expertise for me. Online classroom training for Machine Learning Certification is carried out through online reside streaming of each class. The classification of information and labeling the information are trained in case of the supervised learning. The programme is designed for technology professionals who wish to advance their career as a spe...

Machine Studying Course Bangalore

Be taught the fundamentals of Machine Learning on this introductory course. A central application of unsupervised studying is within the field of density estimation in statistics , 26 though unsupervised learning encompasses other domains involving summarizing and explaining data features. According to Gartner, 2.three million Machine Learning Jobs shall be generated by 2020. If you have already got some experience in expertise and are serious about building a profession in the futuristic areas of Machine Studying, Deep Studying and AI, the PGPDM can equip you with all the abilities and instruments you need. The implementation particulars of these deep studying fashions along with tuning of the parameters will be illustrated in this course. I'd spent too much time building deep studying networks for ready datasets moderately than entering into the trenches and exploring data from scratch. I have completed my Machine studying training from Cetpa,Here i have discovered indepth in...