Understanding Logistic Regression: From Mathematics to Machine Learning Model | by Jugal Gajjar

Think about your self standing on the dockyard in 1912, witnessing the magnificent RMS Titanic embark on its maiden voyage. Sadly, destiny had a unique course deliberate — the “unsinkable” ship tragically struck an iceberg just some days later, ceaselessly etching its story in maritime historical past.

Quick ahead to immediately. Whereas we are able to’t change the previous, the facility of information and statistical modeling permits us to discover “what if” situations.

Get able to set sail on a voyage of discovery! This text shall be your information as we discover the fascinating world of Logistic Regression. Utilizing the fascinating story of the Titanic catastrophe as our anchor, we’ll embark on a journey to know how Logistic Regression works. All through this text, you’ll achieve a complete understanding of Logistic Regression, from its mathematical core to its sensible implementation.

Logistic regression is a robust statistical method used for classification issues. Not like linear regression, which predicts steady values, logistic regression focuses on predicting the chance of an occasion belonging to a particular class. This makes it notably helpful for issues the place the result will be categorised into distinct teams; in our case, the occasion is whether or not a passenger survived the Titanic catastrophe, and the classes are “survived” (1) or “not survived” (0).

By analyzing varied components that may have influenced survival probabilities (e.g., passenger class, age, gender), logistic regression estimates the chance of a passenger falling into the “survived” class based mostly on these components. This permits us to not solely make predictions about particular person passengers but additionally achieve insights into the general tendencies that affected survival charges.

Logistic regression, like every ML mannequin, includes some mathematical calculations. Let’s step-by-step focus on the probabilistic magic behind it:

1. Linear Regression Basis

Think about now we have a linear regression mannequin that predicts a steady worth; let’s name it z. This worth represents a linear mixture of weighted options (e.g., passenger class, age, gender) for a particular passenger:

z = w_1x_1 + w_2x_2 + ... + w_nx_n + b

Right here, w_i represents the weights assigned to every function x_i, and b is the bias time period. The linear regression mannequin basically suits a straight line by the information factors, minimizing the error between the expected z and precise final result (e.g., survived or not survived).

2. The Sigmoid Operate

Whereas linear regression works properly for steady predictions, logistic regression offers with possibilities between 0 and 1. To realize this, we introduce the sigmoid operate (additionally referred to as the logistic operate). This S-shaped operate takes any actual quantity as enter (the linear regression output z) and squishes it between 0 and 1.

f(z) = σ(z) =     1
----------
1 + e^-z

As z will increase, the output of the sigmoid operate, denoted by f(z), approaches 1. Conversely, as z decreases, f(z) approaches 0. This transformation permits us to interpret the mannequin’s output as a chance:

Values near 1 point out a excessive chance of the occasion occurring (e.g., passenger surviving).
Values near 0 point out a low chance of the occasion occurring (e.g., passengers not surviving).

3. Loss Operate

To coach the logistic regression mannequin, we want a technique to measure how properly it performs. Right here, we use the binary cross-entropy loss operate, which penalizes the mannequin for incorrect predictions. It calculates the typical distinction between the expected possibilities (f(z)) and the precise outcomes (y). (0 for not surviving, 1 for surviving)

Right here, N represents the whole variety of information factors, y_i represents the precise label for information level i (for binary classification, it’s 1 or 0), and p(y_i) represents the expected chance of the mannequin classifying information level i as belonging to the constructive class. The mannequin goals to attenuate this loss operate throughout coaching by adjusting the weights and bias phrases.

4. Optimization: Gradient Descent

To reduce the loss operate and enhance the mannequin’s efficiency, we use an optimization algorithm referred to as gradient descent. It iteratively updates the weights and bias within the path that results in the steepest lower within the loss operate. With every replace, the mannequin turns into higher at predicting the chances of the occasion occurring.

Right here, w represents the burden parameters of the mannequin being educated, b represents the bias parameter of the mannequin being educated, α represents the educational price, which controls how a lot the weights and bias are adjusted in every iteration, and Loss represents the operate whose value you are attempting to attenuate.

Now that now we have a stable understanding of the mathematical basis of logistic regression, let’s get hands-on by implementing a easy logistic regression mannequin from scratch in Python utilizing the Titanic survival dataset.

1. Import Libraries and Load Information

import pandas as pd
import numpy as np# Load the Titanic dataset
url = "https://uncooked.githubusercontent.com/datasciencedojo/datasets/grasp/titanic.csv"
titanic_df = pd.read_csv(url)
# Show the primary few rows of the dataset
titanic_df.head()

2. Fundamental Information Cleansing and Preprocessing

To arrange the information for modeling, we have to deal with lacking values and convert categorical variables to numerical values.

# Fill lacking values for 'Age' with the median worth
titanic_df['Age'].fillna(titanic_df['Age'].median(), inplace=True)# Fill lacking values for 'Embarked' with the mode worth
titanic_df['Embarked'].fillna(titanic_df['Embarked'].mode()[0], inplace=True)
# Drop the 'Cabin' column because it has too many lacking values
titanic_df.drop(columns=['Cabin'], inplace=True)
# Convert categorical variables to numeric
titanic_df = pd.get_dummies(titanic_df, columns=['Sex', 'Embarked'], drop_first=True)
# Drop columns that will not be used as options
titanic_df.drop(columns=['Name', 'Ticket', 'PassengerId'], inplace=True)
# Show the primary few rows of the cleaned dataset
titanic_df.head()

3. Outline Options (X) and Goal Variable (y)

Subsequent, we have to outline our function matrix X and goal variable y.

# Outline the goal variable 'y'
y = titanic_df['Survived'].values# Outline the function matrix 'X'
X = titanic_df.drop(columns=['Survived']).values
# Show the shapes of X and y
print(X.form, y.form)

4. Information Standardization

Right here, we carry out standardization, which helps obtain smoother gradients by inserting options on the same scale, stopping options with bigger scales from dominating the updates.

# Standardize the options
from sklearn.preprocessing import StandardScaler
scaler = StandardScaler()
X = scaler.fit_transform(X)# Convert X (options) to NumPy array for environment friendly calculations
X = np.array(X)
y = np.array(y)

5. Logistic Regression

Now, we implement the logistic regression mannequin from scratch.

class LogisticRegression:
def __init__(self, learning_rate=0.05, iterations=1000):
self.learning_rate = learning_rate
self.iterations = iterationsdef sigmoid(self, z):
return 1 / (1 + np.exp(-z))
def match(self, X, y):
# Initialize weights and bias
self.weights = np.zeros(X.form[1])
self.bias = 0
m = X.form[0]
for _ in vary(self.iterations):
# Linear mannequin
z = np.dot(X, self.weights) + self.bias
# Apply sigmoid operate
h = self.sigmoid(z)
# Compute gradients
d_weights = (1 / m) * np.dot(X.T, (h - y))
d_bias = (1 / m) * np.sum(h - y)
# Replace weights and bias
self.weights -= self.learning_rate * d_weights
self.bias -= self.learning_rate * d_bias
def predict(self, X):
# Linear mannequin
z = np.dot(X, self.weights) + self.bias
# Apply sigmoid operate
h = self.sigmoid(z)
# Convert possibilities to binary predictions
return np.the place(h >= 0.5, 1, 0)
# Initialize the mannequin
mannequin = LogisticRegression()
# Prepare the mannequin
mannequin.match(X[:int(0.9*len(X))], y[:int(0.9*len(X))])

6. Prediction

Lastly, we use the educated mannequin to make predictions based mostly on the check information.

# Make predictions
predictions = mannequin.predict(X[int(0.9*len(X)):])# Consider the mannequin
accuracy = np.imply(predictions == y[int(0.9*len(X)):])
print(f'Accuracy: {accuracy * 100:.2f}%')

By following these steps, you could have carried out a logistic regression mannequin from scratch utilizing Python. This hands-on train helps solidify your understanding of logistic regression and its software in a real-world dataset.

On this part, we’ll use the scikit-learn library to implement a logistic regression mannequin. This strategy simplifies the method by leveraging the highly effective instruments offered by scikit-learn.

1. Import Libraries

from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score

2. Create a Linear Regression Mannequin

mannequin = LogisticRegression()

3. Match & Predict

# Prepare the mannequin
mannequin.match(X[:int(0.9*len(X))], y[:int(0.9*len(X))])# Make predictions on the check set
y_pred = mannequin.predict(X[int(0.9*len(X)):])

4. Consider

# Consider the mannequin
accuracy = accuracy_score(y[int(0.9*len(X)):], y_pred)
print(f'Accuracy: {accuracy * 100:.2f}%')

Utilizing scikit-learn, you’ll be able to see how simple it’s to implement and consider a logistic regression mannequin. This strategy is environment friendly and leverages the highly effective built-in functionalities offered by the library.

Having explored logistic regression, we’ve found a robust software for sorting information into classes. It goes past simply predicting outcomes — it estimates the prospect of one thing taking place, like whether or not a passenger survived the Titanic. This system isn’t restricted to historical past; it’s utilized in finance, healthcare, and advertising too. As you delve deeper into machine studying, logistic regression turns into step one in your journey to sort out classification issues.

Source link

Unlocking the Power of BERT: How Machines are Learning to Understand Language | by Anmol Kumar | Sep, 2024

Uncovering Fraud with Machine Learning: Real-Time Detection in the Finance Industry | by vyshnaviallam | Sep, 2024

𝐓𝐡𝐞 𝐄𝐯𝐨𝐥𝐮𝐭𝐢𝐨𝐧 𝐨𝐟 𝐃𝐞𝐜𝐞𝐧𝐭𝐫𝐚𝐥𝐢𝐳𝐞𝐝 𝐬𝐲𝐬𝐭𝐞𝐦𝐬 𝐰𝐢𝐭𝐡 𝐀rtificial Intelligence | by Web3Chef | Sep, 2024

YouTube is now showing ads when you pause videos

Unlocking the Power of BERT: How Machines are Learning to Understand Language | by Anmol Kumar | Sep, 2024

Anker recalls three power banks due to fire risk – stop using them now

Black Mirror season 7 cast revealed in a cryptic computer message

Uncovering Fraud with Machine Learning: Real-Time Detection in the Finance Industry | by vyshnaviallam | Sep, 2024

Most Popular

The Hamas Threat of Hostage Execution Videos Looms Large Over Social Media

Revolutionizing the Way We Find Love

Federal Investigators Widen Tesla Inquiry, Company Says

Our Picks

YouTube is now showing ads when you pause videos

Unlocking the Power of BERT: How Machines are Learning to Understand Language | by Anmol Kumar | Sep, 2024

Anker recalls three power banks due to fire risk – stop using them now

Understanding Logistic Regression: From Mathematics to Machine Learning Model | by Jugal Gajjar | Jul, 2024

1. Linear Regression Basis

2. The Sigmoid Operate

3. Loss Operate

4. Optimization: Gradient Descent

1. Import Libraries and Load Information

2. Fundamental Information Cleansing and Preprocessing

3. Outline Options (X) and Goal Variable (y)

4. Information Standardization

5. Logistic Regression

6. Prediction

1. Import Libraries

2. Create a Linear Regression Mannequin

3. Match & Predict

4. Consider

Related Posts