Precision and Recall in Machine Learning
Published: 05/08/2025
Precision and recall are essential metrics for evaluating the performance of classification models, particularly when dealing with imbalanced datasets. This article will delve into the definitions of precision and recall, illustrate their calculation, and discuss their importance in real-world scenarios.
Fundamental Concepts / Prerequisites
Before understanding precision and recall, you should have a basic grasp of the following concepts; a short code sketch after the list shows how the four counts are tallied:
- Classification Models: Algorithms that categorize data into predefined classes (e.g., spam detection, image recognition).
- True Positives (TP): Instances correctly predicted as positive.
- True Negatives (TN): Instances correctly predicted as negative.
- False Positives (FP): Instances incorrectly predicted as positive (Type I error).
- False Negatives (FN): Instances incorrectly predicted as negative (Type II error).
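As a minimal sketch of how these four counts are tallied in practice, the snippet below compares paired lists of true and predicted labels (the labels themselves are invented for illustration):

# Illustrative labels: 1 = positive class, 0 = negative class.
y_true = [1, 0, 1, 1, 0, 0]
y_pred = [1, 0, 0, 1, 1, 0]

tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
print(tp, tn, fp, fn)  # 2 2 1 1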
Core Implementation/Solution: Calculating Precision and Recall
The following Python code demonstrates how to calculate precision and recall given the number of true positives, false positives, and false negatives.
def calculate_precision_recall(tp, fp, fn):
    """
    Calculates precision and recall for a classification model.

    Args:
        tp (int): True Positives.
        fp (int): False Positives.
        fn (int): False Negatives.

    Returns:
        tuple: (precision, recall). A metric is 0 when its
        denominator is zero, so all-zero inputs yield (0, 0).
    """
    # Precision: proportion of predicted positives that are truly positive.
    if tp + fp == 0:
        precision = 0
    else:
        precision = tp / (tp + fp)
    # Recall: proportion of actual positives the model recovered.
    if tp + fn == 0:
        recall = 0
    else:
        recall = tp / (tp + fn)
    return precision, recall
# Example usage:
tp = 90
fp = 10
fn = 30
precision, recall = calculate_precision_recall(tp, fp, fn)
print(f"Precision: {precision:.2f}")
print(f"Recall: {recall:.2f}")
# Tests to verify that the calculations are correct
def test_precision_recall():
    # Test case 1: a perfect classifier scores 1.0 on both metrics
    tp1, fp1, fn1 = 5, 0, 0
    p1, r1 = calculate_precision_recall(tp1, fp1, fn1)
    assert p1 == 1.0 and r1 == 1.0, "Test Case 1 Failed"
    # Test case 2: equal TP, FP, and FN give 0.5 for both metrics
    tp2, fp2, fn2 = 5, 5, 5
    p2, r2 = calculate_precision_recall(tp2, fp2, fn2)
    assert p2 == 0.5 and r2 == 0.5, "Test Case 2 Failed"
    # Test case 3: all-zero inputs must not raise ZeroDivisionError
    tp3, fp3, fn3 = 0, 0, 0
    p3, r3 = calculate_precision_recall(tp3, fp3, fn3)
    assert p3 == 0 and r3 == 0, "Test Case 3 Failed"
    # Test case 4: non-trivial fractions, compared with a small tolerance
    tp4, fp4, fn4 = 10, 2, 8
    p4, r4 = calculate_precision_recall(tp4, fp4, fn4)
    assert abs(p4 - (10 / 12)) < 1e-6 and abs(r4 - (10 / 18)) < 1e-6, "Test Case 4 Failed"
    print("All test cases passed!")

test_precision_recall()
Code Explanation
The `calculate_precision_recall` function takes three arguments: `tp` (true positives), `fp` (false positives), and `fn` (false negatives). It then calculates precision and recall using the following formulas:
- Precision: TP / (TP + FP). This measures the accuracy of the positive predictions. It represents the proportion of correctly predicted positive instances among all instances predicted as positive.
- Recall: TP / (TP + FN). This measures the ability of the model to find all the positive instances. It represents the proportion of correctly predicted positive instances among all actual positive instances.
The function guards against division by zero by returning 0 for any metric whose denominator is zero (with all-zero inputs it returns (0, 0)). The example demonstrates the function on sample data and prints the results. The `test_precision_recall` function adds test cases to confirm the computation, including edge cases such as zero values for TP, FP, and FN.
Analysis: Definitions of Precision and Recall
Precision: Answers the question: "Of all the instances the model predicted as positive, how many were actually positive?" High precision means the model rarely raises false positives; in spam detection, few legitimate emails get flagged as spam.
Recall: Answers the question: "Of all the actual positive instances, how many did the model correctly identify?" High recall means the model misses few positives; in spam detection, few spam emails slip through. The two metrics typically pull against each other, as the sketch below shows.
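The following hedged sketch reuses `calculate_precision_recall` from above and evaluates a classifier at two different decision thresholds; the scores, labels, and threshold values are all invented for illustration:

# Illustrative model scores and ground-truth labels.
scores = [0.95, 0.85, 0.70, 0.60, 0.40, 0.30, 0.20, 0.10]
labels = [1, 1, 0, 1, 1, 0, 0, 0]

for threshold in (0.5, 0.25):
    # Predict positive whenever the score clears the threshold.
    preds = [1 if s >= threshold else 0 for s in scores]
    tp = sum(1 for p, y in zip(preds, labels) if p == 1 and y == 1)
    fp = sum(1 for p, y in zip(preds, labels) if p == 1 and y == 0)
    fn = sum(1 for p, y in zip(preds, labels) if p == 0 and y == 1)
    precision, recall = calculate_precision_recall(tp, fp, fn)
    print(f"threshold={threshold}: precision={precision:.2f}, recall={recall:.2f}")

Lowering the threshold from 0.5 to 0.25 lifts recall from 0.75 to 1.00 but drops precision from 0.75 to about 0.67: catching more positives comes at the cost of more false alarms.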
Complexity Analysis
The `calculate_precision_recall` function runs in O(1) time, since it performs only a handful of arithmetic operations, and uses O(1) space regardless of the input values. The `test_precision_recall` function likewise runs in O(1) time and space, since each test case uses a fixed amount of memory.
Alternative Approaches
Instead of writing a custom function, you can utilize libraries like scikit-learn in Python. Scikit-learn provides built-in functions for calculating precision and recall from predicted labels and true labels. This is often more convenient and less error-prone, especially when dealing with more complex scenarios like multi-class classification. However, implementing the calculation directly, as shown above, is useful for understanding the underlying concepts.
For example, using scikit-learn:
from sklearn.metrics import precision_score, recall_score
import numpy as np
y_true = np.array([0, 1, 0, 1, 1, 0]) # Ground truth labels
y_pred = np.array([0, 1, 1, 0, 1, 0]) # Predicted labels
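# For these labels: TP = 2 (indices 1 and 4), FP = 1 (index 2), FN = 1 (index 3),
# so precision and recall both work out to 2/3, printed as 0.67.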
precision = precision_score(y_true, y_pred)
recall = recall_score(y_true, y_pred)
print(f"Precision: {precision:.2f}")
print(f"Recall: {recall:.2f}")
This approach offers more features and scales better to large datasets; the downside is the added dependency on an external library.
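For the multi-class scenarios mentioned above, a minimal sketch is shown below; the three-class labels are invented for illustration, and scikit-learn's `average` parameter controls how the per-class scores are combined:

from sklearn.metrics import precision_score, recall_score

# Invented three-class labels, purely for illustration.
y_true_mc = [0, 1, 2, 2, 1, 0]
y_pred_mc = [0, 2, 2, 2, 1, 1]

# 'macro' gives each class equal weight; 'micro' and 'weighted' are
# other supported strategies, and average=None returns the per-class
# scores instead of a single number.
precision_macro = precision_score(y_true_mc, y_pred_mc, average="macro")
recall_macro = recall_score(y_true_mc, y_pred_mc, average="macro")
print(f"Macro precision: {precision_macro:.2f}")
print(f"Macro recall: {recall_macro:.2f}")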
Conclusion
Precision and recall are crucial metrics for evaluating classification models. Precision emphasizes the accuracy of positive predictions, while recall emphasizes the model's ability to find all positive instances. Understanding and appropriately using these metrics helps in selecting and fine-tuning models to achieve the desired balance between avoiding false positives and false negatives for a specific problem domain.