使用FastAPI和Redis Caching加快機器學習模型推理

作者：布加迪 2025-05-14 08:16:46

這篇指南逐步介紹了通過緩存請求并生成快速響應以加快模型推理。?

譯者 | 布加迪

審校 | 重樓

Redis 是一款開源內(nèi)存數(shù)據(jù)結構存儲系統(tǒng)，是機器學習應用領域中緩存的優(yōu)選。它的速度、耐用性以及支持各種數(shù)據(jù)結構使其成為滿足實時推理任務的高吞吐量需求的理想選擇。

我們在本教程中將探討Redis緩存在機器學習工作流程中的重要性。我們將演示如何使用FastAPI和Redis構建一個強大的機器學習應用程序。本教程介紹如何在Windows上安裝Redis、在本地運行Redis以及如何將其集成到機器學習項目中。最后，我們將通過發(fā)送重復請求和獨特請求來測試該應用程序，以驗證Redis緩存系統(tǒng)正常運行。

為什么在機器學習中使用Redis緩存？

在當今快節(jié)奏的數(shù)字環(huán)境中，用戶期望機器學習應用程序能夠立即獲得結果。比如說，使用推薦模型向用戶推薦產(chǎn)品的電商平臺。如果實施Redis來緩存重復請求，該平臺就可以顯著縮短響應時間。

當用戶請求產(chǎn)品推薦時，系統(tǒng)先檢查該請求是否已被緩存。如果已緩存，則在幾微秒內(nèi)返回緩存的響應，從而提供無縫的體驗。如果沒有緩存，模型就處理該請求，生成推薦，并將結果存儲在Redis中供將來的請求使用。這種方法不僅提高了用戶滿意度，還優(yōu)化了服務器資源，使模型能夠高效地處理更多請求。

使用Redis構建網(wǎng)絡釣魚電子郵件分類應用程序

我們在本項目中將構建一個網(wǎng)絡釣魚電子郵件分類應用程序。整個過程包括加載和處理來自Kaggle的數(shù)據(jù)集，使用處理后的數(shù)據(jù)訓練機器學習模型，評估其性能，保存經(jīng)過訓練的模型，最后構建帶有Redis集成機制的FastAPI應用程序。

1. 設置

從Kaggle下載網(wǎng)絡釣魚電子郵件檢測數(shù)據(jù)集，并將其放入到data/目錄。
首先你需要安裝Redis。在終端中運行以下命令安裝Redis Python客戶程序：

pip install redis

如果你使用Windows系統(tǒng)，且未安裝Windows Subsystem for Linux（WSL），請按照微軟指南啟用WSL，并從微軟商店安裝Linux發(fā)行版（比如Ubuntu）。
WSL設置完成后，打開WSL終端，并執(zhí)行以下命令安裝Redis：

sudo apt update
sudo apt install redis-server

要啟動Redis服務器，請運行：

sudo service redis-server start

你應該會看到一條確認消息，表明“redis-server”已成功啟動。

2. 模型訓練

訓練腳本可加載數(shù)據(jù)集、處理數(shù)據(jù)、訓練模型并將其保存在本地。

import joblib
import pandas as pd
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.pipeline import Pipeline

def main():
 # Load dataset
 df = pd.read_csv("data/Phishing_Email.csv") # adjust the path as necessary

 # Assume dataset has columns "text" and "label"
 X = df["Email Text"].fillna("")
 y = df["Email Type"]

 # Split the dataset into training and testing sets
 X_train, X_test, y_train, y_test = train_test_split(
 X, y, test_size=0.2, random_state=42
 )

 # Create a pipeline with TF-IDF and Logistic Regression
 pipeline = Pipeline(
 [
 ("tfidf", TfidfVectorizer(stop_words="english")),
 ("clf", LogisticRegression(solver="liblinear")),
 ]
 )

 # Train the model
 pipeline.fit(X_train, y_train)

 # Save the trained model to a file
 joblib.dump(pipeline, "phishing_model.pkl")
 print("Model trained and saved as phishing_model.pkl")

if __name__ == "__main__":
 main()


python train.py


Model trained and saved as phishing_model.pkl

3. 模型評估

評估腳本可加載數(shù)據(jù)集和保存的模型文件以執(zhí)行模型評估。

import pandas as pd
from sklearn.metrics import classification_report, accuracy_score
from sklearn.model_selection import train_test_split
import joblib

def main():
 # Load dataset
 df = pd.read_csv("data/Phishing_Email.csv") # adjust the path as necessary

 # Assume dataset has columns "text" and "label"
 X = df["Email Text"].fillna("")
 y = df["Email Type"]

 # Split the dataset
 X_train, X_test, y_train, y_test = train_test_split(
 X, y, test_size=0.2, random_state=42
 )

 # Load the trained model
 model = joblib.load("phishing_model.pkl")

 # Make predictions on the test set
 y_pred = model.predict(X_test)

 # Evaluate the model
 print("Accuracy: ", accuracy_score(y_test, y_pred))
 print("Classification Report:")
 print(classification_report(y_test, y_pred))

if __name__ == "__main__":
 main()

結果近乎完美，F1分數(shù)也非常出色。

python validate.py

Accuracy: 0.9723860589812332
Classification Report:
 precision recall   f1-score support

Phishing Email 0.96 0.97 0.96 1457
 Safe Email 0.98 0.97 0.98 2273

 accuracy 0.97 3730
 macro avg 0.97 0.97 0.97 3730
 weighted avg   0.97 0.97 0.97 3730

4. 使用Redis提供模型服務

為了提供模型服務，我們將使用FastAPI創(chuàng)建REST API，并集成Redis以緩存預測。

import asyncio
import json
import joblib
from fastapi import FastAPI
from pydantic import BaseModel
import redis.asyncio as redis

# Create an asynchronous Redis client (make sure Redis is running on localhost:6379)
redis_client = redis.Redis(host="localhost", port=6379, db=0, decode_respnotallow=True)

# Load the trained model (synchronously)
model = joblib.load("phishing_model.pkl")

app = FastAPI()

# Define the request and response data models
class PredictionRequest(BaseModel):
 text: str

class PredictionResponse(BaseModel):
 prediction: str
 probability: float

@app.post("/predict", response_model=PredictionResponse)
async def predict_email(data: PredictionRequest):
 # Use the email text as a cache key
 cache_key = f"prediction:{data.text}"
 cached = await redis_client.get(cache_key)
 if cached:
 return json.loads(cached)

 # Run model inference in a thread to avoid blocking the event loop
 pred = await asyncio.to_thread(model.predict, [data.text])
 prob = await asyncio.to_thread(lambda: model.predict_proba([data.text])[0].max())

 result = {"prediction": str(pred[0]), "probability": float(prob)}

 # Cache the result for 1 hour (3600 seconds)
 await redis_client.setex(cache_key, 3600, json.dumps(result))
 return result

if __name__ == "__main__":
 import uvicorn
 uvicorn.run(app, host="0.0.0.0", port=8000)

python serve.py

INFO: Started server process [17640]
INFO: Waiting for application startup.
INFO: Application startup complete.
INFO: Uvicorn running on http://0.0.0.0:8000 (Press CTRL+C to quit)

你可以通過訪問URL來查看REST API 文檔。

本項目的源代碼、配置文件、模型和數(shù)據(jù)集可以在kingabzpro/Redis-ml-project GitHub代碼庫中找到。如果你在運行上述代碼時遇到任何問題，可隨時參閱。

Redis緩存在機器學習應用中的工作原理

下面逐步解釋Redis緩存在我們的機器學習應用程序中的運作方式，并附加一張流程圖加以說明：

客戶程序提交輸入數(shù)據(jù)，請求機器學習模型進行預測。
系統(tǒng)根據(jù)輸入數(shù)據(jù)生成獨特的標識符，以檢查預測是否已存在。
系統(tǒng)使用生成的鍵查詢Redis緩存，以查找先前存儲的預測。

A.如果找到緩存的預測，則檢索該預測并以JSON響應的形式返回。

B.如果沒有找到緩存的預測，則將輸入數(shù)據(jù)傳遞給機器學習模型以生成新的預測。

新生成的預測存儲在Redis緩存中供將來使用。
最終結果以JSON格式返回給客戶程序。

測試網(wǎng)絡釣魚電子郵件分類應用程序

構建完網(wǎng)絡釣魚電子郵件分類應用程序后，就可以測試其功能了。我們在本節(jié)中將使用 `cURL` 命令發(fā)送多封電子郵件并分析響應來評估該應用程序。此外，我們將驗證Redis數(shù)據(jù)庫，以確保緩存系統(tǒng)正常運行。

使用CURL命令測試 API

為了測試API，我們將向`/predict`端點發(fā)送五個請求。其中三個請求包含獨特的電子郵件文本，另外兩個請求是之前發(fā)送的電子郵件的復制版本。這將使我們能夠驗證預測準確性和緩存機制。

echo "\n===== Testing API Endpoint with 5 Requests =====\n"

# First unique email
echo "\n----- Request 1 (First unique email) -----"
curl -X 'POST' \
 'http://localhost:8000/predict' \
 -H 'accept: application/json' \
 -H 'Content-Type: application/json' \
 -d '{
 "text": "todays floor meeting you may get a few pointed questions about today article about lays potential severance of $ 80 mm"
}'

# Second unique email
echo "\n\n----- Request 2 (Second unique email) -----"
curl -X 'POST' \
 'http://localhost:8000/predict' \
 -H 'accept: application/json' \
 -H 'Content-Type: application/json' \
 -d '{
 "text": "urgent action required: your account has been compromised, click here to reset your password immediately"
}'

# First duplicate (same as first email)
echo "\n\n----- Request 3 (Duplicate of first email - should be cached) -----"
curl -X 'POST' \
 'http://localhost:8000/predict' \
 -H 'accept: application/json' \
 -H 'Content-Type: application/json' \
 -d '{
 "text": "todays floor meeting you may get a few pointed questions about today article about lays potential severance of $ 80 mm"
}'

# Third unique email
echo "\n\n----- Request 4 (Third unique email) -----"
curl -X 'POST' \
 'http://localhost:8000/predict' \
 -H 'accept: application/json' \
 -H 'Content-Type: application/json' \
 -d '{
 "text": "congratulations you have won a free iphone, click here to claim your prize now before it expires"
}'

# Second duplicate (same as second email)
echo "\n\n----- Request 5 (Duplicate of second email - should be cached) -----"
curl -X 'POST' \
 'http://localhost:8000/predict' \
 -H 'accept: application/json' \
 -H 'Content-Type: application/json' \
 -d '{
 "text": "urgent action required: your account has been compromised, click here to reset your password immediately"
}'

echo "\n\n===== Test Complete =====\n"
echo "Now run 'python check_redis.py' to verify the Redis cache entries"

運行上述腳本時，API應該返回每封電子郵件的預測結果。對于重復的請求，響應應該從Redis緩存中加以檢索，以確保更快的響應時間。

sh test.sh


\n===== Testing API Endpoint with 5 Requests =====\n
\n----- Request 1 (First unique email) -----
{"prediction":"Safe Email","probability":0.7791625553383463}\n\n----- Request 2 (Second unique email) -----
{"prediction":"Phishing Email","probability":0.8895319031315131}\n\n----- Request 3 (Duplicate of first email - should be cached) -----
{"prediction":"Safe Email","probability":0.7791625553383463}\n\n----- Request 4 (Third unique email) -----
{"prediction":"Phishing Email","probability":0.9169092144856761}\n\n----- Request 5 (Duplicate of second email - should be cached) -----
{"prediction":"Phishing Email","probability":0.8895319031315131}\n\n===== Test Complete =====\n
Now run 'python check_redis.py' to verify the Redis cache entries

驗證Redis緩存

為了確認緩存系統(tǒng)正常運行，我們將使用Python腳本`check_redis.py`來檢查Redis數(shù)據(jù)庫。該腳本檢索緩存的預測結果，并將其以表格形式顯示出來。

import redis
import json
from tabulate import tabulate

def main():
 # Connect to Redis (ensure Redis is running on localhost:6379)
 redis_client = redis.Redis(host="localhost", port=6379, db=0, decode_respnotallow=True)

 # Retrieve all keys that start with "prediction:"
 keys = redis_client.keys("prediction:*")
 total_entries = len(keys)
 print(f"Total number of cached prediction entries: {total_entries}\n")

 table_data = []
 # Process only the first 5 entries
 for key in keys[:5]:
 # Remove the 'prediction:' prefix to get the original email text
 email_text = key.replace("prediction:", "", 1)

 # Retrieve the cached value
 value = redis_client.get(key)
 try:
 data = json.loads(value)
 except json.JSONDecodeError:
 data = {}

 prediction = data.get("prediction", "N/A")

 # Display only the first 7 words of the email text
 words = email_text.split()
 truncated_text = " ".join(words[:7]) + ("..." if len(words) > 7 else "")

 table_data.append([truncated_text, prediction])

 # Print table using tabulate (only two columns now)
 headers = ["Email Text (First 7 Words)", "Prediction"]
 print(tabulate(table_data, headers=headers, tablefmt="pretty"))

if __name__ == "__main__":
 main()

當你運行check_redis.py腳本時，它會以表格形式顯示緩存條目數(shù)量和已緩存的預測結果。

python check_redis.py


Total number of cached prediction entries: 3

+--------------------------------------------------+----------------+
| Email Text (First 7 Words) | Prediction | 
+--------------------------------------------------+----------------+
| congratulations you have won a free iphone,... | Phishing Email |
| urgent action required: your account has been... | Phishing Email |
| todays floor meeting you may get a... | Safe Email |
+--------------------------------------------------+----------------+

結語

通過使用多個請求測試釣魚郵件分類應用程序，我們成功地演示了該API能夠準確識別釣魚郵件，同時還能使用Redis高效地緩存重復請求。這種緩存機制通過減少重復輸入的冗余計算顯著提升了性能，這在API處理龐大流量的實際應用場景中尤其大有助益。

雖然這是一個比較簡單的機器學習模型，但在處理更龐大、更復雜的模型（比如圖像識別）時，緩存的優(yōu)勢來得更為明顯。比如說，如果你在部署一個大規(guī)模圖像分類模型，緩存頻繁處理輸入的預測結果就可以節(jié)省大量計算資源，并顯著縮短響應時間。

原文標題：Accelerate Machine Learning Model Serving with FastAPI and Redis Caching，作者：Abid Ali Awan

責任編輯：姜華來源： 51CTO

?Redis 機器學習推薦模型

自拍偷在线精品自拍偷,亚洲欧美中文日韩v在线观看不卡