Model Metadata for Hugging Face
Add this YAML frontmatter to the top of your README.md on Hugging Face to improve discoverability:
---
language:
- en
tags:
- tabular-regression
- regression
- scikit-learn
- sklearn
- random-forest
- house-prices
- california-housing
- real-estate
- price-prediction
- tabular
license: mit
datasets:
- california-housing
metrics:
- rmse
- mean_squared_error
library_name: sklearn
pipeline_tag: tabular-regression
widget:
- text: "Example: Predict house price for NEAR BAY location with median income $83,252"
---
Model Card Sections (Already included in README.md)
Your README.md already includes:
- β Model description and overview
- β Use cases
- β Installation instructions
- β Quick start examples
- β Input feature documentation
- β Training details and evaluation
- β Requirements
- β License information
- β Author information
Additional Metadata You Can Add
Model Performance Metrics
Add these details if you have them:
- Training RMSE: ~$49,000 (10-fold cross-validation)
- Test RMSE: ~$47,000-49,000
- RΒ² Score: [Add if calculated]
- MAE (Mean Absolute Error): [Add if calculated]
Training Information
- Training time: ~30 minutes (on standard CPU)
- Number of trees: 30
- Max features: 8
- Training samples: 16,512
- Test samples: 4,128
Limitations
Already mentioned:
- Model trained on 1990 census data
- Limited to California housing market
- May not generalize to other regions/time periods
Bias and Fairness Considerations
Consider adding:
- Geographic bias (California-specific)
- Temporal bias (1990 data)
- Income-based predictions may reinforce existing patterns
Files Included in Repository
house-price-prediction/
βββ house_price_model.joblib # Trained Random Forest model (41.4 MB)
βββ preprocessing_pipeline.joblib # Scikit-learn preprocessing pipeline (3.8 KB)
βββ inference.py # Python API for model inference (8.3 KB)
βββ example_usage.py # Example usage scripts (6.1 KB)
βββ test_deployment.py # Deployment readiness tests (8.1 KB)
βββ housepriceprediction.ipynb # Training notebook with Gradio demo
βββ housing.csv # Original dataset
βββ README.md # Model card (comprehensive documentation)
βββ requirements.txt # Python dependencies
βββ LICENSE # MIT License
βββ .gitattributes # Git LFS configuration
βββ DEPLOYMENT_GUIDE.md # Step-by-step deployment guide
βββ MODEL_METADATA.md # This file (metadata reference)
Recommended Tags for Hugging Face
Primary tags:
tabular-regressionscikit-learnrandom-forest
Domain tags:
house-pricesreal-estatecalifornia-housing
Technical tags:
sklearnjoblibregressiontabular
Example API Usage for Hugging Face
When users want to use your model from Hugging Face:
from huggingface_hub import hf_hub_download
import joblib
# Download files
model_file = hf_hub_download(repo_id="YOUR_USERNAME/house-price-prediction",
filename="house_price_model.joblib")
pipeline_file = hf_hub_download(repo_id="YOUR_USERNAME/house-price-prediction",
filename="preprocessing_pipeline.joblib")
# Load model
model = joblib.load(model_file)
pipeline = joblib.load(pipeline_file)
Or clone the entire repo:
git clone https://huggingface.co/YOUR_USERNAME/house-price-prediction
cd house-price-prediction
python inference.py
Citation
If someone uses your model, they can cite it as:
@misc{california-house-price-prediction,
author = {nitish-niraj},
title = {California House Price Prediction Model},
year = {2025},
publisher = {Hugging Face},
journal = {Hugging Face Model Hub},
howpublished = {\url{https://huggingface.co/nitish-niraj/house-price-prediction}},
}