vishal-adithya
/

depth-estimator

Depth Estimation

English

xgboost

python

resnet50

Model card Files Files and versions Community

vishal-adithya commited on Jan 18

Commit

904c408

verified ·

1 Parent(s): e28febc

Update README.md

Browse files

Files changed (1) hide show

README.md +18 -19

README.md CHANGED Viewed

@@ -19,23 +19,28 @@ tags:
 # Depth Estimation Using ResNet50 and XGBoost
 ## Overview
-This project demonstrates a depth estimation model that predicts the average depth of images using features extracted from a pre-trained ResNet50 model and an XGBoost regressor. The model was trained using the **NYUv2 dataset** hosted on Hugging Face ([0jl/NYUv2](https://huggingface.co/datasets/0jl/NYUv2)). The trained model is saved as `model.pkl` using Python's `pickle` library for easy deployment and reuse.
 ### Loading the Model
 The model is saved as `model.pkl` using `pickle`. You can load and use it as follows:
 ```python
-import pickle
-# Load the trained model
 with open("model.pkl", "rb") as f:
     model = pickle.load(f)
-# Example usage
-features = extract_features("path/to/image.jpg")  # Use the same feature extraction pipeline
 predicted_depth = model.predict([features])
-print("Predicted Depth:", predicted_depth[0])
 ```
-## Features
 - **Model Architecture**:
   - Feature extraction: ResNet50 (pre-trained on ImageNet, with the top layers removed and global average pooling).
   - Regression: XGBoost, optimized for structured data prediction.
@@ -47,7 +52,7 @@ print("Predicted Depth:", predicted_depth[0])
 - Format: The dataset includes RGB images and corresponding depth maps.
 - Preprocessing:
   - Images were resized to 224x224 pixels to match the input requirements of ResNet50.
-  - Depth maps were converted into single average depth values for each image by taking the mean of the depth map.
 ## Model Training
 1. **Feature Extraction**:
@@ -58,12 +63,12 @@ print("Predicted Depth:", predicted_depth[0])
    - Hyperparameters were tuned using cross-validation techniques for optimal performance.
 ## Results
-- **R² Score**: 0.841.
 - Performance is reasonable for a first few implementation and can be further improved with additional tuning or by improving feature extraction methods.
 ## How to Use
 ### Requirements
-1. Python 3.8+
 2. Required libraries:
    - `numpy`
    - `pickle`
@@ -72,7 +77,6 @@ print("Predicted Depth:", predicted_depth[0])
    - `tensorflow`
    - 'scikit-learn'
 Install the dependencies using pip:
 ```bash
 pip install numpy tensorflow xgboost pickle-mixin datasets scikit-learn
@@ -80,7 +84,7 @@ pip install numpy tensorflow xgboost pickle-mixin datasets scikit-learn
 ### Training Pipeline
 If you want to retrain the model, follow these steps:
-NOTE: This pipeline has just the base fundamental code more additional parameter tunings and preprocessing steps were being conducted during the training of the original model
 1. Download the **NYUv2 dataset** from Hugging Face:
    ```python
    from datasets import load_dataset
@@ -106,12 +110,7 @@ NOTE: This pipeline has just the base fundamental code more additional parameter
    with open("model.pkl", "wb") as f:
        pickle.dump(regressor, f)
    ```
-## License
-This project is licensed under the Apache License 2.0.
-## Author
-**Vishal Adithya.A**
 ## Acknowledgments
 - Hugging Face for hosting the NYUv2 dataset.

 # Depth Estimation Using ResNet50 and XGBoost
 ## Overview
+This project demonstrates a depth estimation XgBoost Regressor model that predicts the average depth of images provided using features extracted from a pre-trained ResNet50 model.The model was trained upon the **NYUv2 dataset** ([0jl/NYUv2](https://huggingface.co/datasets/0jl/NYUv2)). The trained model is saved as using Python's `pickle` library for easy deployment and reuse.
+## License
+This project is licensed under the Apache License 2.0.
+## Author
+**Vishal Adithya.A**
 ### Loading the Model
 The model is saved as `model.pkl` using `pickle`. You can load and use it as follows:
 ```python
 with open("model.pkl", "rb") as f:
     model = pickle.load(f)
+features = extract_features("path/to/image.jpg")
 predicted_depth = model.predict([features])
+print(predicted_depth[0])
 ```
+extract_features() is a predefined function in the original code which uses ResNet50 to extract features out of the image
+## Key Features
 - **Model Architecture**:
   - Feature extraction: ResNet50 (pre-trained on ImageNet, with the top layers removed and global average pooling).
   - Regression: XGBoost, optimized for structured data prediction.
 - Format: The dataset includes RGB images and corresponding depth maps.
 - Preprocessing:
   - Images were resized to 224x224 pixels to match the input requirements of ResNet50.
+  - Depth maps were converted into single average depth values.
 ## Model Training
 1. **Feature Extraction**:
    - Hyperparameters were tuned using cross-validation techniques for optimal performance.
 ## Results
+- **R² Score**: 0.841
 - Performance is reasonable for a first few implementation and can be further improved with additional tuning or by improving feature extraction methods.
 ## How to Use
 ### Requirements
+1. Python 3.10+
 2. Required libraries:
    - `numpy`
    - `pickle`
    - `tensorflow`
    - 'scikit-learn'
 Install the dependencies using pip:
 ```bash
 pip install numpy tensorflow xgboost pickle-mixin datasets scikit-learn
 ### Training Pipeline
 If you want to retrain the model, follow these steps:
 1. Download the **NYUv2 dataset** from Hugging Face:
    ```python
    from datasets import load_dataset
    with open("model.pkl", "wb") as f:
        pickle.dump(regressor, f)
    ```
+NOTE: This pipeline has just the base fundamental code more additional parameter tunings and preprocessing steps were being conducted during the training of the original model
 ## Acknowledgments
 - Hugging Face for hosting the NYUv2 dataset.