capradeepgujaran commited on
Commit
b1c79c4
·
verified ·
1 Parent(s): f6a3081

Upload README.md

Browse files
Files changed (1) hide show
  1. README.md +125 -54
README.md CHANGED
@@ -1,85 +1,156 @@
1
- ---
2
- license: mit
3
- title: Construction Image Analyzer with Chat
4
- sdk: gradio
5
- emoji: 🏢
6
- colorFrom: blue
7
- colorTo: green
8
- short_description: AI Powered Snaging Tool
9
- sdk_version: 5.1.0
10
- ---
11
- ## Construction Image Analyzer with Chat
12
 
13
  ### Overview
14
- This project is a web application built using Gradio that allows users to upload construction site images and receive automated analysis and suggestions for improvement. The application utilizes the Groq AI platform to process and interpret the images, providing detailed insights and actionable steps.
 
15
 
16
  ### Features
17
- 1. **Image Upload**: Users can upload construction site images directly through the web interface.
18
- 2. **Automated Analysis**: The application uses advanced machine learning models to identify potential issues within the images, categorize them, and provide descriptions along with step-by-step solutions.
19
- 3. **Interactive Chat**: Users can ask follow-up questions based on the initial analysis, and receive real-time responses from the AI assistant.
20
- 4. **Logging and Error Handling**: Comprehensive logging and error handling ensure robust operation and easy troubleshooting.
 
21
 
22
  ### Installation
23
- To run this application, you need to have Python installed on your system. Follow these steps to install the required dependencies:
 
24
 
25
  ```bash
26
- pip install gradio pillow base64 io json groq logging
27
  ```
28
 
29
- Additionally, you will need to obtain a `GROQ_API_KEY` from the Groq platform and set it as an environment variable:
 
 
30
 
31
  ```bash
32
  export GROQ_API_KEY=your_groq_api_key_here
33
  ```
34
 
35
- ### Usage
36
- 1. **Launch the Application**:
37
- Run the main script to start the Gradio interface:
38
 
39
- ```bash
40
- python your_script_name.py
41
- ```
42
 
43
- 2. **Access the Interface**:
44
- Open a web browser and navigate to `http://localhost:7860`. You should see the Construction Image Analyzer interface.
 
45
 
46
- 3. **Upload an Image**:
47
- - Click on the "Upload Construction Image" button.
48
- - Select or drag-and-drop an image file from your local machine.
49
 
50
- 4. **Receive Analysis**:
51
- - Once the image is uploaded, click the "Analyze Image" button.
52
- - The application will display the analyzed results, including identified issues, categories, and suggested steps.
53
 
54
- 5. **Interact via Chat**:
55
- - Use the text box below the chatbot to ask additional questions related to the image analysis.
56
- - The AI assistant will respond in real-time with relevant information.
57
 
58
- ### Code Explanation
59
 
60
- #### Imports
61
- The code imports necessary libraries and modules, including Gradio for the web interface, Pillow for image processing, and Groq for AI inference.
 
 
 
 
 
 
 
62
 
63
- #### Logging Setup
64
- Logging is configured to capture debug and error messages, which helps in monitoring and debugging the application.
 
 
65
 
66
- #### Environment Variable Check
67
- The script checks if the `GROQ_API_KEY` is set in the environment variables. If not, it logs an error and raises a `ValueError`.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
68
 
69
- #### Image Encoding Function
70
- The `encode_image` function converts an image (either a file path or a PIL Image object) to a Base64-encoded string.
71
 
72
- #### Image Analysis Function
73
- The `analyze_construction_image` function handles the image analysis process. It encodes the image, sends it to the Groq API for analysis, and processes the response to extract and format the results.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
74
 
75
- #### Chat Function
76
- The `chat_about_image` function enables interactive chatting with the AI assistant. It constructs a conversation history and makes API calls to receive responses.
77
 
78
- #### Gradio Interface
79
- The Gradio interface is created with components for uploading images, displaying analysis results, and interacting via chat. Events are linked to the corresponding functions to handle user actions.
80
 
81
- #### Main Execution
82
- The script launches the Gradio interface when executed directly.
 
 
83
 
84
  ### Conclusion
85
- This project provides a powerful tool for construction site managers and inspectors to quickly assess and improve the quality of construction sites. By leveraging AI technology, it simplifies the inspection process and ensures timely resolution of potential issues.
 
 
1
+ ## 🏗️ Construction Site Safety Analyzer
2
+
3
+ Enhance workplace safety and compliance with AI-powered image analysis using Llama 3.2 90B Vision and expert chat assistance.
 
 
 
 
 
 
 
 
4
 
5
  ### Overview
6
+
7
+ This application leverages the power of Groq's AI models to analyze construction site images, identify safety issues, and provide detailed descriptions along with recommended resolutions. The Gradio interface allows users to upload images and interact with the system through a chatbot, making it easy to ask questions and get answers regarding safety measures and regulations.
8
 
9
  ### Features
10
+
11
+ - **Image Upload**: Users can upload construction site images either as a file path or a PIL Image object.
12
+ - **AI Analysis**: The application uses advanced computer vision techniques to detect and categorize safety issues within the images.
13
+ - **Chat Interaction**: A built-in chatbot provides detailed responses to questions related to the analyzed images, offering insights and guidance on safety measures and regulations.
14
+ - **Custom CSS**: The interface is styled with custom CSS to enhance readability and usability.
15
 
16
  ### Installation
17
+
18
+ To run this application, you need to have Python installed on your system. Additionally, ensure that you have the required dependencies installed. You can install these dependencies using pip:
19
 
20
  ```bash
21
+ pip install gradio pillow groq-python
22
  ```
23
 
24
+ ### Environment Variables
25
+
26
+ Before running the application, make sure to set the `GROQ_API_KEY` environment variable. This key is necessary for authenticating requests to the Groq API.
27
 
28
  ```bash
29
  export GROQ_API_KEY=your_groq_api_key_here
30
  ```
31
 
32
+ ### Running the Application
 
 
33
 
34
+ To launch the application, simply execute the main script:
 
 
35
 
36
+ ```bash
37
+ python main.py
38
+ ```
39
 
40
+ The application will start a local server, and you can access it via your web browser at `http://localhost:7860`.
 
 
41
 
42
+ ### Code Explanation
 
 
43
 
44
+ #### Imports and Setup
 
 
45
 
46
+ The code begins by importing necessary libraries and setting up logging for debugging purposes. It also loads the `GROQ_API_KEY` from the environment variables and initializes a Groq client.
47
 
48
+ ```python
49
+ import os
50
+ import base64
51
+ import gradio as gr
52
+ from PIL import Image
53
+ import io
54
+ import json
55
+ from groq import Groq
56
+ import logging
57
 
58
+ # Set up logging
59
+ logging.basicConfig(level=logging.DEBUG)
60
+ logger = logging.getLogger(__name__)
61
+ ```
62
 
63
+ #### Helper Functions
64
+
65
+ The `encode_image` function encodes an image into a base64 string, which is useful for sending images over APIs. The `analyze_construction_image` function handles the main logic of image analysis, including sending requests to the Groq API and processing the results.
66
+
67
+ ```python
68
+ def encode_image(image):
69
+ try:
70
+ if isinstance(image, str):
71
+ with open(image, "rb") as image_file:
72
+ return base64.b64encode(image_file.read()).decode('utf-8')
73
+ elif isinstance(image, Image.Image):
74
+ buffered = io.BytesIO()
75
+ image.save(buffered, format="PNG")
76
+ return base64.b64encode(buffered.getvalue()).decode('utf-8')
77
+ else:
78
+ raise ValueError(f"Unsupported image type: {type(image)}")
79
+ except Exception as e:
80
+ logger.error(f"Error encoding image: {str(e)}")
81
+ raise
82
+
83
+ def analyze_construction_image(image):
84
+ if image is None:
85
+ logger.warning("No image provided")
86
+ return [("No image uploaded", "Error: Please upload an image for analysis.")]
87
+ ...
88
+ ```
89
 
90
+ #### Gradio Interface
 
91
 
92
+ The Gradio interface is created using the `gr.Blocks` context manager. It includes a form for uploading images, a button to trigger analysis, and a chatbot for displaying results and handling user queries.
93
+
94
+ ```python
95
+ with gr.Blocks(css=custom_css, theme=gr.themes.Soft()) as iface:
96
+ gr.HTML(
97
+ """
98
+ <div class="container">
99
+ <div class="header">
100
+ <h1>🏗️ Construction Site Safety Analyzer</h1>
101
+ </div>
102
+ <p class="subheader">Enhance workplace safety and compliance with AI-powered image analysis using Llama 3.2 90B Vision and expert chat assistance.</p>
103
+ </div>
104
+ """
105
+ )
106
+
107
+ with gr.Row():
108
+ with gr.Column(scale=1):
109
+ image_input = gr.Image(type="pil", label="Upload Construction Site Image", elem_classes="image-container")
110
+ analyze_button = gr.Button("🔍 Analyze Safety Hazards", elem_classes="analyze-button")
111
+ with gr.Column(scale=2):
112
+ with gr.Group(elem_classes="chat-container"):
113
+ chatbot = gr.Chatbot(label="Safety Analysis Results and Expert Chat", elem_classes="chatbot")
114
+ with gr.Row(elem_classes="input-row"):
115
+ msg = gr.Textbox(
116
+ label="Ask about safety measures or regulations",
117
+ placeholder="E.g., 'What OSHA guidelines apply to this hazard?'",
118
+ show_label=False,
119
+ elem_classes="chat-input"
120
+ )
121
+ clear = gr.Button("🗑️ Clear", elem_classes="clear-button")
122
+
123
+ def update_chat(history, new_message):
124
+ history = history or []
125
+ history.append(new_message)
126
+ return history
127
+
128
+ analyze_button.click(
129
+ analyze_construction_image,
130
+ inputs=[image_input],
131
+ outputs=[chatbot],
132
+ postprocess=lambda x: update_chat(chatbot.value, x[0])
133
+ )
134
+
135
+ msg.submit(chat_about_image, [msg, chatbot], [msg, chatbot])
136
+ clear.click(lambda: None, None, chatbot, queue=False)
137
+
138
+ gr.HTML(
139
+ """
140
+ <div class="groq-badge">Powered by Groq</div>
141
+ """
142
+ )
143
+ ```
144
 
145
+ #### Launching the App
 
146
 
147
+ Finally, the application is launched with the `iface.launch()` method, which starts a local development server.
 
148
 
149
+ ```python
150
+ if __name__ == "__main__":
151
+ iface.launch(debug=True)
152
+ ```
153
 
154
  ### Conclusion
155
+
156
+ This Construction Site Safety Analyzer is a powerful tool for enhancing workplace safety and compliance through AI-driven image analysis and expert chat assistance. With its intuitive interface and advanced capabilities, it helps users quickly identify and address potential hazards on construction sites.