Spaces: ronakreddy18/Zerotoheroinmachinelearning

ronakreddy18 committed on Dec 19, 2024

Commit b4f594e (verified) · 1 Parent(s): 887e319

Update pages/LIFE_CYCLE_OF_MACHINE_LEARNING.py

Files changed (1)

pages/LIFE_CYCLE_OF_MACHINE_LEARNING.py +68 -173


@@ -130,12 +130,10 @@ print(excel_file.sheet_names)
     st.markdown('[Jupyter Notebook](https://colab.research.google.com/drive/1Dv68m9hcRzXsLRlRit0uZc-8CB8U6VV3?usp=sharing)')
     if st.button("Back to Structured Data"):
         st.session_state.page = "structured_data"
 # ----------------- Unstructured Data Page -----------------
 def unstructured_data_page():
     st.title(":blue[Unstructured Data]")
@@ -147,23 +145,34 @@ def unstructured_data_page():
     - Social media posts
     """)
     # Button to Navigate to Introduction to Image
     if st.button("Introduction to Image"):
         st.session_state.page = "introduction_to_image"
 def image():
-    st.header("🖼️ Handling Image Data")
     st.markdown("""
-    Image data can be processed using libraries like OpenCV and PIL (Pillow). Images often need to be preprocessed for tasks like analysis, classification, or feature extraction. Common operations include:
-Reading and displaying images
-Converting to grayscale
-Resizing and cropping
-Rotating and flipping
-Applying filters
-Edge detection and other transformations
 """)
     st.code("""
@@ -171,57 +180,46 @@ from PIL import Image
 import numpy as np
 import matplotlib.pyplot as plt
-Open an image file
 image = Image.open('sample_image.jpg')
 image.show()
-Convert image to grayscale
 gray_image = image.convert('L')
 gray_image.show()
-Resize the image
 resized_image = image.resize((200, 200))
 resized_image.show()
-Rotate the image by 90 degrees
 rotated_image = image.rotate(90)
 rotated_image.show()
-Convert the image to a NumPy array and display its shape
 image_array = np.array(image)
 print(image_array.shape)
-Display the image array as a plot
 plt.imshow(image)
 plt.title("Original Image")
 plt.axis('off')
 plt.show()
     """, language='python')
-    st.markdown("""
-    Common Image Processing Techniques:
-Resizing: Adjust the dimensions of an image for uniformity in models.
-Cropping: Extract a region of interest (ROI) from an image.
-Grayscale Conversion: Simplify image data by reducing it to a single channel.
-Rotation/Flipping: Perform augmentations to increase the dataset for model training.
-Edge Detection: Identify edges in images using filters like the Sobel or Canny filters.
-""")
-    ### Challenges and Solutions Section
-    st.markdown("### Challenges with Unstructured Data")
-    st.write("""
-Noise and Inconsistency: Data is often incomplete or noisy.
-Storage Requirements: Large size and variability in data types.
-Processing Time: Analyzing unstructured data is computationally expensive.
-""")
-    st.markdown("### Solutions")
-    st.write("""
-Data Cleaning: Preprocess data to remove noise.
-Efficient Storage: Use NoSQL databases (e.g., MongoDB) or cloud storage.
-Parallel Processing: Utilize frameworks like Apache Spark.
 """)
     # Navigation Button
@@ -255,7 +253,7 @@ def json_page():
     st.write("### What is JSON?")
     st.write("""
-    JSON (JavaScript Object Notation) is a lightweight data-interchange format that's easy for humans to read and write, and easy for machines to parse and generate. JSON is often used in APIs, configuration files,       and data transfer applications.
     """)
     st.write("### Reading JSON Files")
@@ -274,8 +272,7 @@ import json
 data = {
     "name": "Alice",
     "age": 25,
-    "skills
-: ["Python", "Machine Learning"]
 }
 with open('data.json', 'w') as file:
     json.dump(data, file, indent=4)
@@ -288,133 +285,31 @@ with open('data.json', 'w') as file:
     - JSON supports both strings and numbers, and other types like arrays and booleans, making it versatile for various data types.
     """)
-    st.markdown('[Jupyter Notebook](https://huggingface.co/spaces/ronakreddy18/Zerotoheroinmachinelearning/blob/main/pages/json_file__handling.ipynb)')
-    if st.button("Back to Semi-Structured Data"):
-        st.session_state.page = "semi_structured_data"
-# ----------------- CSV Data Page -----------------
-def csv_page():
-    st.title(":green[CSV Data Format]")
-    st.write("### What is CSV?")
-    st.write("""
-    CSV (Comma-Separated Values) files store tabular data in plain text, where each line is a data record and columns are separated by commas.
-    """)
-    st.write("### Reading CSV Files")
-    st.code("""
-import pandas as pd
-# Read a CSV file
-df = pd.read_csv('data.csv')
-print(df)
-    """, language='python')
-    st.write("### Error Handling for CSV Files")
-    st.code("""
-import pandas as pd
-try:
-    df = pd.read_csv('data.csv', encoding='utf-8', delimiter=',')
-    print("CSV File Loaded Successfully!")
-    print(df)
-except FileNotFoundError:
-    print("Error: File not found. Please check the file path.")
-except pd.errors.ParserError:
-    print("Error: The file is not a valid CSV format.")
-except UnicodeDecodeError:
-    print("Error: Encoding issue. Try specifying a different encoding like 'latin1' or 'utf-8'.")
-    """, language='python')
-    st.markdown('[Jupyter Notebook](https://huggingface.co/spaces/ronakreddy18/Zerotoheroinmachinelearning/blob/main/pages/CSV_HANDLING_GUIDE.ipynb)')
-    if st.button("Back to Semi-Structured Data"):
-        st.session_state.page = "semi_structured_data"
-# ----------------- XML Data Page -----------------
-def xml_page():
-    st.title(":green[XML Data Format]")
-    st.write("### What is XML?")
-    st.write("""
-    XML (Extensible Markup Language) is a markup language used for storing and exchanging structured data. It uses a hierarchical structure with tags to define elements.
-    """)
-    st.write("### Reading XML Files")
-    st.code("""
-import xml.etree.ElementTree as ET
-# Load and parse an XML file
-tree = ET.parse('data.xml')
-root = tree.getroot()
-# Access elements
-for child in root:
-    print(child.tag, child.text)
-    """, language='python')
-    st.write("### Sample XML Data")
-    st.code("""
-<company>
-    <employee>
-        <name>John Doe</name>
-        <role>Developer</role>
-    </employee>
-    <employee>
-        <name>Jane Smith</name>
-        <role>Manager</role>
-    </employee>
-</company>
-    """, language='xml')
-    st.write("### Issues Encountered")
-    st.write("""
-    - **File not found**: The specified XML file path is incorrect.
-    - **Malformed XML**: The XML structure has syntax errors.
-    - **XPath Errors**: Incorrect XPath expressions when querying data.
-    """)
-    st.write("### Solutions to These Issues")
-    st.code("""
-# Handle missing file
-try:
-    tree = ET.parse('data.xml')
-except FileNotFoundError:
-    print("File not found. Check the file path.")
-# Validate XML structure
-try:
-    root = ET.fromstring(xml_data)
-except ET.ParseError:
-    print("Malformed XML.")
-    """, language='python')
-    st.markdown('[Jupyter Notebook](https://huggingface.co/spaces/ronakreddy18/Zerotoheroinmachinelearning/blob/main/pages/XML_FILE_HANDLING.ipynb)')
-    # Back to Semi-Structured Data
     if st.button("Back to Semi-Structured Data"):
         st.session_state.page = "semi_structured_data"
-# Main control to call appropriate page
-if st.session_state.page == "home":
-    home_page()
-elif st.session_state.page == "data_collection":
-    data_collection_page()
-elif st.session_state.page == "structured_data":
-    structured_data_page()
-elif st.session_state.page == "excel":
-    excel_page()
-elif st.session_state.page == "csv":
-    csv_page()
-elif st.session_state.page == "json":
-    json_page()
-elif st.session_state.page == "unstructured_data":
-    unstructured_data_page()
-elif st.session_state.page == "semi_structured_data":
-    semi_structured_data_page()
-elif st.session_state.page == "xml":
-    xml_page()
-elif st.session_state.page == "introduction_to_image":
-    image()

     st.markdown('[Jupyter Notebook](https://colab.research.google.com/drive/1Dv68m9hcRzXsLRlRit0uZc-8CB8U6VV3?usp=sharing)')
     if st.button("Back to Structured Data"):
         st.session_state.page = "structured_data"
 # ----------------- Unstructured Data Page -----------------
 def unstructured_data_page():
     st.title(":blue[Unstructured Data]")
     - Social media posts
     """)
     # Button to Navigate to Introduction to Image
     if st.button("Introduction to Image"):
         st.session_state.page = "introduction_to_image"
 def image():
+    st.header("🖼️ What Is an Image?")
     st.markdown("""
+   An image is a two-dimensional visual representation of objects, people, scenes, or concepts. It can be captured with devices such as cameras and scanners, or created digitally. Images are composed of individual units called pixels, which contain information about brightness and color.
+
+**Types of Images:**
+- **Raster Images (Bitmap)**: Composed of a grid of pixels. Common formats include:
+    - JPEG
+    - PNG
+    - GIF
+- **Vector Images**: Defined by mathematical equations and geometric shapes like lines and curves. Common format:
+    - SVG (Scalable Vector Graphics)
+- **3D Images**: Represent objects or scenes in three dimensions, often used for rendering and modeling.
+
+**Image Representation:**
+- **Grayscale Image**: Each pixel has a single intensity value, typically ranging from 0 (black) to 255 (white), representing different shades of gray.
+- **Color Image**: Usually represented in the RGB color space, where each pixel consists of three values indicating the intensity of Red, Green, and Blue.
+
+**Applications of Images:**
+- **Photography & Visual Media**: Capturing moments and storytelling.
+- **Medical Imaging**: Diagnosing conditions using X-rays, MRIs, etc.
+- **Machine Learning & AI**: Tasks like image classification, object detection, and facial recognition.
+- **Remote Sensing**: Analyzing geographic and environmental data using satellite imagery.
+- **Graphic Design & Art**: Producing visual content for marketing and design.
 """)
     st.code("""
 import numpy as np
 import matplotlib.pyplot as plt
+# Open an image file
 image = Image.open('sample_image.jpg')
 image.show()
+# Convert image to grayscale
 gray_image = image.convert('L')
 gray_image.show()
+# Resize the image
 resized_image = image.resize((200, 200))
 resized_image.show()
+# Rotate the image by 90 degrees
 rotated_image = image.rotate(90)
 rotated_image.show()
+# Convert the image to a NumPy array and display its shape
 image_array = np.array(image)
 print(image_array.shape)
+# Display the image array as a plot
 plt.imshow(image)
 plt.title("Original Image")
 plt.axis('off')
 plt.show()
     """, language='python')
+    st.markdown("""
+### Color Spaces in Machine Learning
+
+A color space is a mathematical model for representing colors. In machine learning, the choice of color space for preprocessing and analysis depends on the task.
+
+**Common Color Spaces:**
+- **RGB (Red, Green, Blue)**: The most common color space for digital images. Each pixel is represented by a combination of three values corresponding to the red, green, and blue channels.
+  - **Use Cases**: Image classification, general-purpose image analysis.
+- **HSV (Hue, Saturation, Value)**: Separates color information (hue) from intensity (value), making it useful for tasks where distinguishing between color variations and intensity is important.
+  - **Use Cases**: Color-based object detection, image segmentation, color tracking.
+- **CMYK (Cyan, Magenta, Yellow, Black)**: Used primarily for printing and rarely in machine learning; useful when preparing images for print.
+  - **Use Cases**: Printing applications.
+- **LAB (Lightness, A, B)**: Designed to be approximately perceptually uniform, meaning that equal numerical distances in the space correspond to roughly equal perceived color differences.
+  - **Use Cases**: Color correction, image processing tasks requiring color consistency.
 """)
     # Navigation Button
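
Note on the color-space text in this hunk: the claim that HSV separates color information (hue) from intensity (value) can be checked with a few lines of standard-library Python. The following is a minimal sketch, not part of the commit; `rgb8_to_hsv` is a hypothetical helper name, and it assumes 8-bit RGB input as described in the grayscale and RGB bullets.

```python
import colorsys


def rgb8_to_hsv(r, g, b):
    """Convert 8-bit RGB (0-255 per channel) to HSV.

    Hypothetical helper for illustration. Returns hue in degrees
    [0, 360) and saturation/value as floats in [0, 1].
    """
    # colorsys works on floats in [0, 1], so scale the 8-bit channels first.
    h, s, v = colorsys.rgb_to_hsv(r / 255.0, g / 255.0, b / 255.0)
    return h * 360.0, s, v


# A bright red and a dark red: same hue and saturation, different value.
bright_red = rgb8_to_hsv(255, 0, 0)
dark_red = rgb8_to_hsv(128, 0, 0)
pure_green = rgb8_to_hsv(0, 255, 0)

print("bright red:", bright_red)
print("dark red:  ", dark_red)
print("pure green:", pure_green)
```

Because brightness lives only in the value channel, a color-based detector can threshold on hue and stay fairly robust to lighting changes, which is the use case the HSV bullet describes.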
     st.write("### What is JSON?")
     st.write("""
+    JSON (JavaScript Object Notation) is a lightweight data-interchange format that's easy for humans to read and write, and easy for machines to parse and generate. JSON is often used in APIs, configuration files, and data transfer applications.
     """)
     st.write("### Reading JSON Files")
 data = {
     "name": "Alice",
     "age": 25,
+    "skills": ["Python", "Machine Learning"]
 }
 with open('data.json', 'w') as file:
     json.dump(data, file, indent=4)
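
Note on the JSON example in this hunk: the page shows writing with `json.dump`, and a reader will usually want to read the file back and handle the two common failures (missing file, malformed JSON). The following is a minimal sketch under those assumptions; `load_json` is a hypothetical helper, not something defined in the app, and the temporary directory is only there so the example does not touch real files.

```python
import json
import os
import tempfile


def load_json(path):
    """Return the parsed JSON document at `path`, or None if it cannot be read.

    Hypothetical helper for illustration only.
    """
    try:
        with open(path, "r", encoding="utf-8") as file:
            return json.load(file)
    except FileNotFoundError:
        print(f"File not found: {path}")
    except json.JSONDecodeError as err:
        print(f"Invalid JSON in {path}: {err}")
    return None


data = {"name": "Alice", "age": 25, "skills": ["Python", "Machine Learning"]}

with tempfile.TemporaryDirectory() as tmp_dir:
    path = os.path.join(tmp_dir, "data.json")

    # Write the dictionary, then read it back to confirm the round trip.
    with open(path, "w", encoding="utf-8") as file:
        json.dump(data, file, indent=4)
    loaded = load_json(path)

    # A path that does not exist exercises the FileNotFoundError branch.
    missing = load_json(os.path.join(tmp_dir, "missing.json"))

print(loaded)
```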
     - JSON supports both strings and numbers, and other types like arrays and booleans, making it versatile for various data types.
     """)
+    st.markdown('[Jupyter Notebook](https://huggingface.co/transformers/notebooks.html)')
     if st.button("Back to Semi-Structured Data"):
         st.session_state.page = "semi_structured_data"
+# ----------------- Main Execution -----------------
+def main():
+    page = st.session_state.page
+    if page == "home":
+        home_page()
+    elif page == "data_collection":
+        data_collection_page()
+    elif page == "structured_data":
+        structured_data_page()
+    elif page == "excel":
+        excel_page()
+    elif page == "unstructured_data":
+        unstructured_data_page()
+    elif page == "semi_structured_data":
+        semi_structured_data_page()
+    elif page == "json":
+        json_page()
+    elif page == "introduction_to_image":
+        image()
+if __name__ == "__main__":
+    main()
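
Note on the routing in `main()`: an `if`/`elif` chain only reaches a page when the string it checks matches the string a button stores in `st.session_state.page` (for example, the "Introduction to Image" button stores `"introduction_to_image"`). A dictionary-based dispatch keeps each key in one place and gives unknown keys a visible fallback. The following is a minimal sketch, not the author's implementation: `make_router` and `PAGES` are hypothetical names, and the handlers are stand-ins that return strings so the example runs without Streamlit.

```python
def make_router(pages, default="home"):
    """Return a function that calls the handler registered for a page key."""

    def route(page):
        handler = pages.get(page)
        if handler is None:
            # Unknown key: fall back to the default page instead of
            # silently rendering nothing.
            handler = pages[default]
        return handler()

    return route


# Stand-in handlers; in the app these would be home_page(), image(), etc.
def home_page():
    return "home"


def image():
    return "image"


PAGES = {
    "home": home_page,
    # Same key the "Introduction to Image" button writes to session state.
    "introduction_to_image": image,
}

route = make_router(PAGES)
print(route("introduction_to_image"))
print(route("image"))  # not a registered key, so this falls back to home
```

In the app itself, `route(st.session_state.page)` would take the place of the chain, assuming every page function is registered in the dictionary.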