danhtran2mind commited on
Commit
413e633
·
verified ·
1 Parent(s): 9d59b6c

Upload README.md

Browse files
Files changed (1) hide show
  1. dataset/README.md +42 -0
dataset/README.md ADDED
@@ -0,0 +1,42 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ```markdown
2
+ # Landscape Pictures Dataset Processing
3
+
4
+ [![Kaggle](https://img.shields.io/badge/Dataset-Kaggle-blue.svg)](https://www.kaggle.com/datasets/arnaud58/landscape-pictures)
5
+
6
+ This README provides instructions for downloading, extracting, and processing the landscape pictures dataset from Kaggle.
7
+
8
+ ## Dataset Source
9
+
10
+ The dataset is sourced from Kaggle: Landscape Pictures by Arnaud58. Follow this link: [Kaggle Dataset](https://www.kaggle.com/datasets/arnaud58/landscape-pictures)
11
+
12
+ ## Setup
13
+
14
+ 1. **Create a Dataset Directory**: Create a directory to store the dataset:
15
+
16
+ ```python
17
+ import os
18
+
19
+ ds_path = "./dataset/landscape-pictures"
20
+ os.makedirs(ds_path, exist_ok=True)
21
+ ```
22
+
23
+ 2. **Download the Dataset**: Use the following command to download the dataset from Kaggle:
24
+
25
+ ```bash
26
+ curl -L https://www.kaggle.com/api/v1/datasets/download/arnaud58/landscape-pictures -o ./dataset/landscape-pictures.zip
27
+ ```
28
+
29
+ Note: You may need a Kaggle API token for authentication. Ensure you have the `kaggle.json` file configured in `~/.kaggle/` or set up the Kaggle API as per Kaggle's API documentation.
30
+
31
+ 3. **Extract the Dataset**: Run the following Python code to extract the downloaded zip file:
32
+
33
+ ```python
34
+ import zipfile
35
+ import os
36
+
37
+ with zipfile.ZipFile('dataset/landscape-pictures.zip', 'r') as zip_ref:
38
+ zip_ref.extractall(ds_path)
39
+ ```
40
+
41
+ This will extract the dataset into the `./dataset` directory.
42
+ ```