Spaces:
Runtime error
Runtime error
adding google drive service readme
Browse files
src/ctp_slack_bot/services/GOOGLE_DRIVE_README.md
ADDED
|
@@ -0,0 +1,228 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
# Google Drive Access Module
|
| 2 |
+
|
| 3 |
+
This Python module provides a simplified way to interact with Google Drive, focusing on easy access to files in nested folders using path-like syntax. It handles various Google file formats and provides comprehensive metadata for files and folders.
|
| 4 |
+
|
| 5 |
+
## Features
|
| 6 |
+
|
| 7 |
+
- **Path-based folder access**: Access files using simple paths like `folder1/folder2/folder3`
|
| 8 |
+
- **Efficient caching**: Folder IDs are cached to improve performance
|
| 9 |
+
- **Comprehensive metadata**: Get detailed information about files and folders
|
| 10 |
+
- **Read various file types**:
|
| 11 |
+
- Text files
|
| 12 |
+
- Google Docs
|
| 13 |
+
- VTT files
|
| 14 |
+
- **Robust folder finding**: Works with exact and partial name matching
|
| 15 |
+
- **Simple API**: Designed for ease of use with minimal code
|
| 16 |
+
|
| 17 |
+
## Setup Instructions
|
| 18 |
+
|
| 19 |
+
### 1. Create a Google Cloud Project
|
| 20 |
+
|
| 21 |
+
1. Go to the [Google Cloud Console](https://console.cloud.google.com/)
|
| 22 |
+
2. Click on the project dropdown at the top of the page and select "New Project"
|
| 23 |
+
3. Enter a project name and click "Create"
|
| 24 |
+
4. Once created, make sure your new project is selected in the dropdown
|
| 25 |
+
|
| 26 |
+
### 2. Enable the Google Drive API
|
| 27 |
+
|
| 28 |
+
1. In the Google Cloud Console, navigate to "APIs & Services" > "Library" in the left sidebar
|
| 29 |
+
2. Search for "Google Drive API" in the search bar
|
| 30 |
+
3. Click on "Google Drive API" in the results
|
| 31 |
+
4. Click the "Enable" button
|
| 32 |
+
|
| 33 |
+
### 3. Create OAuth Credentials
|
| 34 |
+
|
| 35 |
+
1. In the Google Cloud Console, go to "APIs & Services" > "Credentials" in the left sidebar
|
| 36 |
+
2. Click "Create Credentials" at the top and select "OAuth client ID"
|
| 37 |
+
3. If prompted to configure the OAuth consent screen:
|
| 38 |
+
- Choose "External" user type (or "Internal" if you're in a Google Workspace organization)
|
| 39 |
+
- Fill in the required information (App name, User support email, Developer contact email)
|
| 40 |
+
- Click "Save and Continue"
|
| 41 |
+
- Add the following scopes:
|
| 42 |
+
- `.../auth/drive` (Full access to Google Drive)
|
| 43 |
+
- Click "Save and Continue" and complete the registration
|
| 44 |
+
4. Return to the "Create OAuth client ID" screen
|
| 45 |
+
5. Select "Desktop application" as the Application type
|
| 46 |
+
6. Enter a name for your OAuth client (e.g., "Google Drive Access Desktop")
|
| 47 |
+
7. Click "Create"
|
| 48 |
+
8. Download the JSON file (this is your `client_secret.json`)
|
| 49 |
+
|
| 50 |
+
### 4. Project Setup
|
| 51 |
+
|
| 52 |
+
1. Setup a virtual environment and install dependencies:
|
| 53 |
+
```bash
|
| 54 |
+
python -m venv venv
|
| 55 |
+
source venv/bin/activate # On Windows: venv\Scripts\activate
|
| 56 |
+
pip install -r requirements.txt
|
| 57 |
+
```
|
| 58 |
+
|
| 59 |
+
2. Place your credentials:
|
| 60 |
+
- Create a `credentials` directory in your project root
|
| 61 |
+
- Move the downloaded OAuth client JSON file to the `credentials` directory
|
| 62 |
+
- Rename it to `client_secret.json`
|
| 63 |
+
|
| 64 |
+
### 5. Authentication Process
|
| 65 |
+
|
| 66 |
+
When you run the application for the first time:
|
| 67 |
+
1. A browser window will open automatically
|
| 68 |
+
2. You'll be asked to sign in to your Google account
|
| 69 |
+
3. You'll see a consent screen asking for permission to access your Google Drive
|
| 70 |
+
4. After granting permission, the browser will display a success message
|
| 71 |
+
5. The application will save a token file (`token.pickle`) in the credentials directory for future use
|
| 72 |
+
|
| 73 |
+
## Usage Guide
|
| 74 |
+
|
| 75 |
+
The `EasyGoogleDrive` class provides several methods to interact with Google Drive. Here's how to use the core functionality:
|
| 76 |
+
|
| 77 |
+
### Basic Usage
|
| 78 |
+
|
| 79 |
+
```python
|
| 80 |
+
from google_drive_access import EasyGoogleDrive
|
| 81 |
+
|
| 82 |
+
# Initialize the Google Drive client
|
| 83 |
+
drive = EasyGoogleDrive()
|
| 84 |
+
|
| 85 |
+
# Example folder path - replace with your actual folder path
|
| 86 |
+
folder_path = "Spring-2025-BAI"
|
| 87 |
+
subfolder_path = "Spring-2025-BAI/transcripts"
|
| 88 |
+
```
|
| 89 |
+
|
| 90 |
+
### Listing Folders
|
| 91 |
+
|
| 92 |
+
```python
|
| 93 |
+
# List folders in a directory
|
| 94 |
+
folders = drive.get_folders_in_folder(folder_path)
|
| 95 |
+
|
| 96 |
+
# Access folder properties
|
| 97 |
+
for folder in folders:
|
| 98 |
+
print(f"Folder: {folder['name']}")
|
| 99 |
+
print(f" Created: {folder.get('createdTimeFormatted', 'Unknown')}")
|
| 100 |
+
print(f" Modified: {folder.get('modifiedTimeFormatted', 'Unknown')}")
|
| 101 |
+
```
|
| 102 |
+
|
| 103 |
+
### Listing Files
|
| 104 |
+
|
| 105 |
+
```python
|
| 106 |
+
# List files in a directory
|
| 107 |
+
files = drive.get_files_in_folder(subfolder_path)
|
| 108 |
+
|
| 109 |
+
# Access file properties
|
| 110 |
+
for file in files:
|
| 111 |
+
print(f"File: {file['name']}")
|
| 112 |
+
print(f" Type: {file.get('fileType', 'Unknown')}")
|
| 113 |
+
print(f" Created: {file.get('createdTimeFormatted', 'Unknown')}")
|
| 114 |
+
print(f" Modified: {file.get('modifiedTimeFormatted', 'Unknown')}")
|
| 115 |
+
print(f" Size: {file.get('sizeFormatted', 'Unknown')}")
|
| 116 |
+
```
|
| 117 |
+
|
| 118 |
+
### Getting a Specific File
|
| 119 |
+
|
| 120 |
+
```python
|
| 121 |
+
# Get a specific file with metadata
|
| 122 |
+
file = drive.get_file("example.txt", subfolder_path, include_metadata=True)
|
| 123 |
+
|
| 124 |
+
if file:
|
| 125 |
+
print(f"File: {file['name']}")
|
| 126 |
+
print(f" Type: {file.get('fileType', 'Unknown')}")
|
| 127 |
+
print(f" Created: {file.get('createdTimeFormatted', 'Unknown')}")
|
| 128 |
+
print(f" Modified: {file.get('modifiedTimeFormatted', 'Unknown')}")
|
| 129 |
+
print(f" Size: {file.get('sizeFormatted', 'Unknown')}")
|
| 130 |
+
```
|
| 131 |
+
|
| 132 |
+
### Getting All Items in a Folder
|
| 133 |
+
|
| 134 |
+
```python
|
| 135 |
+
# Get all items (files and folders) in a folder
|
| 136 |
+
all_items = drive.get_all_files_in_folder(folder_path)
|
| 137 |
+
|
| 138 |
+
# Access item properties
|
| 139 |
+
for item in all_items:
|
| 140 |
+
item_type = "Folder" if item.get('mimeType') == drive.MIME_TYPES['folder'] else item.get('fileType', 'Unknown')
|
| 141 |
+
print(f"Item: {item['name']} ({item_type})")
|
| 142 |
+
```
|
| 143 |
+
|
| 144 |
+
### Checking if a File Exists
|
| 145 |
+
|
| 146 |
+
```python
|
| 147 |
+
# Check if a file exists
|
| 148 |
+
exists = drive.file_exists("example.txt", subfolder_path)
|
| 149 |
+
print(f"File exists: {exists}")
|
| 150 |
+
```
|
| 151 |
+
|
| 152 |
+
### Getting File Modified Time
|
| 153 |
+
|
| 154 |
+
```python
|
| 155 |
+
# Get file modified time
|
| 156 |
+
modified_time = drive.get_file_modified_time("example.txt", subfolder_path)
|
| 157 |
+
if modified_time:
|
| 158 |
+
print(f"Last modified: {modified_time}")
|
| 159 |
+
```
|
| 160 |
+
|
| 161 |
+
### Reading File Content
|
| 162 |
+
|
| 163 |
+
```python
|
| 164 |
+
# Get file with content
|
| 165 |
+
file_with_content = drive.get_file("example.txt", subfolder_path, include_content=True)
|
| 166 |
+
|
| 167 |
+
if file_with_content and 'file_content' in file_with_content:
|
| 168 |
+
content = file_with_content['file_content']
|
| 169 |
+
if content:
|
| 170 |
+
print(f"Content: {content[:100]}...") # Print first 100 characters
|
| 171 |
+
```
|
| 172 |
+
|
| 173 |
+
## Complete Example
|
| 174 |
+
|
| 175 |
+
For a complete example of how to use the `EasyGoogleDrive` class, see the `basic_usage.py` file included in this package. This file demonstrates all the core functionality with practical examples.
|
| 176 |
+
|
| 177 |
+
## Key Concepts
|
| 178 |
+
|
| 179 |
+
### Path-based Folder Access
|
| 180 |
+
|
| 181 |
+
The module uses a simple path-like syntax to access folders:
|
| 182 |
+
|
| 183 |
+
```python
|
| 184 |
+
# Access a deeply nested folder
|
| 185 |
+
folder_path = "folder1/folder2/folder3"
|
| 186 |
+
files = drive.get_files_in_folder(folder_path)
|
| 187 |
+
```
|
| 188 |
+
|
| 189 |
+
This makes it much easier to work with nested folder structures compared to using folder IDs.
|
| 190 |
+
|
| 191 |
+
### Metadata Fields
|
| 192 |
+
|
| 193 |
+
The module provides comprehensive metadata for files and folders, including:
|
| 194 |
+
|
| 195 |
+
- **Creation and modification dates**: Both as datetime objects and formatted strings
|
| 196 |
+
- **File size**: Both in bytes and human-readable format (KB, MB, GB)
|
| 197 |
+
- **File type**: Simplified type based on MIME type
|
| 198 |
+
- **Owner information**: Names and email addresses of file owners
|
| 199 |
+
- **Sharing status**: Whether the file is shared
|
| 200 |
+
- **Web links**: Direct links to view the file in a browser
|
| 201 |
+
|
| 202 |
+
## Error Handling
|
| 203 |
+
|
| 204 |
+
The module includes comprehensive error handling:
|
| 205 |
+
|
| 206 |
+
- **Authentication errors**: Clear messages when credentials are missing or invalid
|
| 207 |
+
- **Folder not found**: Helpful messages when a folder in the path cannot be found
|
| 208 |
+
- **File not found**: Attempts partial name matching before giving up
|
| 209 |
+
- **Decoding errors**: Handles issues with file content encoding
|
| 210 |
+
|
| 211 |
+
## Dependencies
|
| 212 |
+
|
| 213 |
+
- **Required**:
|
| 214 |
+
- google-auth-oauthlib
|
| 215 |
+
- google-auth-httplib2
|
| 216 |
+
- google-api-python-client
|
| 217 |
+
- python-dateutil
|
| 218 |
+
|
| 219 |
+
## Security Notes
|
| 220 |
+
|
| 221 |
+
- Never commit your `client_secret.json` or token files to version control
|
| 222 |
+
- Add `credentials/` to your `.gitignore` file
|
| 223 |
+
- Keep your credentials secure and don't share them
|
| 224 |
+
- For production applications, consider using service accounts with the minimum required permissions
|
| 225 |
+
|
| 226 |
+
## Contributing
|
| 227 |
+
|
| 228 |
+
Feel free to contribute to this project by submitting issues or pull requests.
|