OpenSight-Deepfake-Detection-Models-Playground

Running

App Files Files Community

LPX55 commited on 12 days ago

Commit

eff3634

1 Parent(s): 554718d

update inst

Browse files

Files changed (1) hide show

.github/instructions/copilot-instructions.md +38 -22

.github/instructions/copilot-instructions.md CHANGED Viewed

@@ -1,34 +1,50 @@
-# Copilot Instructions for OpenSight-Deepfake-Detection-Models-Playground
 ## Project Overview
-This project is a modular, agent-driven toolkit for deepfake detection and digital forensics. It uses an ensemble of models, advanced forensic tools, and smart agents to provide explainable, extensible, and robust detection—optimized for integration with vision LLMs and multimodal AI agents.
-## Architecture & Key Components
-- **Entrypoint:** `app.py` (Gradio app + MCP server).
-- **Forensics:** `forensics/` (e.g., `ela.py`, `gradient.py`, `minmax.py`, `bitplane.py`, `wavelet.py`). Each file implements a forensic technique, callable via LLM/MCP.
-- **Agents:** `agents/` (e.g., `EnsembleMonitorAgent`, `ModelWeightManager`, `ContextualIntelligenceAgent`, `ForensicAnomalyDetectionAgent`). Agents coordinate model weighting, context inference, and anomaly detection.
-- **Model Management:** Models are registered in `utils/registry.py` and managed as an ensemble with dynamic, context-aware weighting.
-- **Utilities:** `utils/` (logging, augmentation, registry, health checks).
 ## Data Flow & Prediction Pipeline
-1. **Image Preprocessing:** Normalize to PIL RGB; optionally augment (rotate, noise, sharpen).
-2. **Agent Initialization:** Monitoring, optimization, and context agents are set up.
-3. **Model Inference:** Each model predicts independently; results tracked by agents.
-4. **Consensus:** Model weights are dynamically adjusted based on context and agent feedback.
-5. **Forensic Analysis:** Multiple forensic tools run in parallel; outputs analyzed for anomalies.
-6. **Logging:** All results (images, predictions, agent data) are logged to Hugging Face datasets.
-## Developer Workflows
-- **Run the App:**
   ```bash
-  python app_optimized.py
   ```
-- **Dependencies:** See `requirements.txt` (notably: gradio, PIL, numpy, torch, smolagents, etc.).
 - **Extending Forensics/Agents:** Add new tools in `forensics/`, new agents in `agents/`, and register in the main app.
 - **Testing:** Unit tests for agents/models are planned (see roadmap in `README.md`).
-## Project-Specific Patterns & Conventions
 - **Forensic Tool Naming:** Use `tool_*` or descriptive names (e.g., `tool_ela`, `tool_waveletnoise`).
 - **Agent Classes:** Use `*Agent` suffix (e.g., `EnsembleMonitorAgent`).
 - **API Exposure:** Functions are exposed for LLM/MCP calls with clear parameter/return docs (see `README.md`).
@@ -38,7 +54,7 @@ This project is a modular, agent-driven toolkit for deepfake detection and digit
 ## Integration & Extension
 - **Add Models:** Update ensemble logic and register in agent system.
 - **Add Forensic Tools:** Implement in `forensics/`, expose via main app, and document parameters/returns.
-- **LLM/Multimodal Integration:** Hybrid input strategies (e.g., ELA+RGB, metadata+image) are encouraged; see `README.md` for detailed tables and guidance.
 ## References
 - **Forensic Techniques:** See `forensics/` for implementation details.
@@ -47,4 +63,4 @@ This project is a modular, agent-driven toolkit for deepfake detection and digit
 - **Roadmap:** Ongoing and planned features are tracked in `README.md`.
 ---
-**Tip:** When extending or debugging, always check agent logic and consensus weighting, as these are central to system behavior.

+# Copilot Instructions for OpenSight-Deepfake-Detection-Models-Playground (2025)
 ## Project Overview
+OpenSight is a modular, agent-driven toolkit for deepfake detection and digital forensics. It leverages an ensemble of models, advanced forensic tools, and smart agents for explainable, extensible, and robust detection. The system is optimized for integration with vision LLMs and multimodal AI agents, and supports logging to Hugging Face datasets.
+## Key Technologies
+- **Gradio**: Main UI and API server (`app.py`).
+- **Hugging Face Hub**: Model and dataset management, logging, and deployment.
+- **Git LFS**: Required for storing binary files (e.g., PNGs) in the repo. See `.gitattributes` for tracked types.
+- **Agents**: Smart agents for ensemble monitoring, weight optimization, system health, context intelligence, and anomaly detection (`agents/`).
+- **Forensic Tools**: Modular forensic techniques in `forensics/` (ELA, gradient, minmax, bitplane, wavelet, exif, etc.).
+- **Model Management**: Models are registered and managed in `utils/registry.py`, loaded via ONNX/Hugging Face/Gradio API.
+- **Utilities**: Logging, augmentation, health checks, and more in `utils/`.
 ## Data Flow & Prediction Pipeline
+1. **Image Preprocessing**: Normalize to PIL RGB, optional augmentation (rotate, noise, sharpen).
+2. **Agent Initialization**: Monitoring, optimization, and context agents setup.
+3. **Model Inference**: Each model predicts independently; results tracked by agents.
+4. **Consensus**: Model weights dynamically adjusted based on context and agent feedback.
+5. **Forensic Analysis**: Multiple forensic tools run in parallel; outputs analyzed for anomalies.
+6. **Logging**: All results (images, predictions, agent data) are logged to Hugging Face datasets (`hf_logger.py`).
+## Developer Workflow
+- **Run the App:**
+  ```bash
+  python app.py
+  ```
+- **Dependencies:** See `requirements.txt` (gradio, PIL, numpy, torch, huggingface_hub, etc.).
+- **Binary Files:** All PNGs and other binaries must be tracked with Git LFS. If you see a push error, run:
+  ```bash
+  git lfs track "*.png"
+  git add .gitattributes
+  git add <yourfile.png>
+  git commit -m "Track PNG files with Git LFS"
+  git push origin main
+  ```
+  If the file is already in history, use:
   ```bash
+  git lfs migrate import --include="*.png"
+  git push origin main
   ```
 - **Extending Forensics/Agents:** Add new tools in `forensics/`, new agents in `agents/`, and register in the main app.
 - **Testing:** Unit tests for agents/models are planned (see roadmap in `README.md`).
+## Project Patterns & Conventions
 - **Forensic Tool Naming:** Use `tool_*` or descriptive names (e.g., `tool_ela`, `tool_waveletnoise`).
 - **Agent Classes:** Use `*Agent` suffix (e.g., `EnsembleMonitorAgent`).
 - **API Exposure:** Functions are exposed for LLM/MCP calls with clear parameter/return docs (see `README.md`).
 ## Integration & Extension
 - **Add Models:** Update ensemble logic and register in agent system.
 - **Add Forensic Tools:** Implement in `forensics/`, expose via main app, and document parameters/returns.
+- **LLM/Multimodal Integration:** Hybrid input strategies (e.g., ELA+RGB, metadata+image) are encouraged; see `README.md` for details.
 ## References
 - **Forensic Techniques:** See `forensics/` for implementation details.
 - **Roadmap:** Ongoing and planned features are tracked in `README.md`.
 ---
+**Tip:** Always check agent logic and consensus weighting when extending or debugging, as these are central to system behavior. For binary file push errors, ensure Git LFS is set up and files are tracked correctly.