File size: 8,459 Bytes
9b44a02 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217 218 219 220 221 222 223 224 225 226 |
# Active Graph Database (AGDB) & Active Graph Network (AGN) Framework
## Introduction
**Active Graph Database (AGDB)** and **Active Graph Network (AGN)** together form a revolutionary framework for handling complex, structured, and time-based data across diverse domains. This system allows raw data to be efficiently stored, transformed, and analyzed alongside feature-engineered data, with powerful synthetic relationships for contextual insights. It enables a seamless flow from raw data to feature engineering and then into domain-specific inference policies, ideal for use cases in trading, healthcare, finance, and other fields that rely on dynamic, data-driven decision-making.
### Overview of AGDB and AGN Layers
AGDB and AGN leverage a **3D structure** that combines:
- **X and Y Axes (AGN Layer)**: Define relationships and dependencies between raw data and feature-engineered attributes, allowing for feature engineering and complex relationship mapping.
- **Z Axis (AGDB Layer)**: Maintains temporal hierarchy, structuring data in time-based checkpoints for efficient querying, traversal, and lag-based analysis.
Through **real** and **synthetic relationships** defined by policies, this framework enables both immediate analysis of raw data and in-depth insights from engineered features.
---
## 1. Core Components and Architecture
### AGDB: Dual Database Structure
The AGDB framework utilizes **two main databases**:
1. **AGDB 1 (Raw Data)**: Stores raw trading data (e.g., Open, High, Low, Close, Volume) across a structured time series.
2. **AGDB 2 (Feature-Engineered Data)**: Stores feature-engineered indicators (e.g., RSI, MACD, Bollinger Bands), calculated from AGDB 1 data.
**Cross-AGDB Relationships**: These databases are interconnected through **real edges** (representing direct dependencies between raw and feature-engineered data) and **synthetic relationships** (showing inferred patterns, such as correlations between volume and RSI).
### Diagram: AGDB Dual Database Structure
```mermaid
graph TD
subgraph AGDB_RawData_Z
Year2024_Raw[2024 Year]
Month11_Raw[November]
Day04_Raw[4th]
Hour10_Raw[10:00 AM]
Minute45_Raw[10:45 AM]
RawNode[Raw Data Node]
RawNode --> Open[Open]
RawNode --> High[High]
RawNode --> Low[Low]
RawNode --> Close[Close]
RawNode --> Volume[Volume]
end
subgraph AGDB_Features_Z
Year2024_Features[2024 Year]
Month11_Features[November]
Day04_Features[4th]
Hour10_Features[10:00 AM]
Minute45_Features[10:45 AM]
FeatureNode[Feature Data Node]
FeatureNode --> RSI[RSI]
FeatureNode --> MACD[MACD]
FeatureNode --> Bollinger[Bollinger Bands]
FeatureNode --> EMA[EMA]
FeatureNode --> SMA[SMA]
end
RawNode -->|Real Edge| RSI
RawNode -->|Real Edge| MACD
RawNode -->|Real Edge| Bollinger
RawNode -->|Real Edge| EMA
RawNode -->|Real Edge| SMA
Volume -->|Synthetic Relationship| RSI
RSI -->|Synthetic Relationship| Bollinger
Volume -->|Synthetic Relationship| MACD
```
**Explanation**:
- **Real Edges**: Show the dependency of feature-engineered nodes on specific raw data attributes.
- **Synthetic Relationships**: These inferred relationships provide additional insights based on patterns, such as correlations between volume and feature indicators like RSI.
---
## 2. Layered Structure: AGN X and Y Axes
In the AGN layer, the X and Y axes define the relationships between raw features and engineered indicators, allowing for feature engineering, dependency mapping, and indicator calculation workflows.
### Feature Engineering Nodes and Calculations
Each feature is represented by a calculation node within AGN, specifying the computation required for indicators such as RSI, MACD, and Bollinger Bands.
#### Diagram: Feature Engineering and Calculation Nodes
```mermaid
graph TD
subgraph AGN_Layer_XY
Open[Open Price]
High[High Price]
Low[Low Price]
Close[Close Price]
Volume[Volume]
subgraph Indicators
RSI[RSI]
MACD[MACD]
Bollinger[Bollinger Bands]
EMA[EMA]
SMA[SMA]
end
Open --> RSI
Close --> RSI
Volume --> MACD
Close --> MACD
Close --> Bollinger
High --> Bollinger
Low --> Bollinger
Close --> EMA
Close --> SMA
end
```
### Explanation:
- **Calculation Nodes**: Each feature-engineered node (e.g., RSI, MACD) connects to specific raw features, enabling step-by-step transformations.
- **Indicator Dependencies**: Indicators rely on multiple features (e.g., Close, Volume) for dynamic calculations, enabling diverse queries and analysis.
---
## 3. Z Axis: Temporal Structure and Query Traversal
The Z-axis in AGDB represents a structured time series, allowing for efficient traversal of nodes by time (e.g., Year > Month > Day > Hour > Minute). Both AGDB 1 and AGDB 2 are organized by time to align raw and engineered data chronologically.
### Temporal Structure and Checkpoints
AGDB leverages **temporal checkpoints** for fast traversal across time intervals, while also allowing **lag-based queries** for analyzing patterns over time (e.g., moving averages).
#### Diagram: Temporal Structure and Time-Based Nodes
```mermaid
graph TD
subgraph AGDB_Layer_Z
Year2024[2024 Year]
Month11[November]
Day04[4th]
Hour10[10:00 AM]
Minute45[10:45 AM]
Data_Node[Data Node]
Data_Node --> Open[Open]
Data_Node --> High[High]
Data_Node --> Low[Low]
Data_Node --> Close[Close]
Data_Node --> Volume[Volume]
Data_Node --> RSI[RSI]
Data_Node --> MACD[MACD]
Data_Node --> Bollinger[Bollinger Bands]
Data_Node --> EMA[EMA]
Data_Node --> SMA[SMA]
end
```
### Checkpoints and Lagged Analysis
- **Checkpoints**: Temporal checkpoints simplify time-based querying by jumping directly to intervals like hourly or daily nodes.
- **Lag-Based Features**: Supports lags for indicators such as SMA and EMA, enabling momentum-based analysis.
---
## 4. Policy and Workflow Layer for Calculations and Queries
This layer applies policies to manage relationships, calculations, and synthetic inferences across AGDB and AGN. Policies define workflows for each indicator, manage access control, and establish synthetic relationships for contextual inferences.
### Sample Policies and Commands
#### Feature Calculation Policy Example
```json
{
"policies": {
"feature_calculation": {
"RSI": {
"dependencies": ["Close"],
"period": 14,
"method": "smoothing"
},
"MACD": {
"dependencies": ["Close", "Volume"],
"EMA_periods": [12, 26],
"signal_period": 9
}
}
}
}
```
### Unified Command Logic
Commands manage data across both AGDBs and AGNs, with syntax that reflects the operation and the target data:
- **create-graph**: Initializes a new graph for a dataset.
- **create-node**: Adds raw data or feature-engineered node.
- **get-node.attribute**: Retrieves specific attributes (e.g., "Close") at a timestamp.
- **get-relationship**: Queries relationships across nodes (e.g., correlations between Volume and RSI).
### Enhanced Query Examples
**Retrieve Raw and Feature-Engineered Node**:
```plaintext
get-node AGDB_1/2024/11/04/10:45
get-node AGDB_2/2024/11/04/10:45
```
**Cross-AGDB Synthesis Query**:
```plaintext
get-relationship synthetic_edge -from Volume -to RSI -relationship correlates_with
```
---
## 5. Expanded Application Scenarios
### Domain-Agnostic Utility
While trading data is the primary example, the AGDB-AGN framework applies to healthcare, finance, and any field where structured data relationships and temporal trends need dynamic insights.
1. **Healthcare**: AGDBs can connect patient data across multiple domains (e.g., treatments, diagnostics), while AGN policies manage relationships and patient histories.
2. **Finance**: Allows modeling of economic indicators, linking multiple datasets for comprehensive market analysis.
3. **Public Service**: Uses synthetic relationships to analyze data across social and transportation domains, helping in resource allocation and trend prediction.
### Conclusion
The **AGDB & AGN Framework** transforms raw data into actionable insights, using a 3D structure that scales effortlessly across various domains. By integrating real and synthetic relationships, checkpoints, and lagged analysis, this approach empowers users to uncover hidden patterns and leverage
|