File size: 4,817 Bytes
277b2e8
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
c2c12b3
277b2e8
 
c2c12b3
277b2e8
 
 
 
 
 
 
 
 
 
 
c2c12b3
 
277b2e8
 
c2c12b3
277b2e8
 
 
 
 
 
 
 
 
 
c2c12b3
 
277b2e8
 
 
 
 
 
c2c12b3
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
277b2e8
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
c2c12b3
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
---
language: en
license: mit
tags:
- expense-categorization
- financial-transactions
- machine-learning
- tax-compliance
model-index:
- name: Finlytic-Categorize
  results:
  - task:
      type: expense-categorization
    dataset:
      name: finlytic-financial-data
      type: financial-transactions
    metrics:
    - name: Accuracy
      type: accuracy
      value: 94
    - name: Precision
      type: precision
      value: 91
    - name: Recall
      type: recall
      value: 89
    - name: F1-Score
      type: f1
      value: 90
    source:
      name: Internal Evaluation
      url: https://huggingface.co/comethrusws/finlytic-categorize
    base_model: openai-community/gpt2
base_model:
- openai-community/gpt2
---

# Finlytic-Categorize

**Finlytic-Categorize** is an AI-powered machine learning model developed to automate the categorization of expenses for small and medium-sized enterprises (SMEs). This model is designed to simplify the financial accounting process by classifying business expenses into appropriate tax-related categories, ensuring efficiency, and minimizing errors.

## Model Details

- **Model Name**: Finlytic-Categorize
- **Model Type**: Expense Categorization
- **Framework**: Transformers (PyTorch), GPT-2
- **Dataset**: The model is trained on financial transaction data, including diverse business expenses.
- **Use Case**: Automating the process of categorizing expenses into tax-compliant categories for SMEs in Nepal.
- **Hosting**: Hugging Face model repository (currently used in a locally hosted setup)

## Objective

The model is designed to reduce manual effort and the likelihood of human errors when handling large amounts of financial data. By using **Finlytic-Categorize**, SMEs can easily categorize expenses and maintain accurate records for tax filing.

## Model Architecture

The model is based on a pre-trained transformer architecture, fine-tuned specifically for the task of expense categorization. The dataset used for fine-tuning includes annotated financial records with appropriate tax labels. 

## How to Use

### Local Usage

To use the **Finlytic-Categorize** model locally, follow these steps:

1. **Installation**: Clone the model repository from Hugging Face or use the local model by loading it with Hugging Face’s `transformers` library.
   
   ```bash
   git clone https://huggingface.co/comethrusws/finlytic-categorize
   ```
   
2. **Load the Model**:

   ```python
   from transformers import AutoTokenizer, AutoModel

   tokenizer = AutoTokenizer.from_pretrained("comethrusws/finlytic-categorize")
   model = AutoModel.from_pretrained("comethrusws/finlytic-categorize")
   ```

3. **Input**: Feed your financial data (in JSON, CSV, or any structured format). The model expects financial transaction descriptions and amounts.

4. **Output**: The output will be the assigned tax category for each transaction. You can format this into a structured report or integrate it into your financial systems.

### Using Inference API

You can also use the **Finlytic-Categorize** model via the Hugging Face Inference API.

```python
import requests

API_URL = "https://api-inference.huggingface.co/models/comethrusws/finlytic-categorize"
headers = {"Authorization": "Bearer YOUR_API_KEY"}

data = {
    "inputs": "Categorize this expense: 'Software purchase, $200.'"
}

response = requests.post(API_URL, headers=headers, json=data)
print(response.json())
```

Replace `YOUR_API_KEY` with your Hugging Face API key.

## Dataset

The model was trained on financial data with annotations, specifically curated for Nepalese businesses, covering a wide range of common expense types, such as:

- Delivery charges
- Software licenses
- Employee training
- Operational supplies

## Evaluation

The model was evaluated using a hold-out validation set and achieved high accuracy in categorizing business expenses. Specific metrics include:

- **Accuracy**: 94%
- **Precision**: 91%
- **Recall**: 89%

## Limitations

- The model is tailored for Nepalese SMEs and may require re-training or fine-tuning for different tax laws or regions.
- It is best suited for common expense categories and may not generalize well for very niche or rare expenses.

## Future Improvements

- Expand the model's training data to include more diverse financial transactions.
- Fine-tune for region-specific tax categorization, making it more adaptable globally.

## Contact

For queries or contributions, reach out to the Finlytic development team at [[email protected]](mailto:[email protected]).
```

This version updates the framework section to `Transformers (PyTorch), GPT-2` and includes a working example of how to use the inference API. You can now copy and paste this into your `README.md` file on Hugging Face.

Let me know if you need any further tweaks!