---
library_name: transformers
tags:
  - text-classification
  - malicious-url-detection
---

# Malicious-Url-Detector

This fine-tuned model classifies a URL as either malicious or benign, which helps identify harmful links intended to exploit users, such as phishing or malware URLs.

## Model Details

### Model Description

This model is a **fine-tuned** version of [distilbert/distilbert-base-uncased](https://huggingface.co/distilbert/distilbert-base-uncased), adapted specifically for malicious URL detection. It treats the URL string as text and classifies it as benign or malicious, using patterns learned from a curated dataset of phishing, malware, and legitimate URLs. It is intended to help users and organizations screen links for phishing and malware.

- **Developed by:** Eason Liu  
- **Language:** English  
- **Model Type:** Text Classification (URL-focused)  
- **Finetuned From:** [distilbert/distilbert-base-uncased](https://huggingface.co/distilbert/distilbert-base-uncased)
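
To illustrate the text-classification approach described above: a sequence-classification head produces one raw score (logit) per class, and a softmax turns those scores into probabilities. The sketch below shows that last step in plain Python. The `ID2LABEL` mapping and the example logits are hypothetical; the label names this model actually uses are stored in its configuration (`model.config.id2label`) and may differ.

```python
import math


def logits_to_prediction(logits, id2label):
    """Convert raw class logits into a (label, probability) pair via softmax."""
    # Subtract the largest logit before exponentiating for numerical stability.
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    probs = [e / total for e in exps]
    # Pick the class index with the highest probability.
    best = max(range(len(probs)), key=probs.__getitem__)
    return id2label[best], probs[best]


# Hypothetical label mapping (an assumption, not taken from the model config).
ID2LABEL = {0: "benign", 1: "malicious"}

# Made-up logits for a single URL, for illustration only.
label, prob = logits_to_prediction([-1.2, 2.3], ID2LABEL)
print(label, round(prob, 3))
```

The `pipeline` helper performs this conversion internally, so the sketch is only useful when working with the raw model outputs.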

## Intended Use

### Direct Use
- **URL Classification:** Detect whether a URL is malicious (e.g., phishing, malware) or benign.
- **Security Pipelines:** Integrate into email filtering systems or website scanning tools to flag harmful links.
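
As a rough sketch of the email-filtering use case, the helper below extracts links from a message body and returns those a classifier flags. It assumes the classifier is a callable that takes a list of URLs and returns one `{"label": ..., "score": ...}` dict per URL, which is the shape a Transformers text-classification pipeline returns for a list input. The function names, the regular expression, the `"malicious"` label name, and the `0.5` threshold are illustrative assumptions, not part of this model.

```python
import re

# Simple http(s) URL matcher; an illustrative assumption, not a full URL parser.
URL_PATTERN = re.compile(r"https?://[^\s<>\"']+")


def extract_urls(text):
    """Return the http(s) URLs found in text, in order, without duplicates."""
    seen = []
    for match in URL_PATTERN.findall(text):
        # Drop punctuation that commonly trails a link in running text.
        url = match.rstrip(".,;:!?)")
        if url and url not in seen:
            seen.append(url)
    return seen


def flag_urls(text, classify, malicious_label="malicious", threshold=0.5):
    """Return (url, score) pairs for URLs the classifier marks as malicious.

    `malicious_label` is an assumed label name; check the model's config
    for the labels it actually emits.
    """
    urls = extract_urls(text)
    if not urls:
        return []
    results = classify(urls)
    return [
        (url, result["score"])
        for url, result in zip(urls, results)
        if result["label"] == malicious_label and result["score"] >= threshold
    ]


def stub_classifier(urls):
    """Stand-in for the real pipeline so the sketch runs without a download."""
    return [
        {"label": "malicious" if "login" in u else "benign", "score": 0.9}
        for u in urls
    ]


email_body = "Reset here: http://example.com/login-verify. Docs: https://example.org/help"
print(flag_urls(email_body, stub_classifier))
```

To use the real model, pass the `classifier` pipeline object in place of `stub_classifier` and set `malicious_label` to whichever label the model reports for malicious URLs.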

### Out-of-Scope Use
- General text classification tasks not related to malicious URL detection.
- Tasks that require context beyond the URL string itself (e.g., domain reputation, real-time link behavior).

## How to Get Started

Below is a quick example showing how to use this model with the 🤗 Transformers `pipeline`:

```python
from transformers import pipeline

# Initialize the text-classification pipeline with this fine-tuned model
classifier = pipeline(
    "text-classification",
    model="Eason918/malicious-url-detector",
    truncation=True
)

# Example URL
url = "http://example.com/suspicious-link"

# Classify the URL; the pipeline returns a list with one dict per input,
# each holding a "label" and a "score"
result = classifier(url)
print(result)