File size: 2,761 Bytes
77efb88
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
---
title: AI Child Behavior Assessment
emoji: πŸ§’
colorFrom: blue
colorTo: green
sdk: streamlit
app_file: app.py
pinned: false
sdk_version: 1.42.0
---

AI Child Behavior Assessment

Multimodal AI-powered tool for analyzing child emotions and speech patterns

πŸš€ Live Demo: Hugging Face Spaces

πŸ“Œ Overview

The AI Child Behavior Assessment app is designed to analyze children’s emotional and speech patterns using multimodal AI models. It integrates:
	β€’	Facial Emotion Recognition (DeepFace) πŸ§‘β€πŸŽ¨
	β€’	Speech Analysis & Transcription (Wav2Vec2) πŸŽ™οΈ
	β€’	Multimodal Analysis (Video + Audio combined) πŸŽ₯ + πŸ”Š

This tool helps in early mental health screening and behavioral assessments for research, caregivers, and psychologists.

✨ Features

βœ… 1. Video-Based Emotion Analysis
	β€’	Uses DeepFace AI to detect facial expressions and emotions.
	β€’	Processes video frames to determine dominant emotions.
	β€’	Generates a visual summary of detected emotions.

βœ… 2. Audio-Based Speech & Tone Analysis
	β€’	Uses Wav2Vec2 to transcribe spoken words.
	β€’	Applies speech emotion recognition to assess tone and sentiment.
	β€’	Includes noise reduction for clearer transcriptions.

βœ… 3. Multimodal Analysis (Video + Audio Combined)
	β€’	Extracts both visual and speech cues to detect behavior patterns.
	β€’	Compares facial emotions with speech tone to identify inconsistencies.
	β€’	Provides comprehensive insights into child behavior.

βœ… 4. Data Visualization
	β€’	Displays emotion distribution over time using bar charts πŸ“Š.
	β€’	Generates speech vs. video emotion comparison charts πŸ†.

πŸ”§ How to Use

1️⃣ Select an Analysis Mode:
	β€’	Upload a video file for emotion recognition.
	β€’	Upload an audio file for speech analysis.
	β€’	Upload a video + audio file for multimodal analysis.

2️⃣ Click β€œAnalyze” to process the uploaded file.
3️⃣ View Results:
	β€’	Detected emotions, speech transcription, and analysis insights will be displayed.

πŸ“‚ Supported File Formats

Analysis Type	Supported Formats
Video πŸŽ₯	MP4, AVI, MOV
Audio πŸŽ™οΈ	WAV, MP3
Multimodal (Video + Audio)	MP4, MOV

πŸ” Future Improvements

πŸš€ Planned Enhancements:
βœ… Real-time emotion tracking for live video
βœ… AI-driven predictive analysis for behavioral trends
βœ… Integration with clinical psychology datasets for validation
βœ… More advanced multimodal deep learning models

πŸ“œ Citation & Acknowledgment

If you use this tool in research or projects, please cite:
Durganihantri Low – AI Child Behavior Assessment (2025)
🌐 Hugging Face Spaces

πŸ‘¨β€πŸ’» Contact & Contributions

Have suggestions or want to contribute? Contact me:
πŸ“§ Email: [email protected]
πŸ”— LinkedIn: http://linkedin.com/in/durganihantri