Analyze video processing times and insights using Whisper models
Ask questions about images to get answers