Interesting. Did you learn how the creator of the dataset synthesised it?
nomadicsynth PRO
nomadicsynth
AI & ML interests
knowledge discovery
Recent Activity
replied to
their
post
27 days ago
Anyone using AI and ML to help neurodivergent people? I'd love to hear what you're doing.
replied to
their
post
28 days ago
Anyone using AI and ML to help neurodivergent people? I'd love to hear what you're doing.
updated
a Space
about 1 month ago
nomadicsynth/inkling
Organizations

replied to
their
post
27 days ago

replied to
their
post
28 days ago
10 days? Rookie numbers ๐. so many side-quests. no idea what to do with myself lol. i'd love to hear about your ideas, and happy to give some feedback (for what it's worth)

reacted to
jbilcke-hf's
post with ๐
2 months ago
Post
1920
Hi everyone,
I've seen some unsuccessful attempts at running Wan2GP inside a Hugging Face Space, which is a shame as it is a great Gradio app!
So here is a fork that you can use, with some instructions on how to do this:
jbilcke-hf/Wan2GP_you_must_clone_this_space_to_use_it#1
Note : some things like persistent models/storage/custom LoRAs might not be fully working out of the box. If you need those, you might have to dig into the Wan2GP codebase, see how to tweak the storage folder. Happy hacking!
I've seen some unsuccessful attempts at running Wan2GP inside a Hugging Face Space, which is a shame as it is a great Gradio app!
So here is a fork that you can use, with some instructions on how to do this:
jbilcke-hf/Wan2GP_you_must_clone_this_space_to_use_it#1
Note : some things like persistent models/storage/custom LoRAs might not be fully working out of the box. If you need those, you might have to dig into the Wan2GP codebase, see how to tweak the storage folder. Happy hacking!

reacted to
ArturoNereu's
post with ๐
2 months ago
Post
1756
I just finished AI Engineering by Chip Huyen. Probably the best resource Iโve seen that covers the full AI stack. People wondering how to shift their careers toward AI might find this very useful.
I recently shared this list of resources Iโve been using to learn AI:
๐ https://github.com/ArturoNereu/AI-Study-Group
I recently shared this list of resources Iโve been using to learn AI:
๐ https://github.com/ArturoNereu/AI-Study-Group

reacted to
codelion's
post with ๐
2 months ago
Post
3437
๐ง We just implemented Andrej Karpathy's "third paradigm" for LLM learning!
System Prompt Learning (SPL) enables LLMs to automatically learn problem-solving strategies from experience, rather than relying on static prompts.
๐ How it works:
Your LLM builds a database of effective strategies, selects the best ones for each problem, and refines them over time based on success rates.
๐ Results across math benchmarks:
Arena Hard: 29% โ 37.6% (+8.6%)
AIME24: 23.33% โ 30% (+6.67%)
OptILLMBench: 61% โ 65% (+4%)
The best part? All strategies are human-readable and the system gets progressively better at problem types you use frequently.
โจ Key benefits:
๐ Cumulative learning over time
๐ Transparent, inspectable strategies
๐ Works with any OpenAI-compatible API
โก Simple integration: just add "spl-" prefix to your model
Built as an open-source plugin in optillm. After 500 queries, our system developed 129 strategies and refined 97 of them!
This feels like a genuine step toward AI that learns from experience while staying completely interpretable.
๐ GitHub: https://github.com/codelion/optillm/tree/main/optillm/plugins/spl
๐ Full article: https://huggingface.co/blog/codelion/system-prompt-learning
๐ฆ Original Karpathy tweet: https://x.com/karpathy/status/1921368644069765486
Have you experimented with advanced system prompting? What strategies would you want your LLM to learn?
System Prompt Learning (SPL) enables LLMs to automatically learn problem-solving strategies from experience, rather than relying on static prompts.
๐ How it works:
Your LLM builds a database of effective strategies, selects the best ones for each problem, and refines them over time based on success rates.
๐ Results across math benchmarks:
Arena Hard: 29% โ 37.6% (+8.6%)
AIME24: 23.33% โ 30% (+6.67%)
OptILLMBench: 61% โ 65% (+4%)
The best part? All strategies are human-readable and the system gets progressively better at problem types you use frequently.
โจ Key benefits:
๐ Cumulative learning over time
๐ Transparent, inspectable strategies
๐ Works with any OpenAI-compatible API
โก Simple integration: just add "spl-" prefix to your model
Built as an open-source plugin in optillm. After 500 queries, our system developed 129 strategies and refined 97 of them!
This feels like a genuine step toward AI that learns from experience while staying completely interpretable.
๐ GitHub: https://github.com/codelion/optillm/tree/main/optillm/plugins/spl
๐ Full article: https://huggingface.co/blog/codelion/system-prompt-learning
๐ฆ Original Karpathy tweet: https://x.com/karpathy/status/1921368644069765486
Have you experimented with advanced system prompting? What strategies would you want your LLM to learn?

reacted to
openfree's
post with ๐
2 months ago
Post
2852
๐๏ธ Voice Clone AI Podcast Generator: Create Emotionally Rich Podcasts with Your Own Voice!
๐ Project Introduction
Hello! Today we're excited to introduce an AI-powered solo podcast generator that creates high-quality voice cloning with authentic emotional expression.
Transform any PDF document, web URL, or keyword into a professional podcast with just a few clicks! ๐โก๏ธ๐ง
VIDraft/Voice-Clone-Podcast
โจ Key Features
1. ๐ฏ Multiple Input Methods
URL: Simply paste any blog or article link
PDF: Upload research papers or documents directly
Keyword: Enter a topic and AI searches for the latest information to create content
2. ๐ญ Emotionally Expressive Voice Cloning
Powered by Chatterbox TTS:
๐ค Voice Cloning: Learn and replicate your unique voice perfectly
๐ข Natural intonation and emotional expression
๐ Customizable emotion intensity with Exaggeration control
โก Seamless handling of long texts with automatic chunking
3. ๐ค State-of-the-Art LLM Script Generation
Professional-grade English dialogue using Private-BitSix-Mistral
12 natural conversational exchanges
Real-time web search integration for up-to-date information
Fully editable generated scripts! โ๏ธ
๐ก Use Cases
๐ Educational Content
Transform complex research papers into easy-to-understand podcasts
Create English learning materials in your own voice
๐ฐ News & Information
Convert international articles into engaging audio content
Produce global trend analysis podcasts
๐จ Creative Content
Tell stories in English with your own voice
Build your global personal brand with custom audio content
๐ ๏ธ Tech Stack
๐ง LLM: Llama CPP + Private-BitSix-Mistral
๐ฃ๏ธ TTS: Chatterbox (Voice Cloning & Emotional Expression)
๐ Search: Brave Search API
๐ Document Processing: LangChain + PyPDF
๐ฅ๏ธ Interface: Gradio
๐ What Makes Us Special
๐ค Voice Cloning: Perfect voice replication from just a short audio sample
๐ Emotion Contro ๐ Unlimited Length ๐ Real-time Updates
๐ Project Introduction
Hello! Today we're excited to introduce an AI-powered solo podcast generator that creates high-quality voice cloning with authentic emotional expression.
Transform any PDF document, web URL, or keyword into a professional podcast with just a few clicks! ๐โก๏ธ๐ง
VIDraft/Voice-Clone-Podcast
โจ Key Features
1. ๐ฏ Multiple Input Methods
URL: Simply paste any blog or article link
PDF: Upload research papers or documents directly
Keyword: Enter a topic and AI searches for the latest information to create content
2. ๐ญ Emotionally Expressive Voice Cloning
Powered by Chatterbox TTS:
๐ค Voice Cloning: Learn and replicate your unique voice perfectly
๐ข Natural intonation and emotional expression
๐ Customizable emotion intensity with Exaggeration control
โก Seamless handling of long texts with automatic chunking
3. ๐ค State-of-the-Art LLM Script Generation
Professional-grade English dialogue using Private-BitSix-Mistral
12 natural conversational exchanges
Real-time web search integration for up-to-date information
Fully editable generated scripts! โ๏ธ
๐ก Use Cases
๐ Educational Content
Transform complex research papers into easy-to-understand podcasts
Create English learning materials in your own voice
๐ฐ News & Information
Convert international articles into engaging audio content
Produce global trend analysis podcasts
๐จ Creative Content
Tell stories in English with your own voice
Build your global personal brand with custom audio content
๐ ๏ธ Tech Stack
๐ง LLM: Llama CPP + Private-BitSix-Mistral
๐ฃ๏ธ TTS: Chatterbox (Voice Cloning & Emotional Expression)
๐ Search: Brave Search API
๐ Document Processing: LangChain + PyPDF
๐ฅ๏ธ Interface: Gradio
๐ What Makes Us Special
๐ค Voice Cloning: Perfect voice replication from just a short audio sample
๐ Emotion Contro ๐ Unlimited Length ๐ Real-time Updates

replied to
ProCreations's
post
3 months ago
Every time you use a HF space you randomly start dancing for 5 minutes
This one fr i'm dancing all day anyway idk how people survive in cubicles

reacted to
codelion's
post with ๐
3 months ago
Post
2857
๐งฌ Hey everyone! Just released **OpenEvolve** - an open-source implementation of Google DeepMind's AlphaEvolve system.
It's an evolutionary coding agent that uses LLMs to discover and optimize algorithms. I successfully replicated DeepMind's results on circle packing (99.97% match!) and evolved a random search into a simulated annealing algorithm.
โจ Key features:
- Evolves entire codebases (not just single functions)
- Works with any OpenAI-compatible API
- LLM ensemble approach for better results
- Multi-objective optimization
๐ Check it out:
GitHub: https://github.com/codelion/openevolve
Blog post: https://huggingface.co/blog/codelion/openevolve
Would love to hear your thoughts or answer any questions about it!
It's an evolutionary coding agent that uses LLMs to discover and optimize algorithms. I successfully replicated DeepMind's results on circle packing (99.97% match!) and evolved a random search into a simulated annealing algorithm.
โจ Key features:
- Evolves entire codebases (not just single functions)
- Works with any OpenAI-compatible API
- LLM ensemble approach for better results
- Multi-objective optimization
๐ Check it out:
GitHub: https://github.com/codelion/openevolve
Blog post: https://huggingface.co/blog/codelion/openevolve
Would love to hear your thoughts or answer any questions about it!
[bot] Conversion to Parquet
#1 opened 3 months ago
by
parquet-converter


reacted to
ProCreations's
post with ๐
3 months ago
Post
1908
I made a space!
Check out
https://huggingface.co/spaces/ProCreations/realtime-ai-visualization
This cool space visualizes a real neural net in real time. It trains a real 199 parameter model on XOR. With baby mode for non-devs and advanced mode for developers or enthusiasts, (hopefully) everyone will understand!
Check out
https://huggingface.co/spaces/ProCreations/realtime-ai-visualization
This cool space visualizes a real neural net in real time. It trains a real 199 parameter model on XOR. With baby mode for non-devs and advanced mode for developers or enthusiasts, (hopefully) everyone will understand!

reacted to
seawolf2357's
post with ๐
3 months ago
Post
6332
Samsung Hacking Incident: Samsung Electronics' Official Hugging Face Account Compromised
Samsung Electronics' official Hugging Face account has been hacked. Approximately 17 hours ago, two new language models (LLMs) were registered under Samsung Electronics' official Hugging Face account. These models are:
https://huggingface.co/Samsung/MuTokenZero2-32B
https://huggingface.co/Samsung/MythoMax-L2-13B
The model descriptions contain absurd and false claims, such as being trained on "1 million W200 GPUs," hardware that doesn't even exist.
Moreover, community participants on Hugging Face who have noticed this issue are continuously posting that Samsung Electronics' account has been compromised.
There is concern about potential secondary and tertiary damage if users download these LLMs released under the Samsung Electronics account, trusting Samsung's reputation without knowing about the hack.
Samsung Electronics appears to be unaware of this situation, as they have not taken any visible measures yet, such as changing the account password.
Source: https://discord.gg/openfreeai
Samsung Electronics' official Hugging Face account has been hacked. Approximately 17 hours ago, two new language models (LLMs) were registered under Samsung Electronics' official Hugging Face account. These models are:
https://huggingface.co/Samsung/MuTokenZero2-32B
https://huggingface.co/Samsung/MythoMax-L2-13B
The model descriptions contain absurd and false claims, such as being trained on "1 million W200 GPUs," hardware that doesn't even exist.
Moreover, community participants on Hugging Face who have noticed this issue are continuously posting that Samsung Electronics' account has been compromised.
There is concern about potential secondary and tertiary damage if users download these LLMs released under the Samsung Electronics account, trusting Samsung's reputation without knowing about the hack.
Samsung Electronics appears to be unaware of this situation, as they have not taken any visible measures yet, such as changing the account password.
Source: https://discord.gg/openfreeai

reacted to
AdinaY's
post with ๐
3 months ago
Post
2535
Matrix Game ๐ฎ an interactive foundation model for controllable game world generation, released by Skywork AI.
Skywork/Matrix-Game
โจ 17B with MIT licensed
โจ Diffusion-based image-to-world video generation via keyboard & mouse input
โจ GameWorld Score benchmark for Minecraft world models
โจ Massive Matrix Game Dataset with fine-grained action labels
Skywork/Matrix-Game
โจ 17B with MIT licensed
โจ Diffusion-based image-to-world video generation via keyboard & mouse input
โจ GameWorld Score benchmark for Minecraft world models
โจ Massive Matrix Game Dataset with fine-grained action labels

reacted to
ArturoNereu's
post with ๐ฅ
3 months ago
Post
4403
Iโve been learning AI for several years (coming from the games industry), and along the way, I curated a list of the tools, courses, books, papers, and models that actually helped me understand things.
I turned this into a GitHub repo:
https://github.com/ArturoNereu/AI-Study-Group
If youโre just getting started, I recommend:
๐ Deep Learning โ A Visual Approach: https://www.glassner.com/portfolio/deep-learning-a-visual-approach
๐ฅ Dive into LLMs with Andrej Karpathy: https://youtu.be/7xTGNNLPyMI?si=aUTq_qUzyUx36BsT
๐ง The ๐ค Agents course](https://huggingface.co/learn/agents-course/
The repo has grown with help from the community (Reddit, Discord, etc.) and Iโll keep updating it.
If you have any favorite resources, Iโd love to include them.
I turned this into a GitHub repo:
https://github.com/ArturoNereu/AI-Study-Group
If youโre just getting started, I recommend:
๐ Deep Learning โ A Visual Approach: https://www.glassner.com/portfolio/deep-learning-a-visual-approach
๐ฅ Dive into LLMs with Andrej Karpathy: https://youtu.be/7xTGNNLPyMI?si=aUTq_qUzyUx36BsT
๐ง The ๐ค Agents course](https://huggingface.co/learn/agents-course/
The repo has grown with help from the community (Reddit, Discord, etc.) and Iโll keep updating it.
If you have any favorite resources, Iโd love to include them.