
Debopam Dey

pritamdeb68

AI & ML interests

AI , NLP and Data Science

Recent Activity

Organizations

None yet

pritamdeb68's activity

upvoted an article 9 days ago
Article

How to train a new language model from scratch using Transformers and Tokenizers

• 27
reacted to s-emanuilov's post with 🤝 about 2 months ago
Post
2578
Hey HF community! 👋

Excited to share Monkt - a tool I built to solve the eternal headache of processing documents for ML/AI pipelines.

What it does: Converts PDFs, Word, PowerPoint, Excel, Web pages or raw HTML into clean Markdown or structured JSON.

Great for:
✔ LLM training dataset preparation;
✔ Knowledge base construction;
✔ Research paper processing;
✔ Technical documentation management.

It has API access for integration into ML pipelines.

Check it out at https://monkt.com/ if you want to save time on document processing infrastructure.

Looking forward to your feedback!
  • 3 replies
New activity in rajpurkar/squad_v2 7 months ago