ds4sd
/

CodeFormulaV2

Model card Files Files and versions Community

CodeFormulaV2 / README.md

MatteoOmenetti's picture

Update README.md

15b5caf verified 3 days ago

|

history blame contribute delete

1.51 kB

	---
	license: cdla-permissive-2.0
	datasets:
	- ds4sd/SynthFormulaNet
	- ds4sd/SynthCodeNet
	tags:
	- ocr
	- code
	- math
	- formula
	---

	# Code Formula Model

	The Code Formula Model processes an image of a code snippet or formula at 120 DPI and outputs its content.

	- Code Snippets:
	The model identifies the programming language and outputs the code repsecting the indendation shown in the given image. The output format will be:<br>
	"<\_\<programming language\>\_> \<content of the image\>"<br>
	Example:<br>
	"<_Java_> System.out.println("Hello World.");"

	- Formulas:
	The model generates the corresponding LaTeX code.


	This model was trained using the following two datasets:
	1. https://huggingface.co/datasets/ds4sd/SynthFormulaNet
	2. https://huggingface.co/datasets/ds4sd/SynthCodeNet

	# References
	```bibtex
	@techreport{Docling,
	author = {Deep Search Team},
	month = {8},
	title = {{Docling Technical Report}},
	url={https://arxiv.org/abs/2408.09869},
	eprint={2408.09869},
	doi = "10.48550/arXiv.2408.09869",
	version = {1.0.0},
	year = {2024}
	}

	@article{nassar2025smoldocling,
	title={SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion},
	author={Nassar, Ahmed and Marafioti, Andres and Omenetti, Matteo and Lysak, Maksym and Livathinos, Nikolaos and Auer, Christoph and Morin, Lucas and de Lima, Rafael Teixeira and Kim, Yusik and Gurbuz, A Said and others},
	journal={arXiv preprint arXiv:2503.11576},
	year={2025}
	}

	```