
LLMs for Page Stream Segmentation
Enhanced TabMe benchmark for page stream segmentation, creating TabMe++, showing fine-tuned decoder-based LLMs …

GTR-CoT uses graph traversal chain-of-thought reasoning to improve optical chemical structure recognition.
MolRec achieves 95%+ accuracy on simple structures but struggles with complex diagrams, revealing rule-based OCSR …
MolRec achieves 95% accuracy on 1000 molecular diagrams at TREC 2011 with detailed failure analysis.
SubGrapher creates molecular fingerprints from images via functional group segmentation, enabling retrieval without full …
αExtractor uses ResNet-Transformer to extract chemical structures from literature images, including noisy and hand-drawn …
Fujiyoshi et al.'s segment-based approach for recognizing chemical structures in challenging Japanese patent images with …
Clevert et al.'s two-stage CNN approach for converting molecular images to SMILES using CDDD embeddings and extensive …
Chen et al.'s dual-stream encoder approach for robust molecular structure recognition from diverse real-world images …
Filippov & Nicklaus's open-source rule-based system for converting molecular structure images into machine-readable …

MolParser-7M is a 7.7M-pair dataset for molecule-to-text conversion, featuring real-world images and complex structures …
MolParser converts molecular images from scientific documents to machine-readable formats using E-SMILES.