Document Processing
Diagram showing page stream segmentation workflow: an input stream of pages is processed through binary classification of page pairs to predict document breaks, producing segmented output documents

LLMs for Page Stream Segmentation

Enhanced TabMe benchmark for page stream segmentation, creating TabMe++, showing fine-tuned decoder-based LLMs …...

Computational Chemistry

GTR-CoT: Graph Traversal as Visual Chain of Thought for Molecular Structure Recognition

GTR-CoT uses graph traversal chain-of-thought reasoning to improve optical chemical structure recognition....

Computational Chemistry

MolRec: Chemical Structure Recognition at CLEF 2012

MolRec achieves 95%+ accuracy on simple structures but struggles with complex diagrams, revealing rule-based OCSR …...

Computational Chemistry

MolRec: Performance Analysis at TREC 2011 Chemical Track

MolRec achieves 95% accuracy on 1000 molecular diagrams at TREC 2011 with detailed failure analysis....

Computational Chemistry

SubGrapher: Visual Fingerprinting of Chemical Structures

SubGrapher creates molecular fingerprints from images via functional group segmentation, enabling retrieval without full …...

Computational Chemistry

αExtractor: Automatic Chemical Information Extraction from Biomedical Literature

αExtractor uses ResNet-Transformer to extract chemical structures from literature images, including noisy and hand-drawn …...

Computational Chemistry

ChemInfty: Robust Segmentation and Recognition of Chemical Structures in Low-Quality Patent Images

Fujiyoshi et al.'s segment-based approach for recognizing chemical structures in challenging Japanese patent images with …...

Computational Chemistry

Img2Mol: Accurate SMILES Recognition from Molecular Graphical Depictions

Clevert et al.'s two-stage CNN approach for converting molecular images to SMILES using CDDD embeddings and extensive …...

Computational Chemistry

MolNexTR: A Generalized Deep Learning Model for Molecular Image Recognition

Chen et al.'s dual-stream encoder approach for robust molecular structure recognition from diverse real-world images …...

Computational Chemistry

OSRA: Optical Structure Recognition for Chemical Information Extraction

Filippov & Nicklaus's open-source rule-based system for converting molecular structure images into machine-readable …...

Document Processing
A colored molecule with annotations, representing the diverse drawing styles found in scientific papers that OCSR models must handle.

MolParser-7M and WildMol Datasets for Robust Chemical Structure Recognition

MolParser-7M is a 7.7M-pair dataset for molecule-to-text conversion, featuring real-world images and complex structures …

Computational Chemistry

MolParser: End-to-end Visual Recognition of Molecule Structures in the Wild

MolParser converts molecular images from scientific documents to machine-readable formats using E-SMILES....