Computational Chemistry
Precision and recall comparison of 8 OCSR tools on patent images

Benchmarking Eight OCSR Tools on Patent Images (2024)

Comprehensive evaluation of 8 optical chemical structure recognition tools using a newly curated dataset of 2,702 patent images. Proposes ChemIC, a ResNet-50 classifier to route images to specialized tools based on content type, demonstrating that no single tool excels at all tasks.

Computational Chemistry

Review of OCSR Techniques and Models (Musazade 2022)

This systematization paper traces the history of OCSR, comparing early rule-based systems like OSRA with modern deep learning approaches like DECIMER. It highlights the shift from image classification to image captioning and identifies critical gaps in dataset standardization and evaluation metrics.

Computational Chemistry

A Review of Optical Chemical Structure Recognition Tools

This paper reviews three decades of OCSR development, transitioning from rule-based heuristics to early deep learning approaches. It includes a benchmark study comparing the performance of three open-source tools (OSRA, Imago, MolVec) on four diverse datasets.

Computational Chemistry
Overview of CLEF-IP 2012 tasks including patent passage retrieval, flowchart recognition, and chemical structure extraction

CLEF-IP 2012: Patent and Chemical Structure Benchmark

A resource paper detailing the CLEF-IP 2012 benchmarking lab. It introduces specific IR tasks for patent processing along with ground-truth datasets.

Computational Chemistry

Overview of the TREC 2011 Chemical IR Track Benchmark

This resource paper details the third TREC Chemical IR campaign, introducing a novel Image-to-Structure task and analyzing 36 runs from 9 groups to benchmark chemical information retrieval.

Computational Social Science
NOMINATE spatial plot showing Senate vote on Balanced Budget Amendment (1995) with legislators positioned on liberal-conservative dimension

A Spatial Model for Legislative Roll Call Analysis

This paper introduces NOMINATE, a probabilistic spatial model that recovers metric coordinates for legislators and roll calls from nominal voting data, demonstrating that a single liberal-conservative dimension explains the vast majority of Congressional voting behavior.

Computational Chemistry

OCSR Methods: A Taxonomy of Approaches

A comprehensive categorization of OCSR methods, organizing techniques by their fundamental approach: deep learning, traditional ML, and rule-based systems.

Research Methods
Abstract visualization of seven basis vectors represented as curved lines with dots, centered around a psi symbol

AI & Physical Sciences Taxonomy: A Seven-Vector Framework

Personal working taxonomy for categorizing papers as Method, Theory, Resource, Systematization, Position, Discovery, or Application contributions using a superposition model.

Planetary Science
Venus as seen by Mariner 10, showing swirling cloud patterns in the dense atmosphere

Venus Evolution Through Time: Key Questions and Missions

A comprehensive 2023 roadmap for Venus exploration synthesizing open questions about the planet’s evolution from potentially habitable to extreme greenhouse state, detailing the coordinated VERITAS, DAVINCI, and EnVision missions planned for the 2030s and identifying future technology requirements for answering fundamental habitability questions.