Time Series Forecasting
Forecasting comparison of different neural architectures on the Multiscale Lorenz-96 system

Optimizing Sequence Models for Dynamical Systems

Ablation study deconstructing sequence models. Attention-augmented Recurrent Highway Networks outperform Transformers on …...

Document Processing
Diagram showing page stream segmentation workflow: an input stream of pages is processed through binary classification of page pairs to predict document breaks, producing segmented output documents

LLMs for Page Stream Segmentation

Enhanced TabMe benchmark for page stream segmentation, creating TabMe++, showing fine-tuned decoder-based LLMs …...

Computational Chemistry

GTR-CoT: Graph Traversal as Visual Chain of Thought for Molecular Structure Recognition

GTR-CoT uses graph traversal chain-of-thought reasoning to improve optical chemical structure recognition....

Computational Chemistry

αExtractor: Automatic Chemical Information Extraction from Biomedical Literature

αExtractor uses ResNet-Transformer to extract chemical structures from literature images, including noisy and hand-drawn …...

Computational Chemistry

MolNexTR: A Generalized Deep Learning Model for Molecular Image Recognition

Chen et al.'s dual-stream encoder approach for robust molecular structure recognition from diverse real-world images …...

Computational Chemistry

MolParser: End-to-end Visual Recognition of Molecule Structures in the Wild

MolParser converts molecular images from scientific documents to machine-readable formats using E-SMILES....

Computational Chemistry
Adaptive grid merging visualization for benzene molecule showing multi-resolution spatial discretization

Beyond Atoms: Enhancing Molecular Pretrained Representations with 3D Space Modeling

Lu et al. introduce SpaceFormer, a Transformer that models entire 3D molecular space (not just atoms) for superior …...

Document Processing
Stream accuracy versus relative throughput for Mistral-7B and XGBoost models

LLMs for Insurance Document Automation

LLM applications for insurance document automation using parameter-efficient fine-tuning and analysis of calibration …...

Computational Chemistry

Invalid SMILES are Beneficial Rather than Detrimental to Chemical Language Models

Skinnider (2024) shows that generating invalid SMILES actually improves chemical language model performance through …...

Natural Language Processing
Word vector illustration showing text classification and NLP concepts

Sarcasm Detection with Transformers: A Cautionary Tale

Learn how dataset bias can lead to misleading results in NLP: a sarcasm detection model that actually learned to …

Computational Chemistry

IMG2SMI: Translating Molecular Structure Images to SMILES

Campos & Ji's method for converting 2D molecular images to SMILES strings using Transformers and SELFIES representation....

Computational Social Science
Diagram of the Universal Message schema showing fields like ID, Text, Author, and Reply Sets that normalize data across platforms

PyConversations: Social Media Conversational Analysis

Undergraduate thesis exploring representation learning for social media text and developing tools for cross-platform …