Natural Language Processing
Word vector illustration showing text classification and NLP concepts

Sarcasm Detection with Transformers: A Cautionary Tale

Learn how dataset bias can lead to misleading results in NLP: a sarcasm detection model that actually learned to …

Computational Social Science
Top features for Armed Forces and National Security policy classification showing veterans, defense, military keywords

Classifying Congressional Bills with Machine Learning

Testing ML classification of congressional bills by policy area. Comparing Naive Bayes, Logistic Regression, and XGBoost …

Computational Social Science
Top features for Social Welfare policy classification showing social, poverty, benefits keywords

Congressional Knowledge Graph & Policy Classification

A knowledge graph of 47,000+ bills from Congress.gov, with sponsor networks and 87% policy classification accuracy. …

Natural Language Processing
One-hot encoding and count vectorization visualization showing sparse vector representation

Count Vectorization with scikit-learn in Python

Learn count vectorization in Python: convert text to numerical vectors using scikit-learn's CountVectorizer with …