In "Data Catalysis: Facilitating Large-Scale Natural Language Data Processing" (presented at ISUC 2007) Patrick Pantel presents a USC project to extend such expertise to social scientists.While there may still be a gap between such tools and the needs and understanding of most of us, Daniel Hopkins and Gary King recently demonstrated the feasibility of "Extracting Systematic Social Science Meaning from Text," using machine learning to categorize millions of political texts (websites, blogs) with accuracy rates rivaling human coders.
H/T to Mark Liberman at Language Log for the link to Pantel's paper.
No comments:
Post a Comment