Skip to content

Comparing Taiwanese Media on China and US through NLP

License

Notifications You must be signed in to change notification settings

ethanelasky/tmc

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

76 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Data H195 Honors Thesis - Taiwan Media Comparison

Ethan Elasky

Binder

This is my thesis for my Data Science honors course. It aims to compare Taiwanese media coverage of America and China using natural language processing techniques. This is informative in understanding polarization and media bias in Taiwan, which is largly split along differing attitudes about the threat posed to the island by the mainland's Communist regime. For more detailed information, see my Pre-Analysis Plan. Lucy Li is my advisor.

Directory Structure

This repo contains a few subfolders which contain the elements of this project.

Folder Description
data scraped article links in parquet and CSV format
images image files used in the Jupyter Notebooks
analysis Jupyter Notebooks containing data analysis
scrapers website spiders (Scrapy), scrapers (Playwright), and preprocessing scripts

About

Comparing Taiwanese Media on China and US through NLP

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published