nyt-pos-analysis

This is an analysis, using pandas, of parts of speech of text between headlines and an article's preview as both appear on the New York Times print frontpage.

This code uses data from 42 stories from several New York Times print front pages from October-December 2021 and ascertains if there are lexical differences between a story's headline and preview text. I use the NLTK library to determine the frequency of various parts of speech in headlines and preview text and find that plural nouns (NNP), singular nouns (NN), and singular verbs (VBP) appear more frequently in headlines than they do in an article's front page preview, at the 95% confidence level.

This result, then, supports the idea that headlines are a unique lexical form likely because they are one of the first elements in a print newspaper a reader interacts with and so their purpose is to capture a reader's attention (while normal text, on the other hand, can afford to be more normal since "information scent" has already been disseminated).

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
NYT Frontpage Headlines Dataset.xlsx		NYT Frontpage Headlines Dataset.xlsx
NYTHeadlineAnalysis.ipynb		NYTHeadlineAnalysis.ipynb
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

nyt-pos-analysis

About

Releases

Packages

Languages

shahanshahidnawaz/nyt-pos-analysis

Folders and files

Latest commit

History

Repository files navigation

nyt-pos-analysis

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages