
Analysis of parts of speech in headlines versus article preview text, as both appear on the New York Times print front page


shahanshahidnawaz/nyt-pos-analysis


nyt-pos-analysis

This is an analysis, using pandas, of the parts of speech in headlines versus article preview text, as both appear on the New York Times print front page.

This code uses data from 42 stories drawn from several New York Times print front pages from October-December 2021 to determine whether there are lexical differences between a story's headline and its preview text. I use the NLTK library to measure the frequency of various parts of speech in headlines and preview text, and find that proper nouns (NNP), singular nouns (NN), and non-third-person singular present-tense verbs (VBP) appear more frequently in headlines than in an article's front-page preview, at the 95% confidence level.
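To make the comparison concrete, here is a minimal sketch of how a tag's share in headlines could be compared with its share in preview text. It is not the code in this repository. The README does not say which statistical test was used, so the two-proportion z-test is an assumption, and the function names `tag_share` and `two_proportion_z` and the hand-tagged toy sentences are hypothetical. The input uses the list-of-`(token, tag)` shape that `nltk.pos_tag` returns, so the sketch runs with only the standard library.

```python
from collections import Counter
from math import erf, sqrt

# Hypothetical, hand-tagged toy data in the (token, tag) shape that
# nltk.pos_tag returns. These are NOT the 42 stories from the analysis.
headline_tagged = [
    ("Senate", "NNP"), ("Passes", "VBZ"), ("Budget", "NN"), ("Bill", "NN"),
]
preview_tagged = [
    ("The", "DT"), ("Senate", "NNP"), ("voted", "VBD"), ("on", "IN"),
    ("Tuesday", "NNP"), ("to", "TO"), ("pass", "VB"), ("a", "DT"),
    ("budget", "NN"), ("bill", "NN"), (".", "."),
]


def tag_share(tagged, tag):
    """Return (occurrences of tag, total tokens) for (token, tag) pairs."""
    counts = Counter(t for _, t in tagged)
    return counts[tag], len(tagged)


def two_proportion_z(x1, n1, x2, n2):
    """Two-sided two-proportion z-test (assumed test); returns (z, p_value)."""
    pooled = (x1 + x2) / (n1 + n2)
    se = sqrt(pooled * (1 - pooled) * (1 / n1 + 1 / n2))
    if se == 0:
        # The tag appears in none (or all) of the tokens: no detectable difference.
        return 0.0, 1.0
    z = (x1 / n1 - x2 / n2) / se
    # Standard normal CDF expressed through the error function.
    p_value = 2 * (1 - 0.5 * (1 + erf(abs(z) / sqrt(2))))
    return z, p_value


for tag in ("NNP", "NN", "VBP"):
    x1, n1 = tag_share(headline_tagged, tag)
    x2, n2 = tag_share(preview_tagged, tag)
    z, p = two_proportion_z(x1, n1, x2, n2)
    verdict = "significant" if p < 0.05 else "not significant"
    print(f"{tag}: headline {x1}/{n1}, preview {x2}/{n2}, "
          f"z={z:.2f}, p={p:.3f} ({verdict} at the 95% level)")
```

With real data, the toy lists would be replaced by the output of `nltk.pos_tag` on tokenized headlines and previews. A p-value below 0.05 corresponds to the 95% confidence level mentioned above.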


This result supports the idea that headlines are a distinct lexical form, likely because they are among the first elements a reader encounters in a print newspaper and so must capture the reader's attention. Preview text, by contrast, can afford to be more conventional, since the "information scent" has already been provided by the time the reader reaches it.
