absurd-words

A system and website to score the absurdity and uselessness of English words, with other languages possibly to come.

API Routes

/getWord/
- Returns score and datapoints for any given word
- Accepts a boolean parameter, calculate
  - If true, uses external sources to calculate the score
  - If false, uses only records already in the database
- Example Use: /getWord/absurd
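A minimal sketch of building a request URL for this route. The host `https://example.invalid` and the helper name `build_get_word_url` are hypothetical, and passing `calculate` as a query-string value of `true` or `false` is an assumption; the route description above only says that the parameter is a boolean.

```python
from urllib.parse import quote, urlencode

# Hypothetical host: the real deployment URL is not given in this README.
BASE_URL = "https://example.invalid"


def build_get_word_url(word, calculate=None, base_url=BASE_URL):
    """Return the URL for /getWord/<word>, optionally with ?calculate=..."""
    # Percent-encode the word so spaces or slashes cannot alter the path.
    path = f"{base_url}/getWord/{quote(word, safe='')}"
    if calculate is None:
        return path
    # Assumption: the boolean travels as a lowercase query-string value.
    query = urlencode({"calculate": "true" if calculate else "false"})
    return f"{path}?{query}"


print(build_get_word_url("absurd"))
print(build_get_word_url("absurd", calculate=True))
```

Leaving `calculate` unset keeps the URL identical to the example use above.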
/words
- Parameters
  - results
    - integer up to 50, number of results to return
    - default 20
  - startIndex
    - integer, index of the first result to return (for loading further pages of results)
    - default 0
  - sortMethod
    - string, options:
      - score
      - scoreInverse
      - a-z
      - z-a
      - humour
      - humourInverse
      - util
      - utilInverse
    - default score
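The parameters above can be combined into a query string as in the sketch below. The helper name `build_words_query` is hypothetical, and two details are assumptions rather than documented behaviour: that the parameters are sent as query-string values, and that `results` must be at least 1.

```python
from urllib.parse import urlencode

# Sort options exactly as listed for the sortMethod parameter.
SORT_METHODS = {
    "score", "scoreInverse", "a-z", "z-a",
    "humour", "humourInverse", "util", "utilInverse",
}


def build_words_query(results=20, start_index=0, sort_method="score"):
    """Return the /words path with validated query parameters."""
    # Assumption: a lower bound of 1; the README only states "up to 50".
    if not 1 <= results <= 50:
        raise ValueError("results must be between 1 and 50")
    if start_index < 0:
        raise ValueError("startIndex must not be negative")
    if sort_method not in SORT_METHODS:
        raise ValueError(f"unknown sortMethod: {sort_method}")
    params = {
        "results": results,
        "startIndex": start_index,
        "sortMethod": sort_method,
    }
    return "/words?" + urlencode(params)


# First page with the defaults, then the next page of 20 sorted by humour.
print(build_words_query())
print(build_words_query(start_index=20, sort_method="humour"))
```

Advancing `startIndex` by the value of `results` on each request would page through the list.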

Scoring System

Full Equation:

Phonemic Humour

Phonemic humour is calculated as described in Reference #1, using the Shannon entropy of unigram letter frequencies. To acquire letter frequencies I am using the 100,000 most frequently used words in English according to Rachael Tatman's analysis of the Google Web Trillion Word Corpus (Ref. #2).

where p_i is the probability of a letter being present in a word and l is the number of letters in the word.
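As a rough illustration of an entropy-based letter score, the sketch below estimates letter probabilities from a word-count table and sums the Shannon term -p_i log2(p_i) over the letters of a word. It is not the project's exact equation: the function names are hypothetical, estimating p_i as a letter's share of all letters (weighted by word count) is an assumption, and so is dividing the sum by l.

```python
import math
from collections import Counter


def letter_probabilities(word_counts):
    """Estimate unigram letter probabilities from a {word: count} table."""
    totals = Counter()
    for word, count in word_counts.items():
        for letter in word.lower():
            if letter.isalpha():
                # Weight each letter by how often its word occurs.
                totals[letter] += count
    grand_total = sum(totals.values())
    return {letter: n / grand_total for letter, n in totals.items()}


def phonemic_entropy(word, probabilities):
    """Average Shannon term -p*log2(p) over the letters of a word."""
    # Letters with no estimated probability are skipped.
    letters = [c for c in word.lower() if c in probabilities]
    if not letters:
        return 0.0
    total = -sum(
        probabilities[c] * math.log2(probabilities[c]) for c in letters
    )
    # Assumption: normalise by l, the number of letters in the word.
    return total / len(letters)


# Toy corpus standing in for the real word-frequency list.
toy_counts = {"ab": 1, "a": 2}
probs = letter_probabilities(toy_counts)
print(probs)
print(phonemic_entropy("ab", probs))
```

With the real frequency list, `toy_counts` would be replaced by the word counts from Ref. #2.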

Word Utilization

Word utilization is calculated using Google NGram Viewer (Ref. #3), and is based on the word's average frequency from 1990 (or from the first year the word was observed, if later) to 2019. Averaging over this window adjusts for inflated frequencies caused by short-lived trends, which have become even more prevalent with the internet.

where f is the set of word frequencies for each year, and y is the total number of years.
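The averaging step can be sketched as follows, taking yearly frequencies as a plain mapping rather than querying the NGram Viewer. The function name `word_utilization` is hypothetical, and two choices are assumptions: the window starts at the first year inside 1990 to 2019 with a non-zero frequency, and only years present in the mapping count towards y.

```python
def word_utilization(yearly_frequency, start_year=1990, end_year=2019):
    """Mean yearly frequency from the first observed year to end_year."""
    # Years inside the window in which the word actually appears.
    observed_years = [
        year
        for year, freq in yearly_frequency.items()
        if start_year <= year <= end_year and freq > 0
    ]
    if not observed_years:
        return 0.0
    first_year = min(observed_years)
    # f: the frequencies for each year; y: the number of years averaged.
    f = [
        freq
        for year, freq in yearly_frequency.items()
        if first_year <= year <= end_year
    ]
    y = len(f)
    return sum(f) / y


# Illustrative numbers only, not real NGram data.
sample = {1989: 5.0, 1990: 0.0, 1991: 2.0, 1992: 4.0}
print(word_utilization(sample))
```

In this sample the 1989 value falls outside the window and 1990 precedes the first observation, so only 1991 and 1992 are averaged.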

Word Ambiguity

Word ambiguity is based on the number of definitions given by WordNet (Ref. #4).

Represented as q in the equation.

Related Word Abundance

An abundance of related words suggests a more important word, so the hyponyms of a given word's synset are counted two layers deep, as given by WordNet (Ref. #4). A large number of hyponyms suggests that the word's meaning is applicable to many other words, increasing its importance and decreasing its absurdity.

where h1 is the number of hyponyms one edge away from the synset, and h2 is the number of hyponyms two edges away.
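A small sketch of counting the two hyponym layers on a toy graph. The function name `count_hyponym_layers` and the `hyponyms_of` mapping are hypothetical stand-ins for WordNet data; in practice the edges could come from a WordNet interface such as NLTK's `Synset.hyponyms()`. Counting each second-layer hyponym only once, and excluding any that already appear in the first layer, is an assumption.

```python
def count_hyponym_layers(synset, hyponyms_of):
    """Return (h1, h2): hyponyms one and two edges below a synset."""
    # Layer 1: direct hyponyms of the synset.
    first = set(hyponyms_of.get(synset, ()))
    # Layer 2: hyponyms of every layer-1 node.
    second = set()
    for child in first:
        second.update(hyponyms_of.get(child, ()))
    # Assumption: do not count a node twice across layers.
    second -= first
    second.discard(synset)
    return len(first), len(second)


# Toy hyponym graph, not real WordNet data.
toy_graph = {
    "dog": ["puppy", "hound"],
    "hound": ["beagle", "greyhound"],
    "puppy": [],
}
h1, h2 = count_hyponym_layers("dog", toy_graph)
print(h1, h2)
```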

Referenced Works

1. Westbury, C., et al. Telling the world’s least funny jokes: On the quantification of humor as entropy. Journal of Memory and Language (2015), http://dx.doi.org/10.1016/j.jml.2015.09.001
2. Tatman, Rachael. English Word Frequency: 1/3 Million Most Frequent English Words on the Web (2017), https://www.kaggle.com/rtatman/english-word-frequency
3. Google. Google NGram Viewer, http://books.google.com/ngrams
4. Princeton University. "About WordNet." Princeton University, 2010. https://wordnet.princeton.edu/
