1gramstop10k.csv
README.md
bigrams.csv
extract.py
tetragrams.csv
transform.py
trigrams.csv

README.md

Extraction and Transformation of French n-grams

This section contains the code used to extract French n-grams.

The source data, 1gramstop10k.csv, was obtained from https://github.com/orgtre/frenchngrams. It is a frequency table of the most common French words, "built from the top 400 most popular French books on Project Gutenberg (as of 2016-12-04)".

I then used ChatGPT to create the extract.py and transform.py scripts that perform the data wrangling. The initial prompts are included in the scripts. Note that this code may be far from optimal, but it does the job.
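
For readers unfamiliar with the term, here is a minimal sketch of what counting n-grams involves. It is not the code in extract.py or transform.py, whose contents are not reproduced here. The helper name `count_ngrams` and the sample sentence are made up for illustration only.

```python
from collections import Counter


def count_ngrams(tokens, n):
    """Count every run of n consecutive tokens (hypothetical helper)."""
    # Zipping n staggered views of the token list yields each n-gram as a tuple.
    windows = zip(*(tokens[i:] for i in range(n)))
    return Counter(" ".join(window) for window in windows)


# Made-up sample text, not taken from the source data.
tokens = "le chat dort et le chat mange".split()

bigrams = count_ngrams(tokens, 2)
trigrams = count_ngrams(tokens, 3)
tetragrams = count_ngrams(tokens, 4)

# Show the most frequent bigrams first.
for ngram, count in bigrams.most_common(3):
    print(ngram, count)
```

The same function covers bigrams, trigrams and tetragrams by changing `n`, which is one plausible reason to keep the window size a parameter rather than writing a separate routine for each.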
