Skip to content
This repository has been archived by the owner on Dec 1, 2017. It is now read-only.

Automatically tag content on GOV.UK using machine learning (experiment, not live)

License

Notifications You must be signed in to change notification settings

alphagov/govuk-auto-tagging

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

AutoTagger

What is it?

This is an experimental automatic content tagger for GOV.UK pages based on the Ankusa gem, using the naive Bayes algorithm.

It attempts to determine correct tags for a page by learning from other, manually tagged pages.

How to use it?

To run the script locally, run ./bin/tag.rb file_name in your command line.

The file you pass to the script should be in CSV format with three columns - URL, tag and content. For an example, see the sample_content.csv file.

How to run the tests?

Just run rspec in the command line (which will work once the tests are written).

License

See the LICENSE file.

About

Automatically tag content on GOV.UK using machine learning (experiment, not live)

Topics

Resources

License

Code of conduct

Security policy

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages