sumbono/organization_website_matcher

organization_website_matcher

A module that matches an organization to its website by calculating a match_score between 0 and 100.

Get the data from Kaggle: https://www.kaggle.com/buraksimsek/linkedin-dataset?select=dump.csv

For each company, scrape its website and extract the website_title and copyright_statement where they exist.
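The extraction step could look roughly like the sketch below. It is not the repository's scraper: the class name `TitleAndCopyrightParser`, the function `extract_title_and_copyright`, and the rule "first text node containing ©, (c), or the word copyright" are all assumptions made for illustration. It works on an HTML string that has already been downloaded, so no particular HTTP client is implied.

```python
import re
from html.parser import HTMLParser


class TitleAndCopyrightParser(HTMLParser):
    """Hypothetical helper: collects the <title> text and the first
    text node that looks like a copyright notice."""

    def __init__(self):
        super().__init__()
        self._in_title = False
        self._skip_depth = 0  # greater than 0 while inside <script> or <style>
        self.website_title = None
        self.copyright_statement = None

    def handle_starttag(self, tag, attrs):
        if tag == "title":
            self._in_title = True
        elif tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False
        elif tag in ("script", "style") and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        text = " ".join(data.split())  # collapse runs of whitespace
        if not text or self._skip_depth:
            return
        if self._in_title:
            if self.website_title is None:
                self.website_title = text
        elif self.copyright_statement is None and re.search(
            r"©|\(c\)|copyright", text, re.IGNORECASE
        ):
            # Assumed heuristic: the first matching text node is the notice.
            self.copyright_statement = text


def extract_title_and_copyright(html):
    """Return (website_title, copyright_statement); either may be None."""
    parser = TitleAndCopyrightParser()
    parser.feed(html)
    parser.close()  # flush any buffered trailing text
    return parser.website_title, parser.copyright_statement
```

Calling `extract_title_and_copyright(page_html)` on each downloaded page gives the two fields for that company, with `None` standing in for a field the page does not have.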

Keep only the companies that have all three fields: company_name, website_title, and copyright_statement. Export them to company_webtitle_copyright.xlsx.
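A minimal sketch of this filter-and-export step follows. It assumes each scraped company is held as a dict keyed by the three field names. The helper names `has_all_fields`, `keep_complete_records`, and `export_to_xlsx` are hypothetical, and using pandas with an Excel engine such as openpyxl for the export is an assumption rather than something the repository states.

```python
REQUIRED_FIELDS = ("company_name", "website_title", "copyright_statement")


def has_all_fields(record):
    """True when every required field is present and not blank."""
    return all(str(record.get(field) or "").strip() for field in REQUIRED_FIELDS)


def keep_complete_records(records):
    """Drop any record that is missing one of the three fields."""
    return [record for record in records if has_all_fields(record)]


def export_to_xlsx(records, path="company_webtitle_copyright.xlsx"):
    """Write the records to an Excel file, one row per company."""
    # Assumed dependency: pandas plus an Excel writer engine (e.g. openpyxl).
    import pandas as pd

    frame = pd.DataFrame(records, columns=list(REQUIRED_FIELDS))
    frame.to_excel(path, index=False)
```

A typical call would be `export_to_xlsx(keep_complete_records(scraped))`, where `scraped` is the list of dicts built during scraping.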

Use the cnameMatcher class in organizationWebsiteMatcher.py to calculate matched_score for the records in company_webtitle_copyright.xlsx.
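To give a feel for what a 0-100 score can mean, here is an illustrative sketch built on `difflib.SequenceMatcher` from the standard library. It does not reproduce cnameMatcher, whose interface and scoring rules are defined in organizationWebsiteMatcher.py. The function names, the normalization, and the choice to take the better of the two comparisons are all assumptions.

```python
import re
from difflib import SequenceMatcher


def normalize(text):
    """Lowercase and reduce to space-separated alphanumeric tokens."""
    return " ".join(re.findall(r"[a-z0-9]+", text.lower()))


def similarity_0_100(a, b):
    """Character-level similarity of two strings, scaled to 0-100."""
    a, b = normalize(a), normalize(b)
    if not a or not b:
        return 0
    return round(100 * SequenceMatcher(None, a, b).ratio())


def sketch_match_score(company_name, website_title, copyright_statement):
    """Hypothetical score: the better of name-vs-title and name-vs-copyright."""
    return max(
        similarity_0_100(company_name, website_title),
        similarity_0_100(company_name, copyright_statement),
    )
```

Taking the maximum means that one strongly matching field is enough for a high score. A plain ratio like this penalizes long copyright statements that only contain the company name, so a real matcher would likely need more than this.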

I have provided an example in test.py.
