-
Notifications
You must be signed in to change notification settings - Fork 0
Finding the right person for the right role is crucial to the candidate and also to the company’s success. Talent acquisition is time taking and complex for the HR team and their task becomes much more difficult based on the fact that the market is humongous and it keeps increasing. The HR requires lot of domain knowledge based on the role he is…
RohanParkar/Resume-Retrieval-and-Classification
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
==================================================================================================================== Resume Retrieval and Classification System ==================================================================================================================== This system is divided into two sections: A. Resume Retrieval using inverted index. This section takes a collection of resume documents in .txt format and converts it into inverted index. The input is in the form of query entered by the user. The resumes relevant to the input query are retrieved. The user can select number of the document id from displayed list to view the resume. The user can segregate top5 resumes based on the query input. This is achieved using ranked retrieval. B. Resume Classification using Vector Space Model This section is capable of classifying the document provided by the user into 25 different resume profiles classes.This is done using the vector space model annd cosine similarity. The program takes data from a .csv file and combines all the resume data regarding a particular class together. The resume(in txt format) provided by the user is checked for cosine similarity with every resume profile class. The highest ranked class is determined as the resume profile for the document(resume). ------------------------------------------------------------------------------------------------------------------- Importing the project ------------------------------------------------------------------------------------------------------------------- 1.Download and install IntelliJ Idea Community Edition. 2.Download the'Team04_CheckPoint4.zip' file and unzip it at a desired location. 3.Select the open option at the IntelliJ Idea welcome window or select File-->Open and select the folder 'Resume Retrieval and Classification' from the location you unzipped the zip file. 4.Compile and run the main.java file to run the program. ------------------------------------------------------------------------------------------------------------------- How to use the user interface? ------------------------------------------------------------------------------------------------------------------- 1.The main gui window has two options Resume Classification and Resume Retrieval. 2.Press the button to choose the option. 3.On clicking resume classification or retrieval the respective window will open. For resume retrieval : 1. Enter the skills you want to find in a resume. 2. Hit the search button to get the list of matching resumes. 3. To view the listed resume select the number from the dropdown list below and press display. 4. To get the top 5 matching resumes hit the top 5 resumes button. You can find them in the Ranked Resume folder. 5. Refresh to search again. For resume classification: 1.Browse and select a resume in text format. 2. Hit the classify button to get the profile of the resume. 3. View the resume profile at the bottom of the window. 4. Refresh to search again. ------------------------------------------------------------------------------------------------------------------- Classes in Resume Retrieval and Classification ------------------------------------------------------------------------------------------------------------------- #Main.java It has the main class to run the program along with user interface. #Inverted_Index_Resume.java It has the main logic to generate the inverted index and answer the user queries. #Vector_Space_resume.java It has the main logic to generate the vector space model and classify the input resume. #Rank_Search.java It has the main logic to generate the vector space model and perform ranked retrieval. #Merge.java It is useful in processing the posting list of the inverted index to answer user queries. #ParserA.java It performs the natural language processing the data. #FileOpen.java It opens the resume selected by the user from the retrived resume list. #ReadFiles.java It is used to read the input data and stop words file. #Stemmer Performs porter's stemming on the input provided. #Doc.java Tt is a data structure to store document ids and their term weights. #ParserB.java It cleans the data and removes the stop words. ===================================================================================================================
About
Finding the right person for the right role is crucial to the candidate and also to the company’s success. Talent acquisition is time taking and complex for the HR team and their task becomes much more difficult based on the fact that the market is humongous and it keeps increasing. The HR requires lot of domain knowledge based on the role he is…
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published