Skip to content

Commit

Permalink
Removed int() from src_doc_id.
Browse files Browse the repository at this point in the history
  • Loading branch information
ontoligent committed Mar 19, 2020
1 parent fad3ab1 commit 408d48b
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion polite/polite.py
Original file line number Diff line number Diff line change
Expand Up @@ -105,7 +105,7 @@ def import_table_doctopic(self):
doctopic = pd.read_csv(src_file, sep='\t', header=None)
doc = pd.DataFrame(doctopic.iloc[:, 1])
doc.columns = ['doc_tmp']
doc['src_doc_id'] = doc.doc_tmp.apply(lambda x: int(x.split(',')[0]))
doc['src_doc_id'] = doc.doc_tmp.apply(lambda x: x.split(',')[0])
doc['doc_label'] = doc.doc_tmp.apply(lambda x: x.split(',')[1])
doc = doc[['src_doc_id', 'doc_label']]
doc.index.name = 'doc_id'
Expand Down

0 comments on commit 408d48b

Please sign in to comment.