The purpose of this project is to create time efficiency and reduce the margin of error when working with multiple documents.
Process Summarized:
Obtain key words from repetitive formatted documents, then extract as a Dataframe to an .xlsx. When working with multiple documents of the same format, it can be tedious to repeat the same task.
Link to Github Repository:
https://github.com/aeonx/ExtractKeywordsFromPDFdocuments
Download
0 formats
No download links available.
Extract keywords from multiple pdf files using Python | NatokHD