Hi beautiful soup is used for scraping the data from web page. But how could we scrap like example table from the pdf which is stored locally in the machine.
How could we scrap the data from PDF
Hi @shreyashgupta68 you could use tabula-py python library
tabula-py
is a Python library , which can read tables in a PDF. You can read tables from a PDF and convert them into a pandas DataFrame. tabula-py also enables you to convert a PDF file into a CSV, a TSV or a JSON file.
I hope I’ve cleared your doubt. I ask you to please rate your experience here
Your feedback is very important. It helps us improve our platform and hence provide you
the learning experience you deserve.
On the off chance, you still have some questions or not find the answers satisfactory, you may reopen
the doubt.