Python package that parses PDF files and extracts text, tables, and images, essentially a AWS Textract alternative.
Python package that converts Excel formulas to Python code. Similar to the open-source library 'formulas' but instead of providing formulas and cells, this is built specifically for Excel tables. Excel tables are converted to pandas dataframes and the formulas are converted to pandas operations.