Home > Forum Home > Developing Business Administration Solutions > Importing Data from PDF Files Share

Importing Data from PDF Files

Excel Help for Importing Data From Pdf Files in Developing Business Administration Solutions


Forum TopicPost Reply Login

Importing Data From Pdf Files

Rate this:
(4.2/5 from 24 votes)
HappyBusiness Spreadsheets has developed a free Excel program to extract and import PDF data into Excel which can be downloaded and used without restriction.

There is a common need to extract and import specific data from PDF files into Excel. Since Excel does not natively support the reading of PDF content, utilities are needed to convert the PDF file content for the Excel format. Several commercial applications accomplish this; however it is often the case where only specific data is required to be imported from multiple PDF files into one structured format.

We created such an application by using VBA code in conjunction with an open source PDF to Text conversion utility, which can be found at Foolabs.

[Download the free PDF data import Excel program here]

Update: 19-Feb-2012
A new version also extracts multiple instances of the same data matching pattern from one or more PDF files.

The program relies on the conversion utility (included in the download) and all PDF files to reside in the same directory as the Excel application. Text or data to extract are defined in the Control sheet by specifying start text, end text and multiple replacements routines with wildcard support. This enables flexibility to obtain comparable data from multiple PDF files based on patterns independent of different PDF file structures.

As many extraction rules as required can be set in order to create a table of information imported by extraction rule and PDF file name. Information on how to set up rules is available within the Excel application with a help icon and cell comments. The VBA code is commented and open for modification.

Any improvements or new features to the code are welcome to be posted here so that we can update the download version to the benefit of everyone.
 Excel Business Forums Administrator
 Posted by on
 
Replies - Displaying 1 to 10 of 88Order Replies By: Most recent | Chronological | Highest Rated
Happy
Rate this:
(3/5 from 1 vote)
The VBA code is open for modification and integration into your projects.
 Excel Business Forums Administrator
 Posted by on
Confused
Rate this:
(4/5 from 2 votes)
Thank you so much for creating such useful program. Could i get code written for this program using VBA? it will be very beneficial to me. 

Warm Regards.

 Posted by on
Confused
Rate this:
(3/5 from 1 vote)
Does anyone know how to get this macro to work in sharepoint? It's not pulling the data from the pdf's.

Thanks
 LEWIS EVANS
 Posted by on
Fedup
Rate this:
(3/5 from 1 vote)
The PDF files can be multiple pages and the resulting text extracted and analyzed includes the entire PDF content.

If your data is on the last page, you'll need to make sure that the start and end text for extraction is unique to that page so that isolates it in the extraction.
 Excel Business Forums Administrator
 Posted by on
Confused
Rate this:
(3/5 from 1 vote)
The data that I want to extract is at last but one page of invoices that are in pdf format. Your programs goes to only first page. Please help me ASAP.
 apache
 Posted by on
Grateful
Rate this:
(3/5 from 1 vote)
The importing routine processes all PDF files in the same folder for which the Excel file and extraction tool reside.

This was to keep the solution simple but of course the code can be altered to change the target directory.  The workaround is to create a new folder and move/copy all PDF files as well as Excel and .exe to it for processing. 
 Excel Business Forums Administrator
 Posted by on
Confused
Rate this:
(3/5 from 1 vote)
hi all,
This tool works perfectly. Is there a way to import data of only specified pdf in different location. while importing i should have option to select the pdf from a folder in another location other than the activeworkbook "import pdf to excel" sheet included folder.
 munna
 Posted by on
Confused
Rate this:
(3/5 from 1 vote)
We can extract multiple columns of data in one text block and then use the Text to Columns feature in Excel to separate the data into cells.
 Excel Business Forums Administrator
 Posted by on
Confused
Rate this:
(3/5 from 1 vote)
Is it possible to amment the script to allow multiple columns to be extracted from the pdf. e.g. invoice lines?
 Posted by on
Confused
Rate this:
(3/5 from 1 vote)
thats a real nice program! congrats i want to make my own website (something like http://exceltopdf.org/ ) maybe you can give me some tips on how this works :D
 Posted by on
 Displaying page 1 of 9 

Excel templates and solutions matched for Importing Data from PDF Files:

Solutions: Export MapPoint Waypoints Survey Data Analysis