This tutorial explains how to extract email addresses from a PDF file using two free tools. The first takes a PDF file directly and returns an Excel file containing every email address it finds. The second works on plain text, so it needs a workaround: you convert the PDF to text first, then extract the addresses from that text.
If you have a large PDF in which email addresses are mixed with other content such as text and images, these tools can pull out the addresses for you so you can use them however you want. They are especially handy for data scraping work. All you need is a PDF file that contains some email addresses to hand over to them.
How to Extract Emails from PDF:
PDF Mail Extractor
PDF Mail Extractor is the simplest of the two tools for extracting emails from a PDF. It takes a local PDF file, extracts all the email addresses from it, and saves them to an Excel file. The interface is not in English (the button labels are in French), but the workflow is straightforward enough that you should not find it difficult to use.
It is portable, so you can start using it right after downloading it from the link above. It is also open source, and the developer may add an English interface in a future release.
Here is how to use this tool to extract email addresses from a PDF for free.
Step 1: After downloading the software, run it. Then click the “Fichier local” button and select the PDF file from which you want to extract email addresses.
Step 2: The tool automatically extracts all the email addresses and saves them to an Excel file. You will find that Excel file in “C:\Users\YourUserName\PdfMailExtractor”. You can see this in the screenshot below.
That is how this free software extracts all the email addresses from a PDF file. It does not support batch processing, so you cannot process multiple PDF files at once. If you only need to extract email addresses from a single PDF file, though, it does the job without any problem.
JTextExtractor
JTextExtractor is another tool that you can use to extract email addresses from a PDF file, although it cannot process PDF files on its own. It takes a text file as input and extracts email addresses from that. So first convert your PDF to text, save the result as a TXT file, and then give that file to JTextExtractor. Several tools can extract the text from a PDF and save it as TXT, and any of them will do. Personally, I recommend PDF2Text Pilot for the conversion.
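If you are curious what extracting addresses from converted text involves, here is a minimal Python sketch. It is not how JTextExtractor works internally (I have not looked at its source); the pattern and the function name `extract_emails` are my own assumptions, and the regular expression is deliberately simple, so it catches common addresses rather than every form the email standards allow.

```python
import re

# Simplified pattern (an assumption for illustration): a local part, an "@",
# and a domain with at least one dot. It does not cover every valid address.
EMAIL_PATTERN = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9-]+(?:\.[A-Za-z0-9-]+)+")


def extract_emails(text):
    """Return the unique email addresses in `text`, in order of first appearance."""
    seen = set()
    result = []
    for match in EMAIL_PATTERN.findall(text):
        key = match.lower()  # treat addresses that differ only by case as duplicates
        if key not in seen:
            seen.add(key)
            result.append(match)
    return result


# Hypothetical sample text standing in for the TXT file converted from a PDF.
sample = "Contact alice@example.com or bob@example.org. Again: alice@example.com"
print(extract_emails(sample))
```

In practice you would read the converted TXT file into a string and pass it to `extract_emails` instead of the sample text.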
Like the tool above, JTextExtractor is open source. Assuming you have already converted your PDF to text, open JTextExtractor and click the “Add Files” button to add the text file. Note that this tool supports batch processing: to extract email addresses from multiple PDF files, convert them all to TXT and import them together.
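The same batch idea can be sketched in a few lines of Python: loop over every TXT file in a folder and collect the addresses found in each one. This is only an illustration under my own assumptions; the folder name `converted_text`, the function name `extract_emails_from_folder`, and the simplified pattern are all hypothetical and have nothing to do with JTextExtractor's actual code.

```python
import re
from pathlib import Path

# Simplified pattern (an assumption for illustration), not a full email validator.
EMAIL_PATTERN = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9-]+(?:\.[A-Za-z0-9-]+)+")


def extract_emails_from_folder(folder):
    """Map each .txt file name in `folder` to the sorted unique addresses found in it."""
    results = {}
    for path in sorted(Path(folder).glob("*.txt")):
        # errors="ignore" skips bytes that are not valid UTF-8 in converted files
        text = path.read_text(encoding="utf-8", errors="ignore")
        results[path.name] = sorted(set(EMAIL_PATTERN.findall(text)))
    return results


if __name__ == "__main__":
    # "converted_text" is a hypothetical folder holding the TXT files made from your PDFs.
    for name, emails in extract_emails_from_folder("converted_text").items():
        print(name, emails)
```

Files that contain no addresses still appear in the result with an empty list, which makes it easy to spot PDFs that did not convert well.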
Finally, click the Extract button in the toolbar to start the extraction. The tool lists all the email addresses it finds in its interface, as you can see in the screenshot above. Once the extraction is done, export the addresses to a CSV file using the “Export” option in the toolbar. You can even specify a custom delimiter for the CSV file.
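To show what a custom delimiter changes in the exported file, here is a small Python sketch that uses the standard `csv` module. The column names `source` and `email` and the function name `emails_to_csv` are assumptions of mine; the file JTextExtractor actually writes may be laid out differently.

```python
import csv
import io


def emails_to_csv(rows, delimiter=","):
    """Render (source_file, email) pairs as CSV text using the given delimiter."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, delimiter=delimiter, lineterminator="\n")
    writer.writerow(["source", "email"])  # header row (hypothetical column names)
    writer.writerows(rows)
    return buffer.getvalue()


# Hypothetical rows: the text file each address came from, and the address itself.
rows = [("report.txt", "alice@example.com"), ("invoice.txt", "bob@example.org")]
print(emails_to_csv(rows, delimiter=";"))
```

A semicolon or tab delimiter can be useful when the spreadsheet program you import into does not treat commas as the default separator.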
Final words
These are the best free tools for extracting emails from PDF files that I could find. Either one will extract email addresses from a PDF without any problem, and JTextExtractor can also handle multiple files at once, provided you convert the PDFs to text first. So, if you are looking for free software to extract emails from PDFs, give these a try.