Alteryx Designer Desktop Discussions

Computer Vision - PDF to Text (table extract)

jerryjiang0413
5 - Atom

Hi guys,

I have a PDF table (attached) that I would like to extract into Excel for further analysis. I used the PDF to Text tool with Alteryx Table selected as the output option, but it doesn't give me the result I want. Is there a better way to extract PDF table information into Excel?

The ideal Excel template:

Related Scope | Current year 2012 | Public Sector | Private Sector | Others
Overall Opinion | NI | S | NI | NI
Control Environment | S | S | S | S
Lorem ipsum Lorum ipsum Lorum ipsum | S | U | N/A | N/A

1 REPLY
BS_THE_ANALYST
14 - Magnetar

@jerryjiang0413 interesting problem. I can see the issue. The PDF tool seems happy to scrape all the text from the PDF, but it is struggling with the table contents. Could it be that they are actually images?

All the best,

BS
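
One rough way to check the "table contents are images" guess outside Alteryx is to count how many image and font declarations are visible in the raw PDF bytes. The Python below is only a sketch under stated assumptions: it is not an Alteryx feature, the function name `count_pdf_markers` is made up for illustration, and it only sees dictionaries that are stored uncompressed (newer PDFs can hide them inside compressed object streams, in which case the counts will be too low). A high image count with few fonts is a hint, not proof, that the table was embedded as pictures and needs OCR rather than plain text extraction.

```python
import re

# Image XObjects carry "/Subtype /Image" in their dictionary.
IMAGE_MARKER = re.compile(rb"/Subtype\s*/Image\b")
# Font dictionaries carry "/Type /Font"; \b keeps "/FontDescriptor" from matching.
FONT_MARKER = re.compile(rb"/Type\s*/Font\b")


def count_pdf_markers(pdf_bytes: bytes) -> dict:
    """Count image and font declarations visible in the raw PDF bytes."""
    return {
        "images": len(IMAGE_MARKER.findall(pdf_bytes)),
        "fonts": len(FONT_MARKER.findall(pdf_bytes)),
    }


# Tiny hand-written fragment standing in for a real file (illustration only).
sample = (
    b"1 0 obj << /Type /XObject /Subtype /Image /Width 10 >> endobj\n"
    b"2 0 obj << /Type /Font /Subtype /Type1 >> endobj\n"
    b"3 0 obj << /Type /FontDescriptor >> endobj\n"
)
counts = count_pdf_markers(sample)
print(counts)
```

For a real file, read it in binary mode and pass the bytes in, for example `count_pdf_markers(open("report.pdf", "rb").read())`, where `report.pdf` is a placeholder name.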
