|
Hi, I’d like to open a pdf, extract the text, but also determine the color of the text during extraction.
For example, if the pdf text is 1 page, and nicely column formatted (maybe 3 columns with whitespace in between), maybe the 1st column is numbers, if the number is ‘green’, it has one meaning, but if it’s blue, a different meaning…etc…The 2nd column & 3rd columns the same thing –depending on the color of the text, the meaning is different
Thanks ------------------------------------------------------------------------------ Live Security Virtual Conference Exclusive live event will cover all the ways today's security and threat landscape has changed and how IT managers can respond. Discussions will include endpoint security, mobile security and the latest in malware threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/ _______________________________________________ iText-questions mailing list [hidden email] https://lists.sourceforge.net/lists/listinfo/itext-questions iText(R) is a registered trademark of 1T3XT BVBA. Many questions posted to this list can (and will) be answered with a reference to the iText book: http://www.itextpdf.com/book/ Please check the keywords list before you ask for examples: http://itextpdf.com/themes/keywords.php |
|
Op 1/05/2012 17:37, Lowe, Wyan schreef:
That's not impossible, but iText doesn't offer this kind of functionality out of the box. You'll have to study the classes in the com.itextpdf.text.pdf.parser package. ------------------------------------------------------------------------------ Live Security Virtual Conference Exclusive live event will cover all the ways today's security and threat landscape has changed and how IT managers can respond. Discussions will include endpoint security, mobile security and the latest in malware threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/ _______________________________________________ iText-questions mailing list [hidden email] https://lists.sourceforge.net/lists/listinfo/itext-questions iText(R) is a registered trademark of 1T3XT BVBA. Many questions posted to this list can (and will) be answered with a reference to the iText book: http://www.itextpdf.com/book/ Please check the keywords list before you ask for examples: http://itextpdf.com/themes/keywords.php |
|
Look at how getTextRenderMode() works - something similar can be done for text color. If you wind up enhancing the parser to include this, shoot me a patch file and I'll review and incorporate it into the code base.
|
| Powered by Nabble | Edit this page |
