Quantcast

Unicode(UTF-8) to PDF

classic Classic list List threaded Threaded
4 messages Options
Reply | Threaded
Open this post in threaded view
|  
Report Content as Inappropriate
star

Unicode(UTF-8) to PDF

Satish
We are working in web application and it has to support all the language. The purpose of this application is to generate a PDF document of the data submitted by the end user.
The end user can enter data in the following format. (This description is just to make reader understand the situation)

1. He/ she has to enter some data in English and some in local language. And application is not going to validate those and also do not have any constraint which data which language(strongly depends on UTF-8 for web UI). But in PDF we need to show both.
2. User can enter mix of two three languages in single field also. In that case we need to print it as it is.
3. Numeric in all language along with English and special characters also allowed..

I tried one approach to print the data in Hindi + English language. But the following observations was seen
I used the mangal font for hindi..

bFont = BaseFont.createFont("mangal.ttf", BaseFont.IDENTITY_H, BaseFont.EMBEDDED);

1.  I got all the hindi characters, but not in proper format… eq. It print "hindi" as "hinadi".
2.  It ignored all the English characters out of that. but English numeric it printed.

Note: We are storing data in Unicode(utf8) format in oracle database. We are able to get all the data in correct format in web UI.

Please suggest some feasible way to deal with this requirment.
Reply | Threaded
Open this post in threaded view
|  
Report Content as Inappropriate
star

Re: Unicode(UTF-8) to PDF

iText Software
satish.napster wrote:
> 1.  I got all the hindi characters, but not in proper format… eq. It print
> "hindi" as "hinadi".

That's documented. So far we didn't get any working code contributions
supporting Hindi. iText is capable of making ligatures in Arabic, but
not for Hindic languages. You can help us (and many others) by
contributing a HindicLigaturizer class.

> 2.  It ignored all the English characters out of that. but English numeric
> it printed.

Does mangal.ttf contain English characters? Maybe you should use a
FontSelector.

> Note: We are storing data in Unicode(utf8) format in oracle database. We are
> able to get all the data in correct format in web UI.
>
> Please suggest some feasible way to deal with this requirment.

Ask an developer from India to write us a HindicLigaturizer. We don't
understand Hindi which makes it very difficult for us to do this.
--
This answer is provided by 1T3XT BVBA
http://www.1t3xt.com/ - http://www.1t3xt.info

------------------------------------------------------------------------------
Download Intel® Parallel Studio Eval
Try the new software tools for yourself. Speed compiling, find bugs
proactively, and fine-tune applications for parallel performance.
See why Intel Parallel Studio got high marks during beta.
http://p.sf.net/sfu/intel-sw-dev
_______________________________________________
iText-questions mailing list
[hidden email]
https://lists.sourceforge.net/lists/listinfo/itext-questions

Buy the iText book: http://www.1t3xt.com/docs/book.php
Check the site with examples before you ask questions: http://www.1t3xt.info/examples/
You can also search the keywords list: http://1t3xt.info/tutorials/keywords/
Reply | Threaded
Open this post in threaded view
|  
Report Content as Inappropriate
star

Re: Unicode(UTF-8) to PDF

Leonard Rosenthol-3
In reply to this post by Satish
You are going to need one or more fonts that support a wide range of Unicode glyphs.

-----Original Message-----
From: satish.napster [mailto:[hidden email]]
Sent: Wednesday, March 24, 2010 6:12 AM
To: [hidden email]
Subject: [iText-questions] Unicode(UTF-8) to PDF


We are working in web application and it has to support all the language. The
purpose of this application is to generate a PDF document of the data
submitted by the end user.
The end user can enter data in the following format. (This description is
just to make reader understand the situation)

1. He/ she has to enter some data in English and some in local language. And
application is not going to validate those and also do not have any
constraint which data which language(strongly depends on UTF-8 for web UI).
But in PDF we need to show both.
2. User can enter mix of two three languages in single field also. In that
case we need to print it as it is.
3. Numeric in all language along with English and special characters also
allowed..

I tried one approach to print the data in Hindi + English language. But the
following observations was seen
I used the mangal font for hindi..

bFont = BaseFont.createFont("mangal.ttf", BaseFont.IDENTITY_H,
BaseFont.EMBEDDED);

1.  I got all the hindi characters, but not in proper format… eq. It print
"hindi" as "hinadi".
2.  It ignored all the English characters out of that. but English numeric
it printed.

Note: We are storing data in Unicode(utf8) format in oracle database. We are
able to get all the data in correct format in web UI.

Please suggest some feasible way to deal with this requirment.

--
View this message in context: http://old.nabble.com/Unicode%28UTF-8%29-to-PDF-tp28012968p28012968.html
Sent from the iText - General mailing list archive at Nabble.com.


------------------------------------------------------------------------------
Download Intel® Parallel Studio Eval
Try the new software tools for yourself. Speed compiling, find bugs
proactively, and fine-tune applications for parallel performance.
See why Intel Parallel Studio got high marks during beta.
http://p.sf.net/sfu/intel-sw-dev
_______________________________________________
iText-questions mailing list
[hidden email]
https://lists.sourceforge.net/lists/listinfo/itext-questions

Buy the iText book: http://www.1t3xt.com/docs/book.php
Check the site with examples before you ask questions: http://www.1t3xt.info/examples/
You can also search the keywords list: http://1t3xt.info/tutorials/keywords/
------------------------------------------------------------------------------
Download Intel® Parallel Studio Eval
Try the new software tools for yourself. Speed compiling, find bugs
proactively, and fine-tune applications for parallel performance.
See why Intel Parallel Studio got high marks during beta.
http://p.sf.net/sfu/intel-sw-dev
_______________________________________________
iText-questions mailing list
[hidden email]
https://lists.sourceforge.net/lists/listinfo/itext-questions

Buy the iText book: http://www.1t3xt.com/docs/book.php
Check the site with examples before you ask questions: http://www.1t3xt.info/examples/
You can also search the keywords list: http://1t3xt.info/tutorials/keywords/
Reply | Threaded
Open this post in threaded view
|  
Report Content as Inappropriate
star

Re: Unicode(UTF-8) to PDF

Prasanna Joshi
This post has NOT been accepted by the mailing list yet.
Hello,

Even I used iText APi to generate PDF. the Font I have used is Mangal Font and UTF-8 for file encoding.

Source Text : Cargo (कार्गो)
Target Text on PDF I see : Caargo (कार््गो)

I see disjointed characters where joint words are used.

I created an instance of BaseFont for using Mangal and referred it for generating PDF.

BaseFont bfont = BaseFont.createFont("C:\\......\\Received Files\\MANGAL.ttf", BaseFont.IDENTITY_H, BaseFont.EMBEDDED);

1) Am I missing some setting or using incorrect font?

Kindly assist. Do you have any solution?

Prasanna Joshi
Loading...