phpminer
Posts: 28
|
Posted: 12/26/2011, 8:20 AM |
|
What collating sequeunce (i.e. latin1_general_ci, utf8_bin, etc.) would be the most
efficient in terms of space used to store and retrieve German text?
What challenges have some of you had in dealing with German text within your
CCS PHP applications?
|
|
|
magus
Posts: 98
|
Posted: 12/27/2011, 5:21 PM |
|
Hi PhpMiner,
Latin1 ISO-8859-1 will be the most efficient space-wise.
But these days I do everything with utf8 to minimize character set problems.
There is a consideration of where your data is coming from. If it is being typed in then you should not have any problems with latin1. If you are pasting text in, then you cannot be sure of the source character set and you could find illegal characters coming through especially from Microsoft documents. If you are republishing feeds or scraping the web for content then it is likely you will have character compatibility problems.
Don
|
|
|
|