align-toparrow-leftarrow-rightbackbellblockcalendarcamerachatcheckchevron-downchevron-leftchevron-rightchevron-small-downchevron-small-leftchevron-small-rightchevron-small-upchevron-upcircle-with-crosscrosseditfacebookglobegoogleimagesinstagramlocation-pinmagnifying-glassmailmoremuplabelShape 3 + Rectangle 1outlookpersonplusImported LayersImported LayersImported Layersshieldstartwitteryahoo

Main Meeting: Character Sets and PHP

Occasionally you may find a web page that renders a series of nonsense characters in the midst of otherwise sensible text. The nonsense characters may be question marks inside black diamonds, or inverted question marks, or things that look like à (the A-Tilde) or Š(the A-Ring) followed by some other characters. Whenever you see this, it's the signature of a character set encoding error. While there are many ways to botch character set encoding, as a practical matter these errors almost always arise when Extended-ASCII data and UTF-8 data are intermixed. PHP is changing its position on some aspects of character encoding at release 5.4. This presentation looks at the history of PHP character encoding and gives some strategies for dealing with legacy data in an increasingly UTF-8 world.

Ray Paseur is a consulting PHP developer who lives and works in McLean, VA. He is a member of the DC PHP Group and serves on the Product Advisory Committee for Experts-Exchange.

Join or login to comment.

Our Sponsors

  • php[architect]

    Pays for our Meetup page and provides occasional beverages.

  • Canvas Co/work

    Canvas gives us our space each month!

  • AOL

    AOL sponsors pizzas and other eats for the main meetup!

People in this
Meetup are also in:

Sign up

Meetup members, Log in

By clicking "Sign up" or "Sign up using Facebook", you confirm that you accept our Terms of Service & Privacy Policy