Encoding and Unicode Demystified in Python
0.0 (0 ratings)
6 students enrolled
Everything you need to know about handling text in programming (with examples in Python)
Created by Jordan Powell
Last updated 5/2017
English
Includes:
  • 39 mins on-demand video
  • Full lifetime access
  • Access on mobile and TV
  • Certificate of Completion
What Will I Learn?
  • Understand how encoding works in detail
  • Build software that correctly handles text
  • Free code samples in Python
  • Support via Q&A
Requirements
  • Access to a computer with an internet connection
Description

Have you ever seen ordinary text like 'ça va' pass through a program or website and come out garbled as 'Ã§a va'?

This is due to broken handling of encoding. If you write software that handles text, you need to understand how to use Unicode and its encodings.
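
As a rough illustration (not taken from the course materials), one common way this kind of corruption arises is that text is encoded as UTF-8 but later decoded as Latin-1. The sketch below assumes exactly that mismatch; real-world cases may involve other encodings such as Windows-1252.

```python
# Assumption for this sketch: the text was written as UTF-8
# and then wrongly read back as Latin-1 (ISO-8859-1).
original = "ça va"

utf8_bytes = original.encode("utf-8")    # b'\xc3\xa7a va' ("ç" takes two bytes)
garbled = utf8_bytes.decode("latin-1")   # each byte is read as one character

# "ç" has turned into the two characters "Ã" and "§".
print(utf8_bytes)

# When the mismatch is known, it can be reversed without losing data.
repaired = garbled.encode("latin-1").decode("utf-8")
```

The repair step only works because Latin-1 maps every byte to a character, so nothing is lost in the bad decode.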

Unicode is the worldwide standard for character representation. As of version 9.0 it covers 135 writing systems and over 128,000 individual characters. This course provides an in-depth explanation of Unicode, encodings, and how they work together, with practical examples in both Python 2 and Python 3.

By the end of this short course you will know everything you need to write software that handles text correctly, every time!

Who is the target audience?
  • Anyone interested in text encoding
  • Anyone who wants to correctly code for internationalised text
Curriculum For This Course
Part 1: Encoding and Unicode
3 Lectures 13:53

An introduction to cover the outline of the course. The code samples I use are available on my GitHub repository, linked in the resources section of this lecture.

Preview 01:58

I describe how characters are laid out in computer memory as raw bytes and provide an example of how you might represent a string of text. 

Characters in memory
03:05
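
To make the idea of text as raw bytes concrete, here is a minimal Python 3 sketch. The sample string and variable names are illustrative assumptions, not the lecture's own example.

```python
# A short ASCII-only string, chosen so that each character is one byte.
text = "Hi!"
raw = text.encode("ascii")

byte_values = list(raw)                  # [72, 105, 33]
hex_view = raw.hex()                     # '486921'
bits = [format(b, "08b") for b in raw]   # one 8-bit pattern per byte

for char, value, pattern in zip(text, byte_values, bits):
    print(char, value, pattern)
```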

This lecture covers the fundamentals of encoding, teaching you how to go from a character all the way through to a sequence of bytes in memory. I look at specific examples in detail, covering both ASCII and UTF-8.

Encoding and Unicode
08:50
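
The path from a character to its bytes can be sketched by hand. The function below is a hypothetical helper written for illustration, following the standard UTF-8 bit layout; it skips validation (for example, it does not reject surrogate code points), so in practice `str.encode("utf-8")` is the right tool.

```python
def utf8_encode(code_point):
    """Encode one Unicode code point as UTF-8 bytes (illustrative only)."""
    if code_point < 0x80:
        # 1 byte: 0xxxxxxx (identical to ASCII)
        return bytes([code_point])
    if code_point < 0x800:
        # 2 bytes: 110xxxxx 10xxxxxx
        return bytes([0xC0 | (code_point >> 6),
                      0x80 | (code_point & 0x3F)])
    if code_point < 0x10000:
        # 3 bytes: 1110xxxx 10xxxxxx 10xxxxxx
        return bytes([0xE0 | (code_point >> 12),
                      0x80 | ((code_point >> 6) & 0x3F),
                      0x80 | (code_point & 0x3F)])
    # 4 bytes: 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx
    return bytes([0xF0 | (code_point >> 18),
                  0x80 | ((code_point >> 12) & 0x3F),
                  0x80 | ((code_point >> 6) & 0x3F),
                  0x80 | (code_point & 0x3F)])


ascii_a = utf8_encode(ord("A"))     # b'A'
c_cedilla = utf8_encode(0xE7)       # b'\xc3\xa7'
euro = utf8_encode(0x20AC)          # b'\xe2\x82\xac'
```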
Part 2: Encoding in Python
5 Lectures 25:15

Building on the first part of the course, I describe the different ways of representing textual data in both Python 2 and Python 3. This is particularly important to know if you're migrating code between Python 2 and Python 3.

Preview 04:48
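
As a brief sketch of the distinction between the two versions (written for Python 3, with the Python 2 behaviour noted in comments; the variable names are illustrative):

```python
# Python 3: str holds Unicode text, bytes holds raw encoded data.
# Python 2: a plain "..." literal is a byte string (str), and text
# needs the separate unicode type, written as u"...".
text = "ça va"
data = text.encode("utf-8")

text_length = len(text)   # 5 characters
data_length = len(data)   # 6 bytes, because "ç" takes two bytes in UTF-8

# Python 3 refuses to mix the two types. Python 2 would instead try an
# implicit ASCII decode, which is a common source of migration bugs.
try:
    text + data
    mixing_allowed = True
except TypeError:
    mixing_allowed = False
```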

This lecture introduces a number of useful utility functions that can be used to investigate and analyse strings in Python, which can really help you understand what's happening with the encoding process.

Ord, chr and unichr
03:06
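
A minimal sketch of these utility functions in Python 3 follows. Note that `unichr` exists only in Python 2, where `chr` is limited to values 0 to 255 and returns a byte string; in Python 3, `chr` covers the whole Unicode range.

```python
# ord() maps a single character to its code point.
code_point = ord("ç")          # 231, i.e. 0xE7

# chr() maps a code point back to a character.
# (In Python 2 this call would be unichr(0x20AC).)
euro = chr(0x20AC)

# Inspecting a string one character at a time.
code_points = [hex(ord(c)) for c in "ça"]   # ['0xe7', '0x61']
```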

This lecture teaches you how to confidently move between byte strings and Unicode strings in both major versions of Python. I discuss some general strategies for converting strings between the different encoded and unencoded formats.

Preview 03:50
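
One widely used strategy, which may or may not match the one taught in this lecture, is to decode bytes as soon as they enter the program, work only with text internally, and encode again on the way out. The helpers `to_text` and `to_bytes` below are hypothetical names for a Python 3 sketch of that idea.

```python
def to_text(value, encoding="utf-8"):
    """Return value as str, decoding it first if it is bytes."""
    if isinstance(value, bytes):
        return value.decode(encoding)
    return value


def to_bytes(value, encoding="utf-8"):
    """Return value as bytes, encoding it first if it is str."""
    if isinstance(value, str):
        return value.encode(encoding)
    return value


incoming = b"\xc3\xa7a va"       # bytes arriving from a file or socket
text = to_text(incoming)         # decode once, at the boundary
shouted = text.upper()           # all processing happens on text
outgoing = to_bytes(shouted)     # encode once, on the way out
```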

This lecture covers reading and writing UTF-8 encoded files, and provides some general rules for dealing with encoded text to make sure you always get it right. The example code I use is available on my GitHub repository, linked in the resources section of the first lecture.

Getting it right - File IO
06:41
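
The sketch below shows one way to read and write a UTF-8 file that behaves the same in Python 2 and Python 3, using `io.open` with an explicit encoding. The file name and contents are made up for the example and are not the course's sample code.

```python
import io
import os
import tempfile

# A throwaway location so the example does not touch real files.
path = os.path.join(tempfile.mkdtemp(), "greeting.txt")

# Always state the encoding instead of relying on the platform default.
with io.open(path, "w", encoding="utf-8") as f:
    f.write(u"ça va")

# Text mode with an encoding gives back decoded text.
with io.open(path, "r", encoding="utf-8") as f:
    text = f.read()

# Binary mode gives back the raw bytes stored on disk.
with io.open(path, "rb") as f:
    raw = f.read()

print(raw)   # b'\xc3\xa7a va' in Python 3
```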

I wrap up the course with some points about handling Unicode characters in Python source files and cover a few common errors such as the well-known UnicodeDecodeError.

Unicode in source files, common errors and conclusion
06:50
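
For illustration, a small Python 3 sketch of both topics: the source encoding declaration (required by Python 2 for non-ASCII source files, optional in Python 3 where UTF-8 is the default) and a deliberately triggered UnicodeDecodeError. The variable names are assumptions made for this example.

```python
# -*- coding: utf-8 -*-
# The declaration above must sit on the first or second line of a file.

data = "ça va".encode("utf-8")   # b'\xc3\xa7a va'

# Decoding with the wrong codec fails on the first non-ASCII byte.
try:
    data.decode("ascii")
    error = None
except UnicodeDecodeError as exc:
    error = exc   # keep a reference; exc is cleared after the except block

print(error)

# A lossy fallback: each undecodable byte becomes U+FFFD.
fallback = data.decode("ascii", errors="replace")
```

In Python 2 the same error often appears without an explicit decode call, because mixing byte strings and unicode strings triggers an implicit ASCII decode.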
About the Instructor
Jordan Powell
0.0 Average rating
0 Reviews
6 Students
1 Course
Software Developer

Jordan has 6 years of professional software development experience across a number of industries and enjoys programming in a variety of languages, including Python, Java, Lisp and C.

He recently became a Director of a tech start-up, Zumatech Ltd, and works on projects of all types and scales, from enterprise-scale web applications to small throwaway utility scripts. The most important thing for Jordan is to write high quality code and have fun doing so. A self-confessed geek, he has recently been getting a kick out of writing his own operating system and learning x86 assembly for his own amusement. 

He believes that programming is a creative and dynamic process, more akin to composing a piece of music or creating a work of art than to calculating formulas in a spreadsheet. He feels that creating elegant software solutions is something most people can enjoy if given the correct instruction and guidance, and that programming is not just for the mathematically or logically inclined.

He lives in the beautiful city of Brighton, on the South Coast of England, which is also known as “Silicon Beach”, and spends most of his time there hiding from the sun and sampling the excellent beers on offer across the many pubs and bars in the city.