Page tree

Version 1.0, 27 May 2016

What is bidirectional text?

Many of the world’s scripts run horizontally from left to right (in lines from top to bottom). Some, like Arabic, Hebrew, and Syriac, predominantly run from right to left. However, these scripts also exhibit left-to-right character direction in the representation of numbers, so they are inherently bidirectional.

We also speak of ‘bidirectional text’ when a text run contains both left-to-right and right-to-left text phrases.

Note: ‘bidi’ is often used as shorthand for ‘bidirectional text’.

Handling of bidirectional text in software is not trivial

While a human can, after very little training, correctly decode most runs of bidirectional text, computer software has traditionally struggled with bidi, or even not supported such text at all. At the time of this writing (May of 2016), software support for bidi has still not settled across the board: some software implementations are very good and stable, such as Adobe InDesign CC and – up to a point – MS Word 2016. But rendering bidi in current web browsers is not always the same.

Enter the Unicode Bidirectional Algorithm

From an early date in the development of the Unicode Standard, around 1999-2000, measures were taken to provide software implementers with standardized tools to support the correct ordering in computer memory, as well as the correct representation on screen and in print of bidirectional text. If you’re not really familiar with the Unicode Bidirectional Algorithm, then before reading further you should read the basic introduction to how the bidi algorithm works.

Required reading

Unicode in XML and other Markup Languages: 3.3 Bidi Embedding Controls

Unicode controls vs. markup for bidi support

Structural markup and right-to-left text in HTML

Inline markup and bidirectional text in HTML

CSS vs. markup for bidi support

W3C CSS Writing Modes Level 3

Bidi space loss