ABSTRACT
The present update on the global distribution of Mycobacterium tuberculosis complex spoligotypes provides both the octal and binary descriptions of the spoligotypes for M. tuberculosis complex, including Mycobacterium bovis, from >90 countries (13,008 patterns grouped into 813 shared types containing 11,708 isolates and 1,300 orphan patterns). A number of potential indices were developed to summarize the information on the biogeographical specificity of a given shared type, as well as its geographical spreading (matching code and spreading index, respectively). To facilitate the analysis of hundreds of spoligotypes each made up of a binary succession of 43 bits of information, a number of major and minor visual rules were also defined. A total of six major rules (A to F) with the precise description of the extra missing spacers (minor rules) were used to define 36 major clades (or families) of M. tuberculosis. Some major clades identified were the East African-Indian (EAI) clade, the Beijing clade, the Haarlem clade, the Latin American and Mediterranean (LAM) clade, the Central Asian (CAS) clade, a European clade of IS6110 low banders (X; highly prevalent in the United States and United Kingdom), and a widespread yet poorly defined clade (T). When the visual rules defined above were used for an automated labeling of the 813 shared types to define nine superfamilies of strains (Mycobacterium africanum, Beijing, M. bovis, EAI, CAS, T, Haarlem, X, and LAM), 96.9% of the shared types received a label, showing the potential for automated labeling of M. tuberculosis families in well-defined phylogeographical families. Intercontinental matches of shared types among eight continents and subcontinents (Africa, North America, Central America, South America, Europe, the Middle East and Central Asia, and the Far East) are analyzed and discussed.