The Data Matrix barcode (ISO/IEC 16022) is a high-density, two-dimensional (2D) symbology that encodes text, numbers, files, and actual data bytes. This FAQ provides information and answers to commonly asked questions about the 2D Data Matrix barcode symbology.
- Data Matrix Overview
- Printing and Generation
- GS1 Digital Link Encoding
- DOD UID | MIL-STD-130
- ISO/IEC 15434
- UPS Mail Innovations
- ASCII Functions
- Encoding Double Byte, Unicode, and Extended ASCII Characters
- Amount of Data Encoded
- DataMatrix Encoding Modes
- Control Characters
- Symbol Formats
- Size and Accuracy
- Reading Symbols
- The Inventor of Data Matrix and License fee Requests
Data Matrix is a very efficient, two-dimensional (2D) barcode symbology that uses a small area of square modules with a unique perimeter pattern, which helps the barcode scanner determine cell locations and decode the symbol. Characters, numbers, text, and actual bytes of data may be encoded, including Unicode characters and photos.
The encoding and decoding process of Data Matrix is very complex. Several methods have been used for error correction in the past. All current implementations have been standardized on the ECC200 error correction method, which is approved by ANSI/AIM BC11 and the ISO/IEC 16022 specification. IDAutomation 2D Data Matrix barcode products all support ECC200 by default and are based on the ANSI/AIM BC11 and the ISO/IEC 16022 specifications. The Reed-Solomon error correction algorithms of ECC200 allow the recognition of barcodes that are up to 60% damaged.
Specific standards & specs for the 2D Data Matrix barcode symbology:
- AS9132 – Society of Automotive Engineers (SAE) parts marking
- Air Transportation Association’s (ATA) Spec 2000
- Automotive Industry Action Group (AIAG):
AIAG B-4 – parts identification
AIAG B-8 – shipping labels
AIAG B-11 – tire and wheel identification
AIAG B-13 – symbology white paper
AIAG B-17 – direct parts making
- DOD UID – Department of Defense Guide to Uniquely Identify items
- EIA 706 – Electronics Industry Marking Standard
- EIA 802 – Electronics Industry Marking Standard
- GS1 Data Matrix – standard for encoding GTIN numbers on small items with camera-based readers
- ISO/IEC 15418 – symbol data format semantics
- ISO/IEC 15434 – symbol data format syntax
- ISO/IEC 15415 – 2-D print quality standard
- MIL-STD-130 – US DOD Department of Defense standard
- NASA-STD-6002 – aerospace parts marking
- NASA-HDBK-6003 – aerospace direct part marking methods and techniques
- SAE AS9132 – Society of Automotive Engineers (SAE) aerospace industry marking standard
- Semiconductor Equipment and Materials International (SEMI):
T2-0298E – silicon wafers marking
T3-0302 – wafer box labels
T7-0302 – double-sided wafers
T8-0698E – flat panel display substrates
T9-0200E – lead-frame strips marking
T10-0701 – direct mark quality test method
- X6721 – Korean standard
Data Matrix is one of the smallest and most dependable barcode symbologies. Compared to other barcode types, DataMatrix is approximately 30 times smaller than a Code 39 barcode representing the same data. The size difference of popular barcode types is compared in the Barcode Symbology Evaluation and Test Sheet. The 2D Data Matrix barcode is also the recommended choice when sending barcodes over faxed documents because the symbol can withstand many poor resolution and scanning issues.
The required Reed-Solomon error correction built into Data Matrix ECC200 is able to reconstruct and verify the data scanned for improved accuracy. In a study at The Center for Automatic Identification at Ohio University, the statistical probability of a misread error with Data Matrix is 1 in 10.5 million scans, compared to a misread error probability of 1 in 1.7 million with the Code 39 barcode.
The IDAutomation Data Matrix Barcode Font and Encoder is a collection of encoders and components that generate ECC200 Data Matrix symbols with fonts or graphics. Several types of encoders are available in the package to generate Data Matrix symbols.
- Crystal Reports
- FileMaker Pro
- Reporting Services | SSRS
- VB 6 and Visual Basic .NET
- Web Barcodes
Symbols may be verified with the IDAutomation Barcode Data Decoder Verifier App or the Barcode Scanner ASCII String Decoder desktop application. Both reveal hidden control characters such as the FNC1, GS, RS, and EOT. The app has additional capabilities such as decoding HRI in GS1 symbols and reporting the symbol size. When downloaded to a mobile device, it easily reads from the screen or printed material.
The most common method of reading Data Matrix barcodes is with a camera-based image reader (aka: barcode imager). A common implementation is to use the camera on a mobile device. IDAutomation offers the Barcode Data Decoder Verifier App & SDK which allows Xamarin app developers to customize the app for specific implementations.
On desktop computers, most hand-held barcode imagers perform keyboard emulation and receive power from the USB port so that no external power supply is needed. When a barcode symbol is read using keyboard emulation, the data appears at the cursor as if it had been typed in from the keyboard. In many cases, it may be desired to have the scanner trigger a form or action in an application. IDAutomation has documented simple methods of accomplishing this task in the USB Barcode Scanner Application Integration Guide.
DataMatrix allows ASCII codes to be easily encoded for various functions such as tabs, returns, and the RS, GS, and EOT characters required for ISO/IEC 15434 and DOD UID labels. In all IDAutomation products, the tilde (~) may be used to encode ASCII functions according to the documentation. For example; ~d009 is used to encode a tab and ~d013 encodes a return. These functions are not usually visible when scanned unless the Barcode Scanner ASCII String Decoder or the Barcode Decoder App is used.
In many development environments, Chr or Char may also be used to encode the ASCII value directly.
For example, the programming examples below encode ECC<tab>200:
ProcessTilde: = “ECC~d009200″
Java: = “ECC” + (char)9 + “200”;
Visual Basic: = “ECC” & Chr(9) & “200”
All 256 ASCII characters can easily be encoded with the BASE256 encoding mode, which encodes all data, byte-by-byte. For information about encoding and decoding UTF-8 and Unicode characters, refer to the UTF-8 and Unicode Encoding FAQ.
It is recommended to limit the amount of data encoded in each symbol to 800 characters or less if possible. Although the AIM Data Matrix specifications state, “up to 2335 alphanumeric characters can be encoded,” it has been determined that these numbers are not realistic. The amount of data that can be encoded will vary depending on the type of data, the encoding mode, and what the scanner can read. In most implementations, the amount of data that can be encoded is significantly decreased due to mode switching between different types of characters, such as between numbers, upper case, lower case, and punctuation.
Most camera-based imagers and hand-held scanners have a difficult time reading symbols that contain over 800 characters. In the best-case scenario, up to 1200 ASCII characters have been successfully encoded and read by using the text encoding mode of the IDAutomation Data Matrix Barcode Forms Control with the IDAutomation 2D Barcode Scanner.
Products such as the IDAutomation Data Matrix Barcode Fonts and the Data Matrix Components all support the encoding modes listed below. By default, the encoding mode for most components is BASE256. If the choice is to encode text or numbers only and size is a concern, a change of the encoding mode to ASCII may produce a smaller symbol. Valid values are 0 for BASE256, 1 for C40, 2 for TEXT, and 3 for ASCII.
- 0 – BASE256 is used to encode 8 bit values, bytes of data.
- 1 – C40 is used to encode data that primarily consists of numeric and upper case characters. C40 encodes three alphanumeric data characters into two bytes.
- 2 – TEXT is used to encode data that primarily contains numeric and lowercase characters. TEXT encodes three alphanumeric data characters into two bytes.
- 3 – ASCII is used to encode data that mainly contains ASCII characters (0-127). It encodes one alphanumeric or two numeric characters per byte. This mode is required when encoding GS1-DataMatrix, DOD UID, or ISO/IEC 15434.
Most encoding systems can be used to encode any data; however, encoding binary data with C40 will generate much more overhead (a larger symbol) than with BASE256.
IDAutomation Data Matrix Barcode Fonts, Components, and Applications use the tilde character “~” to recognize special characters when “Apply Tilde” or “Process Tilde” is enabled. The following tilde options are available:
- ~dNNN: Represents the ASCII character encoded by the 3 digits NNN. For example, ~d009 represents a tab, ~d013 represents a return and ~d065 represents the character ‘A’.
- ~1: Represents the character FNC1 commonly used for GS1-DataMatrix. When FNC1 appears in the first position, it indicates that the data conforms to the GS1 Application Identifier format.
|Format Number||Symbol Size||Max Numeric Capacity||Max Alphanumeric Capacity||Max Binary Capacity||Max Correctable Error/Erasure|
|0||10 x 10||6||3||1||2|
|1||12 x 12||10||6||3||3|
|2||14 x 14||16||10||6||5/7|
|3||16 x 16||24||16||10||6/9|
|4||18 x 18||36||25||16||7/11|
|5||20 x 20||44||31||20||9/15|
|6||22 x 22||60||43||28||10/17|
|7||24 x 24||72||52||34||12/21|
|8||26 x 26||88||64||42||14/25|
|9||32 x 32||124||91||60||18/33|
|10||36 x 36||172||127||84||21/39|
|11||40 x 40||228||169||112||24/45|
|12||44 x 44||288||214||142||28/53|
|13||48 x 48||348||259||172||34/65|
|14||52 x 52||408||304||202||42/78|
|15||64 x 64||560||418||278||56/106|
|16||72 x 72||736||550||366||72/132|
|17||80 x 80||912||682||454||96/180|
|18||88 x 88||1152||862||574||112/212|
|19||96 x 96||1392||1042||694||136/260|
|20||104 x 104||1632||1222||814||168/318|
|21||120 x 120||2100||1573||1048||204/390|
|22||132 x 132||2608||1954||1302||248/472|
|23||144 x 144||3116||2335||1556||310/590|
|24||8 x 18||10||6||3||3|
|25||8 x 32||20||13||8||5|
|26||12 x 26||32||22||14||7/11|
|27||12 x 36||44||31||20||9/15|
|28||16 x 36||64||46||30||12/21|
|29||16 x 48||98||72||47||14/25|
Acuity CiMatrix / Siemens invented the Data Matrix ECC200 symbology and placed it in the public domain. Acuity CiMatrix and AIM Global both believe the Data Matrix ECC200 barcode is a public domain symbology and those license fees are not necessary for Data Matrix Patent Claims made by Acacia and other companies. Cognex challenged and won a case relating to Data Matrix patent claims by Acacia and Veritec.