You can configure the Mainframe Connector transcoder by adding the required configuration in a JSON file.
This file is referred to as the transcoder configuration file. You must define the configuration as specified in
section Configuration.
The qsam encode and qsam decode commands use the transcoder configuration file to perform data transcoding.
This page describes the various ways in which you can configure the Mainframe Connector transcoder.
Configuration
The Configuration object is the root of the transcoder configuration.
It contains all the configuration options for the transcoder.
| JSON representation | 
|---|
| { "defaults": object ( | 
| Fields | |
|---|---|
| defaults | 
 Specify default field modifiers for Cobol archetypes. | 
| field_suffixes | 
 Specify field suffixes. | 
| field_overrides | 
 Specify field overrides. | 
| transformations | 
 Specify field transformations. | 
| schema_validation_mode | 
 Specify the schema validation mode. | 
| header_records_to_skip | 
 Specify the number of first records to skip. | 
| record_filter_condition | 
 Specify a filter condition for records. The filter supports the following operators: 
 Example: 
 | 
DefaultsSection
The DefaultsSection object can be used to specify default modifications by cobol types.
These are applied before any suffix or override modifications.
| JSON representation | 
|---|
| { "alpha_numeric_display": object ( | 
| Fields | |
|---|---|
| alpha_numeric_display | 
 Specify defaults for alphanumeric (PIC X) fields. | 
| numeric_display | 
 Specify defaults for numeric display (zoned decimal) fields. | 
| binary | 
 Specify defaults for binary number (COMP) fields. | 
| packed_decimal | 
 Specify defaults for packed decimal (COMP-3) fields. | 
| national | 
 Specify defaults for national (PIC N) fields. | 
| utf8 | 
 Specify defaults for UTF-8 (PIC U) fields. | 
| dbcs | 
 Default for dbcs (DISPLAY-1) fields. | 
| hexadecimal_floating_point | 
 Default for hexadecimal floating point (COMP-1, COMP-2) fields. | 
FieldSuffix
Field suffixes apply to all fields that have a suffix.
Fields are matched if they end with a hyphen (-) or underscore (_) followed by the suffix.
Suffixes are case-insensitive.
The FieldSuffix modifier is applied after the FieldOverride modifier.
For example, the modifier defined for the suffix NID will be applied to the field named
FLD-NID but not for the field FUNID.
| JSON representation | 
|---|
| { "suffix": string, "is_inverse": boolean, "modifier": object ( | 
| Fields | |
|---|---|
| suffix | 
 The field with this suffix will have the modifier applied to it. | 
| is_inverse | 
 Specify whether the modifier is an inverse field modifier or not.
An inverse field modifier applies the modifier on another field that has the same name as the field with the modifier
without the modifier. For example, if both  When using an inverse field modifier, the special identifier  For example, to create a null-indicator field, you can use the  | 
| modifier | 
 Specify the modifier to apply to matching fields. | 
FieldOverride
Override or modify the decode and encode chain for the specified field.
| JSON representation | 
|---|
| { "field": string, "modifier": object ( | 
| Fields | |
|---|---|
| field | 
 Specify the name of the field to apply the modifier to. | 
| modifier | 
 Specify the modifier to apply to matching field. | 
Transformation
View transformations are used to modify the relationship between the table and the QSAM file. Transformations are always phrased from the point of view of the data. The concept is similar to view tables in BigQuery.
| JSON representation | 
|---|
| { "exclude": object ( | 
| Fields | |
|---|---|
| exclude | 
 | 
| unnest | 
 | 
| move | 
 | 
| rename | 
 | 
FieldModifier
A field modifier lets you modify the encoding or decoding of a specific field. Note that not all modifiers can be applied to all fields. See the documentation for the specific modifiers for more information.
| JSON representation | 
|---|
| { "filler": object ( | 
| Fields | |
|---|---|
| filler | 
 Excludes the field from processing and output. | 
| null_if | 
 Conditionally sets the field to null based on the value of another field. | 
| format_date | 
 Formats a string field as a date. | 
| chain | 
 Chains multiple modifiers to be applied sequentially. | 
| zoned_decimal | 
 Overrides default configuration for zoned decimal fields. | 
| binary | 
 Overrides default configuration for binary numeric fields. | 
| packed_decimal | 
 Overrides default configuration for packed decimal fields. | 
| null_if_invalid | 
 Sets the field to null if a transcoding error occurs, preventing record spillover. | 
| bytes | 
 Treats the field as a raw sequence of bytes, ignoring prior type information. | 
| varlen | 
 Set the record as a variable length field. | 
| string | 
 Overrides default configuration for string fields. | 
| null_if_empty | 
 Sets the field to null if its content is considered empty. | 
| format_timestamp | 
 Formats a string field as a timestamp. | 
| hfp | 
 Interprets the field as a Hexadecimal Floating-Point (HFP) number. | 
| decode_as_null | 
 Defines how null values must be decoded. | 
| encode_null_as | 
 Defines how null values must be encoded. | 
Exclude
Exclude a field from the resulting table, but still undergo decoding or encoding. This is useful when the field doesn't need to be transferred to the table, but is required for transcoding. For example, null indicator or length fields can be omitted from the table.
To bypass transcoding altogether, apply the filler modifier.
| JSON representation | 
|---|
| { "field": string | 
| Fields | |
|---|---|
| field | 
 Specify the field to exclude. | 
Unnest
Unnest the field.
| JSON representation | 
|---|
| { "field": string, "format": string | 
| Fields | |
|---|---|
| field | 
 Specify the field to unnest | 
| format | 
 Specify the new field format. The  For unnested structs,  For unnested arrays and lists,  | 
Move
Move a field in the record.
| JSON representation | 
|---|
| { "field": string, "offset": int | 
| Fields | |
|---|---|
| field | 
 Specify the field to move. | 
| offset | 
 Specify the number of places, forward or backwards, the field needs to be moved to. | 
Rename
Rename one or more fields based on a regular expression match.
For example, to replace all hyphens with underscores, use the following JSON format:
    {"find": "\\-", "replace":"_"}.
 
| JSON representation | 
|---|
| { "find": string, "replace": string | 
| Fields | |
|---|---|
| find | 
 Specifies a Java regular expression pattern to identify the field(s) to rename. The pattern is matched against the full field name. If the pattern matches any part of the field name, the field is considered a match. Examples: 
 | 
| replace | 
 Specifies the new name for the matched field(s). Capture groups from the  Examples: 
 | 
Filler
Specifies that a field will be ignored during processing. The field won't be decoded from the input or encoded to the output and will be excluded from the resulting schema and data table during decoding. You can apply this modifier to any field that has a static known size.
Provide an empty JSON object as follows:
| JSON representation | 
|---|
| { | 
NullIf
Set a field to null if a condition is satisfied. You must specify either
null_value or non_null_value or both.
To create a null-indicator field, you can use a FieldSuffix with a null_if field modifier,
and set is_inverse to true as shown in the following examples:
Example: Null-indicator
To create a null-indicator field, we can use thenull_if field modifier like so.
 {
  "field_suffixes": [
   {
     "suffix": "NID",
     "is_inverse": true,
     "modifier": {
     "null_if": {
       "null_value": "?",
       "target_field": "$self"
     }
    }
   }
  ]
 }
NID to effectively be null indicators as shown in the
following copybook snippet:
01 REC. 02 FIELD PIC X(10). 02 FIELD-NID PIC X(1).
Example: Binary null-indicator
To create abinary null-indicator field, we can use the binary and null_if field modifiers like so.
 {
  "field_suffixes": [
   {
     "suffix": "NID",
     "modifier": {
       "binary": {}
     }
   },
   {
     "suffix": "NID",
     "is_inverse": true,
     "modifier": {
     "null_if": {
       "null_value": "15",
       "target_field": "$self"
     }
    }
   }
  ]
 }
NID to effectively be binary
null indicators using the same copybook from the earlier example.
Example: Bytes null-indicator
To create abytes null-indicator field, we can use the bytes and null_if field modifiers like so.
The values for null and not-null are expressed as HEX.
 {
  "field_suffixes": [
   {
     "suffix": "NID",
     "modifier": {
       "bytes": {}
     }
   },
   {
     "suffix": "NID",
     "is_inverse": true,
     "modifier": {
     "null_if": {
       "null_value": "FF",
       "target_field": "$self"
     }
    }
   }
  ]
 }
NID to effectively be a bytes null indicator using the same copybook
from the earlier example.
    | JSON representation | 
|---|
| { "target_field": string, "null_value": string, "null_values": string, "non_null_value": string, "non_null_values": string | 
| Fields | |
|---|---|
| target_field | 
 Specify the field whose value you want to check. The field must be in scope. | 
| null_value | 
 When specified, if  | 
| null_values | 
 When specified, if  | 
| non_null_value | 
 When specified, if  | 
| non_null_values | 
 When specified, if  | 
FormatDate
Format a string to a date using one of the supported formats. You can only apply this modifier to sized fields. During the decoding process, the formats are tested in order until one of the formats matches the string. During the encoding process, the first format is used and the rest are ignored.
| JSON representation | 
|---|
| { "formats": object ( | 
| Fields | |
|---|---|
| formats | 
 List of date formats. | 
ModifierChain
Specify a modifier chain to apply multiple modifiers in series. The modifiers are applied in the order they are specified.
| JSON representation | 
|---|
| { "modifiers": object ( | 
| Fields | |
|---|---|
| modifiers | 
 Specify the list of modifiers to apply. | 
ZonedDecimal
Sets various options related to the encoding and decoding of zoned decimals. You can only be apply this modifier on a decimal field.
| JSON representation | 
|---|
| { "logical_type": enum ( | 
| Fields | |
|---|---|
| logical_type | 
 Specify the logical type to use when decoding or encoding the field. | 
| encoding | 
 The encoding with which the field is encoded. | 
Binary
Ignore any previous modifiers and treat this field as a binary number.
| JSON representation | 
|---|
| { "signedness": enum ( | 
| Fields | |
|---|---|
| signedness | 
 The signedness of the number. | 
PackedDecimal
Set this field to a PackedDecimal.
| JSON representation | 
|---|
| { "logical_type": enum ( | 
| Fields | |
|---|---|
| logical_type | 
 Override the logical type. By default, Mainframe Connector uses the optimal logical type based on the precision and scale. | 
NullIfInvalid
Treat the value as null if transcoding fails. You can only apply this modifier to sized fields. The error is ignored and is not logged in the spillover dataset. During the decoding process, the value of this field will be null for this record. During the encoding process, if the data can't be written, the entire field with be filled with null bytes.
Provide an empty JSON object as follows:
| JSON representation | 
|---|
| { | 
Bytes
Treats the field as a raw sequence of bytes. This modifier overrides any prior type information, causing the field's raw byte data to be preserved as-is without specific character encoding or numeric interpretation. You can be apply this modifier to any field regardless of its original type or size.
Provide an empty JSON object as follows:
| JSON representation | 
|---|
| { | 
VarLen
Represents a variable-length field.
A variable-length field contains three parts:
- A group item that contains two subfields.
- A field within the group item that contains the length of the transaction data.
- A field within the group item that contains the data.
The name of the variable-length field will be the group name.
Provide an empty JSON object as follows:
| JSON representation | 
|---|
| { | 
String
Sets the various options related to string decoding and encoding. Can only be applied on a string field.
| JSON representation | 
|---|
| { "encoding": string, "trim_suffix": boolean, "pad_char": string | 
| Fields | |
|---|---|
| encoding | 
 The encoding with which the field is encoded. | 
| trim_suffix | 
 When set to true, any whitespace at the end of the string will be trimmed. trim_suffix affects only decoding, encoding ignores trim_suffix. Note that strings that consist only of whitespaces will become empty strings. | 
| pad_char | 
 When set padding export strings with  | 
NullIfEmpty
Field should be set to null if all the bytes in that field are 0.
Provide an empty JSON object as follows:
| JSON representation | 
|---|
| { | 
FormatTimestamp
Format a string to a timestamp using one of the provided formats. This can only be applied to sized fields. During decode, the formats are tested in order until one of the formats matches the string. During encode, the first format will be used and the rest will be ignored.
| JSON representation | 
|---|
| { "formats": object ( | 
| Fields | |
|---|---|
| formats | 
 List of timestamp formats. | 
HFP
Set this field as Hexadecimal Floating-Point.
Provide an empty JSON object as follows:
| JSON representation | 
|---|
| { | 
DecodeAsNull
Defines how null values are interpreted during the decoding process. As COBOL doesn't natively support nulls, this parameter specifies what values must be treated as null.
| JSON representation | 
|---|
| { "values": string, "hex_bytes": string | 
| Fields | |
|---|---|
| values | 
 A list of string representations. After the initial decoding of the field to its string form, if the field's content matches any of these values, it will be treated as null. | 
| hex_bytes | 
 A list of hexadecimal representations of a single byte.
 When a field contains repetitions any of these bytes, it is treated as null.
 For example, using  | 
EncodeNullAs
Defines how null values are represented during the encoding process.
| JSON representation | 
|---|
| { "value": string, "hex_byte": string | 
| Fields | |
|---|---|
| value | 
 Encode this specific value when the source value is null. Verify that the string is valid for the field's type. | 
| hex_byte | 
 Encode this specific byte sequence (represented as a hexadecimal string)
 when the source value is null. For example,  | 
DateTimeFormat
Size and pattern to use when converting the field to a date.
| JSON representation | 
|---|
| { "size": int, "pattern": string | 
| Fields | |
|---|---|
| size | 
 Specify the size of the field this pattern applies to. | 
| pattern | 
 Specify the date formatter pattern. For more information on valid formatter patterns, see Class DateTimeFormatter. | 
ZonedDecimalEncoding
Specify the encoding to use when decoding or encoding a zoned decimal field.
| Enums | |
|---|---|
| UNSPECIFIED | Keep the encoding that is specified in the modifier chain.
If no modifier is specified, EBCDICis used. | 
| EBCDIC | Use EBCDICencoding. | 
| ASCII | Use ASCIIencoding. | 
BinarySignedness
Logical type to use for a decimal field.
| Enums | |
|---|---|
| UNSPECIFIED | Use the most optimal type based on the scale and precision. | 
| SIGNED | Use 64-bits to store the value. This modifier only works for numbers whose precision is less than or equal to 18, and the scale is 0. | 
| UNSIGNED | Use 64-bits to store the value. This modifier only works for numbers whose precision is less than or equal to 18. | 
DecimalLogicalType
Logical type to use for a decimal field.
| Enums | |
|---|---|
| AUTO | Use the most optimal type based on the scale and precision. | 
| LONG | Use 64-bits to store the value. This modifier only works for numbers whose precision is less than or equal to 18, and the scale is 0. | 
| DECIMAL64 | Use 64-bits to store the value. This modifier only works for numbers whose precision is less than or equal to 18. | 
| BIG_DECIMAL | Store the value as an unbounded decimal value. This is the slowest option but supports any decimal of any precision at any scale. | 
| BIG_INTEGER | Store the value as an unbounded integer value. This is the slowest option but supports any integer of any precision. | 
SchemaValidationMode
Specify the schema validation mode to use during the copybook compilation. This mode verifies compatibility with a specific target data format.
| Enums | |
|---|---|
| DEFAULT | Default schema validation mode. This mode verifies that unique field names are in the copybook. | 
| BIG_QUERY | Schema validation mode for BigQuery compatibility. This mode extends the default validation to verify that the copybook's schema is compatible with BigQuery's data types. | 
| POSTGRES | Schema validation mode for PostgreSQL compatibility. This mode extends the default validation to verify that the copybook's schema is compatible with PostgreSQL data types. | 
| MYSQL | Schema validation mode for MySQL compatibility. This mode extends the default validation to verify that the copybook's schema is compatible with MySQL data types. |