This is the reference documentation for the CSV file reader in hale studio. Usually you will not need this reference, because in hale studio I/O providers are configured via the respective import and export wizards. This reference can come in handy if you want to launch a transformation via the command line.
Name | CSV file |
---|---|
Identifier | eu.esdihumboldt.hale.io.csv.reader.instance |
Implementation | eu.esdihumboldt.hale.io.csv.reader.internal.CSVInstanceReader |
The following configuration parameters are specific to this reader.
States how many lines should be skipped when reading instances from a CSV file (e.g. because of a header line). Boolean values are also supported for backwards compatibility: true is translated into 1 (i.e. skip the first line) and false into 0 (i.e. do not skip any lines).
Setting name | skip |
---|---|
Any integer equal to or larger than 0 | |
Default value | 0 |
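The translation rule described for the skip setting can be sketched as follows. This is only an illustration of the documented behaviour, not the reader's actual implementation; the function name `normalize_skip` and the handling of invalid input are assumptions.

```python
def normalize_skip(value: str) -> int:
    """Translate a raw `skip` setting into the number of lines to skip.

    Hypothetical helper that mirrors the documented rule; it is not part of hale studio.
    """
    text = value.strip().lower()
    # Legacy boolean form: true means "skip the first line", false means "skip nothing".
    if text == "true":
        return 1
    if text == "false":
        return 0
    count = int(text)  # raises ValueError for non-numeric input
    if count < 0:
        # Assumption: negative values are rejected, since only integers >= 0 are allowed.
        raise ValueError("skip must be an integer equal to or larger than 0")
    return count
```

With this sketch, `normalize_skip("true")` and `normalize_skip("1")` both yield 1, which matches the backwards-compatible behaviour described above.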
Specifies the format used when importing string values that represent a date.
Setting name | dateTimeFormatterDefault |
---|---|
Default value | dd.mm.yyyy |
Defaults to dd.mm.yyyy |
Setting name | typename |
---|---|
Example value | {namespace}localname |
The type name is represented like in the given example, with the namespace in curly braces. |
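To illustrate the `{namespace}localname` notation, the following sketch splits such a value into its two parts. The helper name `parse_type_name`, the example namespace and the fallback for values without curly braces are assumptions made for this example; they do not describe how the reader itself parses the setting.

```python
import re
from typing import Tuple

# Namespace in curly braces, followed by a non-empty local name.
_TYPE_NAME = re.compile(r"^\{(?P<namespace>[^}]*)\}(?P<local>.+)$")


def parse_type_name(value: str) -> Tuple[str, str]:
    """Split '{namespace}localname' into (namespace, localname).

    Hypothetical helper, for illustration only.
    """
    match = _TYPE_NAME.match(value)
    if match is None:
        # Assumption: a value without braces is a local name with an empty namespace.
        return "", value
    return match.group("namespace"), match.group("local")


namespace, local = parse_type_name("{http://www.example.com/rivers}River")
# namespace is "http://www.example.com/rivers", local is "River"
```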
Separator character that delimits columns in the CSV file.
Setting name | separator |
---|---|
Defaults to tabulator (\t) |
Character used to quote CSV entries.
Setting name | quote |
---|---|
Default value | " |
Character used to escape special characters, e.g. the quote character.
Setting name | escape |
---|---|
Default value | \ |
Decimal separator used to identify floating-point values.
Setting name | decimal |
---|---|
Default value | . |
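How the separator, quote, escape and decimal settings interact is easiest to see on a small sample. The sketch below uses Python's standard `csv` module as a stand-in for the reader, with the default values listed above; the function names `read_rows` and `parse_decimal` are hypothetical, and the real reader may differ in edge cases.

```python
import csv
import io
from typing import List


def read_rows(text: str, separator: str = "\t", quote: str = '"',
              escape: str = "\\") -> List[List[str]]:
    """Split CSV text into rows of string cells using the given characters."""
    reader = csv.reader(
        io.StringIO(text),
        delimiter=separator,  # corresponds to the `separator` setting
        quotechar=quote,      # corresponds to the `quote` setting
        escapechar=escape,    # corresponds to the `escape` setting
    )
    return list(reader)


def parse_decimal(cell: str, decimal: str = ".") -> float:
    """Parse a floating-point cell that uses `decimal` as its decimal separator."""
    return float(cell.replace(decimal, "."))


# A tab inside a quoted entry is kept as data and does not start a new column.
rows = read_rows('id\tname\tlength\n1\t"Main\tRiver"\t12,5\n')
length = parse_decimal(rows[1][2], decimal=",")
```

In this sample the second row has three cells, and the last one is read as the number 12.5 because the decimal setting is a comma.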
The following configuration parameters are shared for all instance readers.
For cases where the spatial reference system cannot be detected from the data, you can specify a default SRS to assume for these geometries. The SRS can be provided as a code (with prefix code:) or as WKT (with prefix wkt:).
Setting name | defaultSrs |
---|---|
Example value | code:EPSG:25832 |
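The two prefixes can be told apart by splitting the value at its first colon, as in this sketch. The helper name `parse_default_srs` and the strict, case-sensitive prefix check are assumptions for illustration and are not taken from hale studio.

```python
from typing import Tuple


def parse_default_srs(value: str) -> Tuple[str, str]:
    """Split a defaultSrs value into (kind, definition), where kind is 'code' or 'wkt'.

    Hypothetical helper, for illustration only.
    """
    # Split only at the first colon, so a code such as 'EPSG:25832' stays intact.
    kind, colon, definition = value.partition(":")
    if not colon or kind not in ("code", "wkt") or not definition:
        raise ValueError("expected a value starting with 'code:' or 'wkt:'")
    return kind, definition


kind, definition = parse_default_srs("code:EPSG:25832")
# kind is "code", definition is "EPSG:25832"
```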
The following configuration parameters are shared for all I/O providers.
During import, hale studio tries to auto-detect the content type. For export this is not possible, but I/O providers may have a default content type associated with them. If an I/O provider supports only one content type, it is usually not necessary to set this parameter. In any case, you can override the content type to use or assume by giving a content type identifier (see the list of supported content types at the bottom of the page).
Setting name | contentType |
---|---|
Example value | eu.esdihumboldt.hale.io.csv |
Specifying a character encoding is especially important for import, so that the data is interpreted correctly. Not every I/O provider may respect this setting, though (e.g. if the encoding is given by the data source). Specify a character encoding by its name as recognised by Java.
Setting name | charset |
---|---|
Example value | UTF-8 |
Supported content types: eu.esdihumboldt.hale.io.csv