This is a reference documentation of the CSV file reader in hale studio. Usually you will not need this reference, as in hale studio the configuration of I/O providers is done via the respective wizards for import and export. Where this reference can come in handy is if you want launch a transformation via the command line.
| Name | CSV file |
|---|---|
| Identifier | eu.esdihumboldt.hale.io.csv.reader.instance |
| Implementation | eu.esdihumboldt.hale.io.csv.reader.internal.CSVInstanceReader |
Following are the specific configuration parameters for this reader.
States how many lines should be skipped when reading instances from a CSV file (e.g. because of the header). Also boolean values are supported for backwards compatibility. When a boolean is used, true is translated into 1 (i.e. skip the first line) and false into 0 (i.e. do not skip any lines)
| Setting name | skip |
|---|---|
| Any integer equal or larger than 0 | |
| Default value | 0 |
Specify the format for importing String values that represent a Date
| Setting name | dateTimeFormatterDefault |
|---|---|
| Default value | dd.mm.yyyy |
| Default to dd.mm.yyyy |
| Setting name | typename |
|---|---|
| Example value | {namespace}localname |
| The type name is represented like in the given example, with the namespace in curly braces. |
Separator character that delimits columns in the CSV file
| Setting name | separator |
|---|---|
| Defaults to tabulator (\t) |
Character used to quote CSV entries
| Setting name | quote |
|---|---|
| Default value | " |
Character used to escape special characters, e.g. the quote character
| Setting name | escape |
|---|---|
| Default value | \ |
Separator is used to identify float value
| Setting name | decimal |
|---|---|
| Default value | . |
The following configuration parameters are shared for all instance readers.
For cases where the spatial reference system cannot be detected
from the data you can specify a default SRS to assume for these
geometries. The SRS can be provided as code (with prefix
code:
) or as WKT (with prefix
wkt:
)
| Setting name | defaultSrs |
|---|---|
| Example value | code:EPSG:25832 |
The following configuration parameters are shared for all I/O providers.
During import the content type is tried to be auto-detected, for export this is not possible, but I/O providers may have a default content type associated. If an I/O provider only supports one content type, it's usually not necessary to set this parameter. In any case you can override the content type to use/assume using a content type identifier (see the list of supported content types at the bottom of the page).
| Setting name | contentType |
|---|---|
| Example value | eu.esdihumboldt.hale.io.csv |
Specifying a character encoding is important especially for the
import, so data is interpreted correctly. Not every I/O provider
may respect this setting though (e.g. if the encoding is given by
the data source).
Specify a character encoding by its name
as recognised by Java.
| Setting name | charset |
|---|---|
| Example value | UTF-8 |
eu.esdihumboldt.hale.io.csv)