| |
CSV Parser
Overview
The CSV (Comma Separated Values) parser reads and writes CSV style data.
Configuration
The parser needs the following parameters:
Parameter |
Description |
class |
com.architech.parser.rspCSVParser |
csvColumns |
This parameter specifies the name for each
column the parser should read/write. If not specified the parser will read
the first line and use the value as field names. You should either use the
Field Separator between the field names, or you could specify each name on
a separate line. |
csvColumnSeparator |
This parameter specifies the character
used to separate each column. If not specified the parser will attempt to
guess when reading and use a comma when writing. You can use backslash as escape
character to specify non printable characters. For
example \t denotes the TAB character. |
csvEnableQuoting |
This parameter should normally be set. For
versions 4.6.4 and earlier: when the parser output attributes containing the column separator, newline or a quote
character("), the field would be output with quotes around it whether
or not this parameter was set to true. Quotes and
column separators inside a quoted field
were preceded by a backslash (\). From version 4.6.5, when this parameter
is set to true, the field will be output with quotes around it under the
same conditions as in previous versions, however, quotes inside a
quoted field are now doubled. Note: if csvEnableQuoting is set
to false, the field will be output "as is" which may cause
problems.
When reading, quotes around the
field will be stripped if this parameter is set to true, and the parser is
able to read quoted attributes containing for example the column
separator. If this parameter is set to false, the parsing will probably return
unexpected values when the input contains fields delimited by quotes.
|
csvWriteHeader |
The default value for this parameter is true.
If csvWriteHeader is set, the first line output by the parser will contain
all the field names separated by the column separator.
|
characterSet |
Optional
character set conversion. This parameter is not directly available
from the parser configuration, you must edit the configuration file
manually to set it. |
Note that in the Admin Tool, the parameters are set in the Parser tab of the
File Connector. If you want to use TAB as a Field Separator you need to specify \t, but
when supplying
Field Names you have to use the actual tab character between field names.
| |
|