Scroll to navigation

Data::TableReader::Decoder::IdiotCSV(3pm) User Contributed Perl Documentation Data::TableReader::Decoder::IdiotCSV(3pm)

NAME

Data::TableReader::Decoder::IdiotCSV - Access rows of a badly formatted comma-delimited text file

VERSION

version 0.014

DESCRIPTION

This decoder is like ::Decoder::CSV, but can additionally parse the garbage resulting from those special people who write "CSV Export" code that looks like

  print join(",", map qq{"$_"}, @record)."\n";

(or rather, the equivalent code in Visual Basic or PHP which is what they're probably using) regardless of their data containing quote characters or newlines, resulting in garbage like:

  "First Name","Last Name","email"
  "Joseph "Joe","Smith",""Smith, Joe" <jsmith@example.com>"

This can actually be processed by (recent versions of) the Text::CSV module with the following configuration:

  {
    binary => 1,
    allow_loose_quotes => 1,
    allow_whitespace => 1,
    escape_char => undef,
  }

And so this module is simply a subclass of Data::TableReader::Decoder::CSV which provides those defaults to the parser.

How does the parsing work though? Well, some guesswork and patterns. It's not super reliable, and you should always complain loudly to whoever generated that data, unless they're a much larger company than you and would never listen, or went out of business a while back, in which case you can justify using this module in production.

AUTHOR

Michael Conrad <mike@nrdvana.net>

COPYRIGHT AND LICENSE

This software is copyright (c) 2024 by Michael Conrad.

This is free software; you can redistribute it and/or modify it under the same terms as the Perl 5 programming language system itself.

2024-04-12 perl v5.38.2