[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[no subject]



Here is a sample piece of data.

Id    LASTNAME    FIRSTNAME
      Adams       Portia
10572 Alexander   Robert

You can see that the first row does not have an ID.  This can be true
for all columns.  They may or may not have values.  

Here is how I'm trying the parse:

open TXT, "< Expanded_2005_Select_1.csv";
while(<TXT>) {
        m/^(\d+?)\t/;
        print "$1\n";
}

Each columns is tab delimeted.  When I run this I get the lastname in $1
for the first line and the the ID in $1 for the second line.  I need to
somehow create a regex that would be unforgiving of nothing being there.

Data file looks like this:
      1 "ID"    "LASTNAME"      "FIRSTNAME"     "TITLE" "COMPANY"
"ADDRESS        "       "ADDRESS2"      "CITY"  "STATE" "ZIPCODE"
"COUNTRY"               "PHONE" "EMAIL" "REGTYPE"       "DATE"  "TIME"
"Question1"     "Questio        n2"     "Question3"     "READERID"
      2         "Adams" "Portia"        "Director"      "The Rockefeller
Univers        ity"    "1230 York Ave "                "New York"
"NY"    "10021-6        399"    "USA"   2123277719
"adams at rockefeller.edu" "Member"                
      3 10572   "Alexander"     "Robert"        "Manager Voice & Video
Solution"                "Air Products and Chemicals, Inc"       "7201
Hamilton Blvd"                    "Allentown"     "PA"    "18195-1501"
"USA"   "610-481-7156"          "alexanrw at airproducts.com"      "Member"
06/12/2005      06:06:14         pm                             60711

The 1,2,3 that you see is the line numbers in VI



</pre>
<!--X-Body-of-Message-End-->
<!--X-MsgBody-End-->
<!--X-Follow-Ups-->
<hr>
<ul><li><strong>Follow-Ups</strong>:
<ul>
<li><strong><a name="00082" href="msg00082.html">[ale] Parsing CSV file in perl</a></strong>
<ul><li><em>From:</em> jimmyoliver at gmail.com (Jimmy Oliver)</li></ul></li>
<li><strong><a name="00095" href="msg00095.html">[ale] Parsing CSV file in perl</a></strong>
<ul><li><em>From:</em> fletch at phydeaux.org (Fletch)</li></ul></li>
</ul></li></ul>
<!--X-Follow-Ups-End-->
<!--X-References-->
<!--X-References-End-->
<!--X-BotPNI-->
<ul>
<li>Prev by Date:
<strong><a href="msg00077.html">[ale] sorting large delimited text files</a></strong>
</li>
<li>Next by Date:
<strong><a href="msg00079.html">[ale] link to an excellent review of SUSE 9.3 Professional</a></strong>
</li>
<li>Previous by thread:
<strong><a href="msg00076.html">[ale] sorting large delimited text files</a></strong>
</li>
<li>Next by thread:
<strong><a href="msg00082.html">[ale] Parsing CSV file in perl</a></strong>
</li>
<li>Index(es):
<ul>
<li><a href="maillist.html#00078"><strong>Date</strong></a></li>
<li><a href="threads.html#00078"><strong>Thread</strong></a></li>
</ul>
</li>
</ul>

<!--X-BotPNI-End-->
<!--X-User-Footer-->
<!--X-User-Footer-End-->
</body>
</html>