[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
[no subject]
- <!--x-content-type: text/plain --> "http://www.w3.org/TR/html4/loose.dtd">
- <!--x-date: Wed Jul 6 16:50:35 2005 -->
- <!--x-from-r13: psbjyre ng bhgcbfgfragvary.pbz (Quevfgbcure Tbjyre) -->
- <!--x-message-id: [email protected] -->
- <!--x-subject: [ale] Parsing CSV file in perl -->
- <li><em>date</em>: Wed Jul 6 16:50:35 2005</li>
- <li><em>from</em>: cfowler at outpostsentinel.com (Christopher Fowler)</li>
- <li><em>subject</em>: [ale] Parsing CSV file in perl</li>
Here is a sample piece of data.
Id LASTNAME FIRSTNAME
Adams Portia
10572 Alexander Robert
You can see that the first row does not have an ID. This can be true
for all columns. They may or may not have values.
Here is how I'm trying the parse:
open TXT, "< Expanded_2005_Select_1.csv";
while(<TXT>) {
m/^(\d+?)\t/;
print "$1\n";
}
Each columns is tab delimeted. When I run this I get the lastname in $1
for the first line and the the ID in $1 for the second line. I need to
somehow create a regex that would be unforgiving of nothing being there.
Data file looks like this:
1 "ID" "LASTNAME" "FIRSTNAME" "TITLE" "COMPANY"
"ADDRESS " "ADDRESS2" "CITY" "STATE" "ZIPCODE"
"COUNTRY" "PHONE" "EMAIL" "REGTYPE" "DATE" "TIME"
"Question1" "Questio n2" "Question3" "READERID"
2 "Adams" "Portia" "Director" "The Rockefeller
Univers ity" "1230 York Ave " "New York"
"NY" "10021-6 399" "USA" 2123277719
"adams at rockefeller.edu" "Member"
3 10572 "Alexander" "Robert" "Manager Voice & Video
Solution" "Air Products and Chemicals, Inc" "7201
Hamilton Blvd" "Allentown" "PA" "18195-1501"
"USA" "610-481-7156" "alexanrw at airproducts.com" "Member"
06/12/2005 06:06:14 pm 60711
The 1,2,3 that you see is the line numbers in VI
</pre>
<!--X-Body-of-Message-End-->
<!--X-MsgBody-End-->
<!--X-Follow-Ups-->
<hr>
<ul><li><strong>Follow-Ups</strong>:
<ul>
<li><strong><a name="00082" href="msg00082.html">[ale] Parsing CSV file in perl</a></strong>
<ul><li><em>From:</em> jimmyoliver at gmail.com (Jimmy Oliver)</li></ul></li>
<li><strong><a name="00095" href="msg00095.html">[ale] Parsing CSV file in perl</a></strong>
<ul><li><em>From:</em> fletch at phydeaux.org (Fletch)</li></ul></li>
</ul></li></ul>
<!--X-Follow-Ups-End-->
<!--X-References-->
<!--X-References-End-->
<!--X-BotPNI-->
<ul>
<li>Prev by Date:
<strong><a href="msg00077.html">[ale] sorting large delimited text files</a></strong>
</li>
<li>Next by Date:
<strong><a href="msg00079.html">[ale] link to an excellent review of SUSE 9.3 Professional</a></strong>
</li>
<li>Previous by thread:
<strong><a href="msg00076.html">[ale] sorting large delimited text files</a></strong>
</li>
<li>Next by thread:
<strong><a href="msg00082.html">[ale] Parsing CSV file in perl</a></strong>
</li>
<li>Index(es):
<ul>
<li><a href="maillist.html#00078"><strong>Date</strong></a></li>
<li><a href="threads.html#00078"><strong>Thread</strong></a></li>
</ul>
</li>
</ul>
<!--X-BotPNI-End-->
<!--X-User-Footer-->
<!--X-User-Footer-End-->
</body>
</html>