- <li><em>date</em>: Tue Jan 25 09:50:52 2005</li>
- <li><em>from</em>: kafka at antichri.st (George Carless)</li>
- <li><em>subject</em>: [ale] wget oddity grabbing pages with URL parameters</li>
I'm seeing some strange behaviour with wget. I'm trying to grab some pages from our
CMS-based intranet site, using the following:
wget -nH -r --level=1 -I valuemap --directory-prefix=/home/httpd/oursite/ <a rel="nofollow" href="http://oursite.com/valuemap/index.html">http://oursite.com/valuemap/index.html</a>
The problem: wget fetches all of the pages, but the ones whose URLs carry parameters, in the
format index.html?_function=detail&_op=1234 (which is what my CMS uses in this case), don't come
down properly. The files ARE created, with the appropriate file names, but each one contains the
content of the MAIN index.html, i.e. what the server returns when no URL parameters are passed.
(My app returns different content depending on the value of $_GET["_function"], in PHP
terminology.)
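One way to confirm that symptom, sketched under assumptions: the helper below (the name
list_same_as_main is made up for illustration) walks a download directory and prints every
file whose bytes match the main index.html. It assumes wget saved the pages with the query
string as part of the file name, which is its usual behaviour on Unix-like systems.

```shell
# Hypothetical helper: print every file in DIR whose content is
# byte-for-byte identical to DIR/index.html.
list_same_as_main() {
    dir=$1
    # Match files named index.html?<query>; the backslash makes the ? literal.
    for f in "$dir"/index.html\?*; do
        # Skip the unexpanded pattern when nothing matches.
        [ -e "$f" ] || continue
        # cmp -s prints nothing and exits 0 only when the files are identical.
        if cmp -s "$dir/index.html" "$f"; then
            printf '%s\n' "$f"
        fi
    done
}
```

With the command above, the directory to check would presumably be
/home/httpd/oursite/valuemap, e.g. "list_same_as_main /home/httpd/oursite/valuemap".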
BUT if I wget the pages manually, e.g. "wget
<a rel="nofollow" href="http://oursite.com/valuemap/index.html?_function=detail\&_op=1234">http://oursite.com/valuemap/index.html?_function=detail\&_op=1234</a>", the page is returned
correctly.
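A note on the backslash in that manual command, as a small sketch: it is there for the
shell, not for wget. An unquoted & would end the command and run it in the background, so
wget would only see the URL up to "detail". Single quotes do the same job as the backslash.
show_arg is a made-up helper that just prints its first argument.

```shell
# Hypothetical helper: print the first argument exactly as a command receives it.
show_arg() { printf '%s\n' "$1"; }

# Backslash-escaped ampersand, as in the manual command above.
# The ? is escaped as well so the shell cannot treat it as a glob character.
escaped=$(show_arg http://oursite.com/valuemap/index.html\?_function=detail\&_op=1234)

# Same URL in single quotes; nothing inside is special to the shell.
quoted=$(show_arg 'http://oursite.com/valuemap/index.html?_function=detail&_op=1234')

# Both lines show the same URL, with the query string intact.
printf '%s\n%s\n' "$escaped" "$quoted"
```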
Any of you smart folks have any idea what might be happening here?
Thanks,
--George
--------------------------------------
George Carless ... kafka at antichri.st
Words are just dust in deserts of sound
</pre>
<!--X-Body-of-Message-End-->
<!--X-MsgBody-End-->
<!--X-Follow-Ups-->
<hr>
<ul><li><strong>Follow-Ups</strong>:
<ul>
<li><strong><a name="01042" href="msg01042.html">[ale] wget oddity grabbing pages with URL parameters</a></strong>
<ul><li><em>From:</em> pras at cycloeastern.com (pras at cycloeastern.com)</li></ul></li>
</ul></li></ul>
<!--X-Follow-Ups-End-->
<!--X-References-->
<!--X-References-End-->
</body>
</html>