[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
[no subject]
- <!--x-content-type: text/plain -->
- <!--x-date: Tue Jan 25 10:05:49 2005 -->
- <!--x-from-r13: cenf ng plpybrnfgrea.pbz (cenf ng plpybrnfgrea.pbz) -->
- <!--x-message-id: [email protected] -->
- <!--x-reference: [email protected] --> "http://www.w3.org/TR/html4/loose.dtd">
- <!--x-subject: [ale] wget oddity grabbing pages with URL parameters -->
- <li><em>date</em>: Tue Jan 25 10:05:49 2005</li>
- <li><em>from</em>: pras at cycloeastern.com (pras at cycloeastern.com)</li>
- <li><em>in-reply-to</em>: <<a href="msg01040.html">[email protected]</a>></li>
- <li><em>references</em>: <<a href="msg01040.html">[email protected]</a>></li>
- <li><em>subject</em>: [ale] wget oddity grabbing pages with URL parameters</li>
On Tue, Jan 25, 2005 at 09:41:56AM -0500, George Carless wrote:
> Hi all,
>
> I'm having a little strange behaviour with wget... I'm trying to grab some pages from our
> CMS-based intranet site, using the following:
>
> wget -nH -r --level=1 -I valuemap --directory-prefix=/home/httpd/oursite/ <a rel="nofollow" href="http://oursite.com/valuemap/index.html">http://oursite.com/valuemap/index.html</a>
>
> The problem is, it's grabbing all of the pages, BUT when it does so it seems not to be grabbing any
> pages properly when they're in the format index.html?_function=detail&_op=1234 (which is what my
> CMS uses in this case). The pages ARE downloaded--I wind up with a bunch of files with the
> appropriate file names, but each one has the content of the MAIN index.html as it would return if
> no url parameters were passed. (i.e. my app returns different content depending on the value of
> $_GET["_function"], in PHP terminology).
>
> BUT if I wget the pages manually - e.g. "wget
> <a rel="nofollow" href="http://oursite.com/valuemap/index.html?_function=detail\&_op=1234"">http://oursite.com/valuemap/index.html?_function=detail\&_op=1234"</a>; - the page is returned
> correctly.
>
> Any of you smart folks have any idea what might be happening here?
>
> Thanks,
> --George
>
> --------------------------------------
> George Carless ... kafka at antichri.st
> Words are just dust in deserts of sound
> _______________________________________________
> Ale mailing list
> Ale at ale.org
> <a rel="nofollow" href="http://www.ale.org/mailman/listinfo/ale">http://www.ale.org/mailman/listinfo/ale</a>
</pre>
<!--X-Body-of-Message-End-->
<!--X-MsgBody-End-->
<!--X-Follow-Ups-->
<hr>
<!--X-Follow-Ups-End-->
<!--X-References-->
<ul><li><strong>References</strong>:
<ul>
<li><strong><a name="01040" href="msg01040.html">[ale] wget oddity grabbing pages with URL parameters</a></strong>
<ul><li><em>From:</em> kafka at antichri.st (George Carless)</li></ul></li>
</ul></li></ul>
<!--X-References-End-->
<!--X-BotPNI-->
<ul>
<li>Prev by Date:
<strong><a href="msg01041.html">[ale] Sendmail latency</a></strong>
</li>
<li>Next by Date:
<strong><a href="msg01043.html">[ale] Consumer Rights Legal Issues presentation [was: Seeking LVM presentation]</a></strong>
</li>
<li>Previous by thread:
<strong><a href="msg01040.html">[ale] wget oddity grabbing pages with URL parameters</a></strong>
</li>
<li>Next by thread:
<strong><a href="msg01048.html">[ale] the new browser war</a></strong>
</li>
<li>Index(es):
<ul>
<li><a href="maillist.html#01042"><strong>Date</strong></a></li>
<li><a href="threads.html#01042"><strong>Thread</strong></a></li>
</ul>
</li>
</ul>
<!--X-BotPNI-End-->
<!--X-User-Footer-->
<!--X-User-Footer-End-->
</body>
</html>