2010-07-01から1ヶ月間の記事一覧

mechanize:WWW::Mechanize

>> require 'mechanize' => true >> WWW::Mechanize !!!!! DEPRECATION NOTICE !!!!! The WWW constant is deprecated, please switch to the new top-level Mechanize constant. WWW will be removed in Mechanize version 2.0 You've referenced the WWW c…

mechanize:Mechanize#visited_page

>> require 'mechanize' => true >> agent = Mechanize.new => #<Mechanize:0xb749737c ... >> agent.visited_page("http://www.foo.org/") => nil >> agent.visited_page("http://www.google.com/") => nil >> agent.get("http://www.google.com/") => #</mechanize:0xb749737c>

mechanize:Mechanize#visited?

>> require 'mechanize' => true >> agent = Mechanize.new => #<Mechanize:0xb74f835c ... >> agent.visited?("http://www.foo.org/") => false >> agent.visited?("http://www.google.com/") => false >> agent.get("http://www.google.com/") => #</mechanize:0xb74f835c>

mechanize:Mechanize#user_agent_alias=

>> require 'mechanize' => true >> agent = Mechanize.new => #<Mechanize:0xb745335c ... >> agent.user_agent => "WWW-Mechanize/1.0.0 (http://rubyforge.org/projects/mechanize/)" >> agent.user_agent_alias = "Windows IE 6" => "Windows IE 6" >> agent.user_agent_alias = "Wi</mechanize:0xb745335c>…

mechanize:Mechanize#transact

require 'mechanize' agent = Mechanize.new agent.get('http://www.yahoo.co.jp/') p agent.history.size agent.get('http://www.google.com/') p agent.history.size agent.transact do |x| page = x.get('http://google.com/') google_form = page.form('…

mechanize:Mechanize#submit

以下のような HTML と <html> <head> <title>form test</title> </head> <body> <form method="post" action="cgi/test_form00.cgi" name="form1"> <p> 氏名:<input type="text" name="text1" /> 性別: </p></form></body></html>

mechanize:Mechanize#request_with_entity

以下のような確認用 CGI を用意(再利用)し、webrick で確認 #!/usr/local/ruby-1.8.7/bin/ruby require "cgi" print "Content-Type: text/html\n\n" print "<html><head></head><body>" c = CGI.new ["text1", "radio1", "checkbox1", "select1", "textarea1"].each do |x| puts "#{x}</body></html>…

mechanize:Mechanize#put

post のことかと勘違いしてしまったが、違った。put を呼んだら、webrick が以下を表示していた。 [2010-xx-xx 08:23:51] ERROR unsupported method `PUT'.

mechanize:Mechanize#post_connect_hooks, #pre_connect_hooks

mechanize.rb の中を確認。詳細の挙動や使い方は不明 def initialize ... @pre_connect_hook = Chain::PreConnectHook.new @post_connect_hook = Chain::PostConnectHook.new def pre_connect_hooks @pre_connect_hook.hooks end def post_connect_hooks @po…

mechanize:Mechanize#post

以下のような確認用 CGI を用意(再利用)し、webrick で確認 #!/usr/local/ruby-1.8.7/bin/ruby require "cgi" print "Content-Type: text/html\n\n" print "<html><head></head><body>" c = CGI.new ["text1", "radio1", "checkbox1", "select1", "textarea1"].each do |x| puts "#{x}</body></html>…

mechanize:Mechanize#max_history, #max_history=

>> require 'mechanize' => true >> agent = Mechanize.new => #<Mechanize:0xb74a92e8 ... >> agent.max_history => nil >> agent.history.size => 0 >> agent.get('http://www.google.com/') => #<Mechanize::Page {url #<URI::HTTP:0xb749ab58 URL:http://www.google.co.jp/>} ... >> agent.history.size => 2 >> agent.…</mechanize::page></mechanize:0xb74a92e8>

mechanize:Mechanize#head

>> require 'mechanize' => true >> agent = Mechanize.new => #<Mechanize:0xb74fa328 ... >> agent.head('http://www.google.com/') => #<Mechanize::Page {url #<URI::HTTP:0xb74ee370 URL:http://www.google.co.jp/>} {meta} {title nil} {iframes} {frames} {links} {forms}> >> agent.head('http://www.google.com/')…</mechanize::page></mechanize:0xb74fa328>

mechanize:Mechanize#get_file

>> require 'mechanize' => true >> agent = Mechanize.new => #<Mechanize:0xb74662e0 ... >> agent.get_file('http://www.google.com/') => "<html><head><meta http-equiv=\"content-type\" ... >> f = agent.get_file('http://www.google.com/') => "<html><head></head></html></meta></head></html></mechanize:0xb74662e0>

mechanize:Mechanize#get

「agent.get({:url => 'http://127.0.0.1:8080/test_form00.html'}, [], "http://foo.org")」のようにやってみたが、"http://foo.org" は Referer に出ていないような気もする。ログだけの問題かもしれないけど。

mechanize:Mechanize#delete

>> require 'mechanize' => true >> agent = Mechanize.new => #<Mechanize:0xb742937c ... >> page = agent.get('http://www.google.com/') => #<Mechanize::Page {url #<URI::HTTP:0xb7447674 URL:http://www.google.co.jp/>} ... >> google_form = page.form('f') => #<Mechanize::Form ... >> google_form.q => "" >> a…</mechanize::form></mechanize::page></mechanize:0xb742937c>

mechanize:Mechanize#current_page

>> require 'mechanize' => true >> agent = Mechanize.new => #<Mechanize:0xb742c2e8 >> agent.current_page => nil >> agent.get('http://google.com/') => #<Mechanize::Page {url #<URI::HTTP:0xb741b6f0 URL:http://www.google.co.jp/>} >> agent.current_page => #</mechanize::page></mechanize:0xb742c2e8>

mechanize:Mechanize#cookies

>> require 'mechanize' => true >> agent = Mechanize.new => #<Mechanize:0xb74b5840 ... >> agent.cookies => [] >> agent.get('http://google.com/') => #<Mechanize::Page {url #<URI::HTTP:0xb74a4d60 URL:http://www.google.co.jp/>} ... >> agent.cookies => [#</mechanize::page></mechanize:0xb74b5840>

mechanize:Mechanize#click

>> require 'mechanize' => true >> agent = Mechanize.new => #<Mechanize:0xb74b27f8 ... >> agent.get('http://google.com/') => #<Mechanize::Page {url #<URI::HTTP:0xb74a2c54 URL:http://www.google.co.jp/>} ... >> link = agent.page.link_with(:text => 'ニュース') => #<Mechanize::Page::Link ... >> agent.click(l…</mechanize::page::link></mechanize::page></mechanize:0xb74b27f8>

mechanize:Mechanize#back

>> require 'mechanize' => true >> agent = Mechanize.new => #<Mechanize:0xb74dd840 ... >> agent.get("http://www.google.com/") => #<Mechanize::Page {url #<URI::HTTP:0xb74d0f3c URL:http://www.google.co.jp/>} ... >> agent.get("http://www.yahoo.com/") => #</mechanize::page></mechanize:0xb74dd840>

mechanize:Mechanize#auth, #basic_auth

>> require 'mechanize' => true >> agent = Mechanize.new => #<Mechanize:0xb74e97f8 ... >> agent.get("http://c*.*.*.com/") Mechanize::ResponseCodeError: 401 => Net::HTTPUnauthorized ... >> agent.auth("user-foo", "********") => "********" >> agent.get("http://c*.*.*.co</mechanize:0xb74e97f8>…

mechanize:Mechanize#log, #log=

>> require 'mechanize' => true >> agent = Mechanize.new => #<Mechanize:0xb743d840 ... >> agent.log => nil >> require 'logger' => true >> agent = Mechanize.new { |a| a.log = Logger.new("hoge.log") } => #<Mechanize:0xb73f6eb8 ... >> agent.log => #</mechanize:0xb73f6eb8></mechanize:0xb743d840>

mechanize:Mechanize#pluggable_parser

>> require 'mechanize' => true >> agent = Mechanize.new => #<Mechanize:0xb74281d4 ... >> agent.pluggable_parser => #<Mechanize::PluggableParser:0xb742815c @default=Mechanize::File, @parsers={"application/xhtml+xml"=>Mechanize::Page, "text/html"=>Mechanize::Page, "application/vnd.wap.xhtml+xml"=>Mech…</mechanize::pluggableparser:0xb742815c></mechanize:0xb74281d4>

mechanize:Mechanize#html_parser

>> require 'mechanize' => true >> agent = Mechanize.new => #<Mechanize:0xb742afec ... >> agent.html_parser => Nokogiri::HTML</mechanize:0xb742afec>

mechanize:Mechanize#redirection_limit, #redirection_limit=

>> require 'mechanize' => true >> agent = Mechanize.new => #<Mechanize:0xb742c888 ... >> agent.redirection_limit => 20 >> agent.redirection_limit = 10 => 10 >> agent.redirection_limit => 10</mechanize:0xb742c888>

mechanize:Mechanize#scheme_handlers

>> require 'mechanize' => true >> agent = Mechanize.new => #<Mechanize:0xb742c888 ... >> agent.scheme_handlers => {"https"=>#<Proc:0xb7b5a2e0@/usr/local/ruby-1.8.7/lib/ruby/gems/1.8/gems/mechanize-1.0.0/lib/mechanize.rb:169>, "file"=>#</proc:0xb7b5a2e0@/usr/local/ruby-1.8.7/lib/ruby/gems/1.8/gems/mechanize-1.0.0/lib/mechanize.rb:169></mechanize:0xb742c888>

mechanize:Mechanize#history_added, #history_added=, #history

>> require 'mechanize' => true >> agent = Mechanize.new => #<Mechanize:0xb743986c ... >> agent.history_added => nil >> agent.history => [] >> agent.get("http://www.google.com/") => #<Mechanize::Page ... >> agent.history => [#<Mechanize::Page ... >> agent.history.size => 2 >> agent.get("http://www.go…</mechanize::page></mechanize::page></mechanize:0xb743986c>

mechanize:Mechanize#keep_alive, #keep_alive=

>> require 'mechanize' => true >> agent = Mechanize.new => #<Mechanize:0xb7382ff4 ... >> agent.keep_alive => true</mechanize:0xb7382ff4>

mechanize:Mechanize#keep_alive_time, #keep_alive_time=

>> require 'mechanize' => true >> agent = Mechanize.new => #<Mechanize:0xb7385f38 ... >> agent.keep_alive_time => 300</mechanize:0xb7385f38>

mechanize:Mechanize#gzip_enabled, #gzip_enabled=

>> require 'mechanize' => true >> agent = Mechanize.new => #<Mechanize:0xb7387d38 ... >> agent.gzip_enabled => true</mechanize:0xb7387d38>

mechanize:Mechanize#redirect_ok, #redirect_ok=

>> require 'mechanize' => true >> agent = Mechanize.new => #<Mechanize:0xb7440888 ... >> agent.redirect_ok => true >> agent.get("http://www.google.com/") => #<Mechanize::Page {url #<URI::HTTP:0xb7432f6c URL:http://www.google.co.jp/>} {meta} {title "Google"} ... >> agent.redirect_ok = false => false …</mechanize::page></mechanize:0xb7440888>