2016-07-17 4 views
-1

Scrapy 거미를 작동하지 마십시오 그리고 난 거미가 :내가 하나 개의 사이트를 분석

# -*- coding: utf-8 -*- 



from quoka.items import QuokaItem 
from scrapy.spiders import CrawlSpider, Rule 
from scrapy.linkextractors import LinkExtractor 
from scrapy.loader.processors import TakeFirst 
from scrapy.loader import XPathItemLoader 
from scrapy.selector import HtmlXPathSelector 

class QuokaLoader(XPathItemLoader): 
    default_output_processor = TakeFirst() 


class QuokaSpider(CrawlSpider): 

    name = "quoka" 
    allowed_domains = ["quoka.de"] 
    start_urls = ["http://www.quoka.de/immobilien/bueros-gewerbeflaechen/"] 

rules = (
     Rule(LinkExtractor(allow=('kleinanzeigen/cat_27_2710_ct_0_page_')), follow=True), 
     Rule(LinkExtractor(allow=('immobilien/bueros-gewerbeflaechen/')), callback='parse_item'), 
     ) 

def parse_item(self, response): 
    hxs = HtmlXPathSelector(response) 
    l = QuokaLoader(QuokaItem(), hxs) 

    # 
    l.add_xpath('date',response.xpath("/html/body/div[3]/div[2]/div[1]/main/div[8]/div/div[2]/strong/span/text()").extract()) 
    l.add_xpath('cost',response.xpath("/html/body/div[3]/div[2]/div[1]/main/div[8]/div/div[3]/div[2]/div[2]/text()").extract()) 
    # l.add_value('url', response.url) 

    return l.load_item() 

입력을 명령 sudo scrapy crawl quoka_spider.py

그러나 나는이 신비 오류가 있습니다

/home/gadzhibala/PycharmProjects/quoka/quoka/spiders/quoka_spider.py:14: ScrapyDeprecationWarning: quoka.spiders.quoka_spider.QuokaLoader inherits from deprecated class scrapy.loader.XPathItemLoader, please inherit from scrapy.loader.ItemLoader. (warning only on first subclass, there may be others) 
class QuokaLoader(XPathItemLoader): 
2016-07-17 14:07:01 [scrapy] INFO: Scrapy 1.1.1 started (bot: quoka) 
2016-07-17 14:07:01 [scrapy] INFO: Overridden settings: {'BOT_NAME': 'quoka', 'SPIDER_MODULES': ['quoka.spiders'], 'ROBOTSTXT_OBEY': True, 'NEWSPIDER_MODULE': 'quoka.spiders'} 
Traceback (most recent call last): 
File "/usr/local/lib/python3.5/dist-packages/scrapy/spiderloader.py", line 41, in load 
return self._spiders[spider_name] 
KeyError: 'quoka_spider.py' 

During handling of the above exception, another exception occurred: 

Traceback (most recent call last): 
File "/usr/local/bin/scrapy", line 11, in <module> 
sys.exit(execute()) 
File "/usr/local/lib/python3.5/dist-packages/scrapy/cmdline.py", line 142, in execute 
_run_print_help(parser, _run_command, cmd, args, opts) 
File "/usr/local/lib/python3.5/dist-packages/scrapy/cmdline.py", line 88, in _run_print_help 
func(*a, **kw) 
File "/usr/local/lib/python3.5/dist-packages/scrapy/cmdline.py", line 149, in _run_command 
cmd.run(args, opts) 
File "/usr/local/lib/python3.5/dist-packages/scrapy/commands/crawl.py", line 57, in run 
self.crawler_process.crawl(spname, **opts.spargs) 
File "/usr/local/lib/python3.5/dist-packages/scrapy/crawler.py", line 162, in crawl 
crawler = self.create_crawler(crawler_or_spidercls) 
File "/usr/local/lib/python3.5/dist-packages/scrapy/crawler.py", line 190, in create_crawler 
return self._create_crawler(crawler_or_spidercls) 
File "/usr/local/lib/python3.5/dist-packages/scrapy/crawler.py", line 194, in _create_crawler 
spidercls = self.spider_loader.load(spidercls) 
File "/usr/local/lib/python3.5/dist-packages/scrapy/spiderloader.py", line 43, in load 
raise KeyError("Spider not found: {}".format(spider_name)) 
KeyError: 'Spider not found: quoka_spider.py' 

내가 사용을 우분투 16.04, python3.5. scraby pip3 설치 Scraby를 설치하십시오. 나는 Scraby를 재설치하지만 성공하지는 못합니다. 어떻게 수정합니까?

+0

./home/gadzhibala/PyCharmproject/ –

+0

@MohammadAmin 홈/gadzhibala/PycharmProjects/quoka/quoka/거미 –

답변

0

당신은 다음과 대신, 거미의
를 이름 속성을 사용한다 :

sudo scrapy crawl quoka_spider.py 

는 입력 : 난 내 홈 디렉토리를 실행

scrapy crawl quoka 
관련 문제