cybertools/agent/crawl
helmutm 771cf29cc2 work in progress: agent application with commandline controllers
git-svn-id: svn://svn.cy55.de/Zope3/src/cybertools/trunk@2496 fd906abe-77d9-0310-91a1-e0d9ade77398
2008-04-04 08:07:22 +00:00
..
__init__.py added cybertools.agent package (work in progress...) 2008-02-23 14:07:15 +00:00
base.py keep state information with jobs; provide feedback to master and controller via 'inform()' methods 2008-04-03 10:59:51 +00:00
mail.py keep state information with jobs; provide feedback to master and controller via 'inform()' methods 2008-04-03 11:17:37 +00:00
README.txt work in progress: agent application with commandline controllers 2008-04-04 08:07:22 +00:00

================================================
Agents for Job Execution and Communication Tasks
================================================

  ($Id$)

  >>> from cybertools.agent.base.agent import Master

  >>> config = '''
  ... controller(names=['core.sample'])
  ... scheduler(name='core')
  ... logger(name='default', standard=30)
  ... '''
  >>> master = Master(config)
  >>> master.setup()


Crawler
=======

The agent uses Twisted's cooperative multitasking model.

Crawler is the base class for all derived crawlers like the filesystem crawler
and the mailcrawler. The SampleCrawler returns a deferred that already had a
callback being called, so it will return at once.

Returns a deferred that must be supplied with a callback method (and in
most cases also an errback method).

We create the sample crawler via the master's controller. The sample
controller provides a simple method for this purpose.

  >>> controller = master.controllers[0]
  >>> controller.createAgent('crawl.sample', 'crawler01')

In the next step we request the start of a job, again via the controller.

  >>> controller.enterJob('sample', 'crawler01')

The job is not executed immediately - we have to hand over control to
the twisted reactor first.

  >>> from cybertools.agent.tests import tester
  >>> tester.iterate()
  SampleCrawler is collecting.
  Job 00001 completed; result: [];