work in progress: loops.agent specification
git-svn-id: svn://svn.cy55.de/Zope3/src/loops/trunk@1780 fd906abe-77d9-0310-91a1-e0d9ade77398
This commit is contained in:
parent
8616289e2c
commit
ad281796ea
6 changed files with 232 additions and 29 deletions
147
agent/README.txt
|
@ -11,35 +11,152 @@ This package does not depend on zope or the other loops packages
|
||||||
but represents a standalone application.
|
but represents a standalone application.
|
||||||
|
|
||||||
|
|
||||||
Basic Configuration
|
Basic Implementation, Agent Core
|
||||||
===================
|
================================
|
||||||
|
|
||||||
Parameter(s): URL of target loops site.
|
The agent uses Twisted's cooperative multitasking model.
|
||||||
|
|
||||||
(Future extension: use more than one target loops site)
|
This means that all calls to services (like crawler, transporter, ...)
|
||||||
|
return a deferred that must be supplied with a callback method (and in
|
||||||
|
most cases also an errback method).
|
||||||
|
|
||||||
|
|
||||||
|
Browser-based User Interface
|
||||||
|
============================
|
||||||
|
|
||||||
|
The user interface is provided via a browser-based application
|
||||||
|
based on Twisted and Nevow.
|
||||||
|
|
||||||
|
|
||||||
|
Configuration Management
|
||||||
|
========================
|
||||||
|
|
||||||
|
Functionality
|
||||||
|
|
||||||
|
- Storage of configuration parameters
|
||||||
|
- Interface to the browser-based user interface that allows the
|
||||||
|
editing of configuration parameters
|
||||||
|
|
||||||
|
|
||||||
|
Scheduling
|
||||||
|
==========
|
||||||
|
|
||||||
|
Configuration (per job)
|
||||||
|
|
||||||
|
- schedule, repeating pattern, conditions
|
||||||
|
- following job(s), e.g. to start a transfer immediately after a crawl
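The "following job(s)" setting can be pictured with a small sketch. It is a simplification under stated assumptions: the ``Job`` class and ``run_chain`` helper are hypothetical names, and execution is synchronous here, whereas the agent itself is specified to work with Twisted deferreds.

```python
class Job:
    """Hypothetical stand-in for an IScheduledJob implementation."""

    def __init__(self, name, params=None):
        self.name = name
        self.params = params or {}   # passed to execute() as keyword parameters
        self.successors = []         # jobs to run right after this one

    def execute(self, **kw):
        # A real job would crawl or transfer; this one only reports completion.
        return f"{self.name} done"


def run_chain(job):
    """Execute a job, then its successors, and return the results in order."""
    results = [job.execute(**job.params)]
    for successor in job.successors:
        results.extend(run_chain(successor))
    return results


crawl = Job("crawl", {"directory": "/tmp"})
transfer = Job("transfer")
crawl.successors.append(transfer)   # start the transfer immediately after the crawl
results = run_chain(crawl)
```

In this sketch ``results`` lists the crawl result first and the transfer result second.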
|
||||||
|
|
||||||
|
|
||||||
|
Crawling
|
||||||
|
========
|
||||||
|
|
||||||
|
General
|
||||||
|
-------
|
||||||
|
|
||||||
|
Functionality
|
||||||
|
|
||||||
|
- search for new or changed resources according to the search and
|
||||||
|
filter criteria
|
||||||
|
- keep a record of resources transferred already in order to avoid
|
||||||
|
duplicate transfers (?)
|
||||||
|
|
||||||
|
Configuration (per crawl job)
|
||||||
|
|
||||||
|
- predefined metadata
|
||||||
|
|
||||||
Local File System
|
Local File System
|
||||||
=================
|
-----------------
|
||||||
|
|
||||||
Configuration
|
Configuration (per crawl job)
|
||||||
-------------
|
|
||||||
|
|
||||||
Parameters: directories to search, schedules.
|
- directories to search
|
||||||
|
- filter criteria, e.g. file type
|
||||||
|
|
||||||
Logging info
|
Metadata sources
|
||||||
------------
|
|
||||||
|
|
||||||
|
- path, filename
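A minimal sketch of how a filesystem crawl job might apply these settings, using only the standard library. The function name ``collect_files`` and the shape of the metadata dictionary are assumptions for illustration, not part of the specification.

```python
import fnmatch
import os


def collect_files(directories, patterns=("*",)):
    """Yield (path, metadata) pairs for files matching any of the patterns."""
    for directory in directories:
        for dirpath, _dirnames, filenames in os.walk(directory):
            for filename in sorted(filenames):
                # Filter criteria: here only shell-style file name patterns.
                if any(fnmatch.fnmatch(filename, p) for p in patterns):
                    path = os.path.join(dirpath, filename)
                    # Metadata sources: path and filename of the resource.
                    yield path, {"path": dirpath, "filename": filename}
```

For example, ``list(collect_files(["/home/user/docs"], patterns=("*.pdf",)))`` would return the PDF files below that (hypothetical) directory together with their path/filename metadata.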
|
||||||
|
|
||||||
E-Mail-Clients
|
E-Mail-Clients
|
||||||
==============
|
--------------
|
||||||
|
|
||||||
|
Configuration (per crawl job)
|
||||||
|
|
||||||
|
- folders to search
|
||||||
|
- filter criteria (e.g. sender, receiver, subject patterns)
|
||||||
|
|
||||||
|
Metadata sources
|
||||||
|
|
||||||
|
- folder names (path)
|
||||||
|
- header fields (sender, receiver, subject, ...)
|
||||||
|
|
||||||
|
Special handling
|
||||||
|
|
||||||
|
- HTML vs. plain text content: if a mail contains both HTML and plain
|
||||||
|
text parts the transfer may be limited to one of these parts (configuration
|
||||||
|
setting)
|
||||||
|
- attachments may be ignored (configuration setting; useful when attachments
|
||||||
|
are copied to the local filesystem and transferred from there anyway)
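The special handling above could look roughly like this with Python's standard ``email`` package. The function ``select_parts`` and its parameters ``prefer`` and ``include_attachments`` are hypothetical configuration names chosen for this sketch; an actual mail client crawler (e.g. for Outlook) would obtain its messages differently.

```python
from email.message import EmailMessage


def select_parts(msg, prefer="plain", include_attachments=False):
    """Return (body_text, attachment_filenames) for one mail."""
    # Limit the transfer to one body variant ("plain" or "html").
    body = msg.get_body(preferencelist=(prefer,))
    text = body.get_content() if body is not None else None
    names = []
    if include_attachments:
        names = [part.get_filename() for part in msg.iter_attachments()]
    return text, names


# Sample message with a plain text part, an HTML part and one attachment.
msg = EmailMessage()
msg["Subject"] = "Report"
msg.set_content("plain version")
msg.add_alternative("<p>HTML version</p>", subtype="html")
msg.add_attachment(b"data", maintype="application",
                   subtype="octet-stream", filename="report.bin")
```

With the defaults, ``select_parts(msg)`` keeps only the plain text body and ignores the attachment.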
|
||||||
|
|
||||||
|
|
||||||
Software Update
|
Transport
|
||||||
|
=========
|
||||||
|
|
||||||
|
Configuration
|
||||||
|
|
||||||
|
- URL of the target loops site, e.g. http://z3.loops.cy55.de/bwp/d5
|
||||||
|
- username, password for logging in to loops
|
||||||
|
- machine name: name under which the client computer is known to the
|
||||||
|
loops server
|
||||||
|
- Transfer method, e.g. PUT
|
||||||
|
|
||||||
|
The following information is intended for the default transfer
|
||||||
|
protocol/method HTTP PUT but probably also pertains to other protocols
|
||||||
|
such as FTP.
|
||||||
|
|
||||||
|
Format/Information structure
|
||||||
|
----------------------------
|
||||||
|
|
||||||
|
- Metadata URL (for storing or accessing metadata sets - optional, see below):
|
||||||
|
``$loopsSiteURL/resource_meta/$machine_name/$service/$path.xml``
|
||||||
|
- Resource URL (for storing or accessing the real resources):
|
||||||
|
``$loopsSiteURL/resource_data/$machine_name/$service/$path``
|
||||||
|
- ``$service`` names the crawler service, e.g. "filesystem" or "outlook"
|
||||||
|
- ``$path`` represents the full path, possibly with drive specification in front
|
||||||
|
(for filesystem resources on Windows), with special characters URL-escaped
|
||||||
|
|
||||||
|
Note that the URL uniquely identifies the resource on the local computer.
|
||||||
|
A resource transferred from the same location (path and filename) as a
|
||||||
|
previously transferred resource will therefore overwrite the old version,
|
||||||
|
so that the classification of the resource within loops is not lost.
|
||||||
|
(This is of no relevance to emails.)
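The URL scheme above can be sketched as a small helper. The function name ``resource_urls`` and the machine name ``pc01`` are made up for illustration, and the handling of Windows drive letters and backslashes is an assumption, since the text only says that special characters are URL-escaped.

```python
from urllib.parse import quote


def resource_urls(site_url, machine_name, service, path):
    """Return (metadata_url, resource_url) for one local resource."""
    # Assumption: backslashes become "/" and everything except "/" is escaped.
    escaped = quote(path.replace("\\", "/").lstrip("/"), safe="/")
    base = site_url.rstrip("/")
    meta = f"{base}/resource_meta/{machine_name}/{service}/{escaped}.xml"
    data = f"{base}/resource_data/{machine_name}/{service}/{escaped}"
    return meta, data


meta_url, data_url = resource_urls(
    "http://z3.loops.cy55.de/bwp/d5", "pc01", "filesystem",
    "C:\\Docs\\My File.txt")
```

Because the URL is derived only from machine name, service and path, transferring the same file again yields the same URL, which is what makes the overwrite behaviour described above possible.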
|
||||||
|
|
||||||
|
Metadata sets are XML files with metadata for the associated resource.
|
||||||
|
Usually a metadata set has the extension ".xml"; if the extension is ".zip"
|
||||||
|
the metadata file is a compressed file that will be expanded on the
|
||||||
|
server.
|
||||||
|
|
||||||
|
Data files may also be compressed in which case there must be a corresponding
|
||||||
|
entry in the associated metadata set.
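A possible shape for a metadata set with nested metadata, serialized with the standard library. The XML layout (one element per key, a ``metadata`` root element) is purely an assumption for this sketch; the specification does not define the XML format.

```python
import xml.etree.ElementTree as ET


class MetadataSet(dict):
    """Hypothetical metadata mapping; values are strings or nested sets."""

    def setData(self, key, value):
        # Keys are used as element names, so they must be valid XML names.
        self[key] = value

    def _fill(self, element):
        for key, value in self.items():
            child = ET.SubElement(element, key)
            if isinstance(value, MetadataSet):
                value._fill(child)       # nested metadata is converted as well
            else:
                child.text = str(value)

    def asXML(self, root="metadata"):
        element = ET.Element(root)
        self._fill(element)
        return ET.tostring(element, encoding="unicode")


meta = MetadataSet()
meta.setData("filename", "a.txt")
headers = MetadataSet()
headers.setData("sender", "x@example.org")
meta.setData("mail", headers)
xml_text = meta.asXML()
```

The resulting string could then be stored under the metadata URL of the resource.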
|
||||||
|
|
||||||
|
|
||||||
|
Logging
|
||||||
|
=======
|
||||||
|
|
||||||
|
Configuration
|
||||||
|
|
||||||
|
- log format(s)
|
||||||
|
- log file(s) (or other forms of persistence)
|
||||||
|
|
||||||
|
|
||||||
|
Software Loader
|
||||||
===============
|
===============
|
||||||
|
|
||||||
|
Configuration (general)
|
||||||
|
|
||||||
Fin de partie
|
- source list: URL(s) of site(s) providing updated or additional packages
|
||||||
=============
|
|
||||||
|
Configuration (per install/update job)
|
||||||
|
|
||||||
|
- command: install, update, remove
|
||||||
|
- package names
|
||||||
|
|
||||||
(tearDown)
|
|
||||||
|
|
|
@ -26,9 +26,25 @@ from zope.interface import implements
|
||||||
from loops.agent.interfaces import IConfigurator
|
from loops.agent.interfaces import IConfigurator
|
||||||
|
|
||||||
|
|
||||||
class Configurator(object)
|
class Configurator(object):
|
||||||
|
|
||||||
implements(IConfigurator)
|
implements(IConfigurator)
|
||||||
|
|
||||||
def loadConfiguration(self):
|
def loadConfiguration(self):
|
||||||
pass
|
pass
|
||||||
|
|
||||||
|
def addConfigOption(self, key, value):
|
||||||
|
setattr(self, key, value)
|
||||||
|
|
||||||
|
def getConfigOption(self, key):
|
||||||
|
return getattr(self, key, None)
|
||||||
|
|
||||||
|
|
||||||
|
conf = Configurator()
|
||||||
|
|
||||||
|
# this is just for convenience during the development phase,
|
||||||
|
# thus we can retrieve the port easily via ``conf.ui.web.port``
|
||||||
|
conf.addConfigOption('ui', Configurator())
|
||||||
|
conf.ui.addConfigOption('web', Configurator())
|
||||||
|
conf.ui.web.addConfigOption('port', 10095)
|
||||||
|
|
||||||
|
|
|
@ -22,5 +22,5 @@ Filesystem crawler.
|
||||||
$Id$
|
$Id$
|
||||||
"""
|
"""
|
||||||
|
|
||||||
from loops.agent.interfaces import ICrawler
|
from loops.agent.interfaces import ICrawlingJob
|
||||||
|
|
||||||
|
|
|
@ -60,6 +60,8 @@ class IScheduledJob(Interface):
|
||||||
startTime = Attribute('Date/time at which the job should be executed.')
|
startTime = Attribute('Date/time at which the job should be executed.')
|
||||||
params = Attribute('Mapping with key/value pairs to be passed to the '
|
params = Attribute('Mapping with key/value pairs to be passed to the '
|
||||||
'execute method call as keyword parameters.')
|
'execute method call as keyword parameters.')
|
||||||
|
successors = Attribute('Jobs to execute immediately after this '
|
||||||
|
'one has been finished.')
|
||||||
|
|
||||||
def execute(**kw):
|
def execute(**kw):
|
||||||
""" Execute the job.
|
""" Execute the job.
|
||||||
|
@ -83,34 +85,56 @@ class ILogRecord(Interface):
|
||||||
"""
|
"""
|
||||||
|
|
||||||
|
|
||||||
class ICrawler(Interface):
|
class ICrawlingJob(IScheduledJob):
|
||||||
""" Collects resources.
|
""" Collects resources.
|
||||||
"""
|
"""
|
||||||
|
|
||||||
|
predefinedMetadata = Attribute('A mapping with metadata to be used '
|
||||||
|
'for all resources found.')
|
||||||
|
|
||||||
def collect(**criteria):
|
def collect(**criteria):
|
||||||
""" Return a collection of resources that should be transferred
|
""" Return a collection of resource/metadata pairs that should be transferred
|
||||||
the the server using the selection criteria given.
|
to the server using the selection criteria given.
|
||||||
|
"""
|
||||||
|
|
||||||
|
|
||||||
|
class IMetadataSet(Interface):
|
||||||
|
""" Metadata associated with a resource.
|
||||||
|
"""
|
||||||
|
|
||||||
|
def asXML():
|
||||||
|
""" Return an XML string representing the metadata set.
|
||||||
|
|
||||||
|
If this metadata set contains other metadata sets
|
||||||
|
(nested metadata) this will be converted to XML as well.
|
||||||
|
"""
|
||||||
|
|
||||||
|
def setData(key, value):
|
||||||
|
""" Set a metadata element.
|
||||||
|
|
||||||
|
The value may be a string or another metadata set
|
||||||
|
(nested metadata).
|
||||||
"""
|
"""
|
||||||
|
|
||||||
|
|
||||||
class ITransporter(Interface):
|
class ITransporter(Interface):
|
||||||
""" Transfers collected resources to the server. A resource need
|
""" Transfers collected resources to the server.
|
||||||
not be transferred immediately, resources may be be collected
|
|
||||||
first and transferred later together, e.g. as a compressed file.
|
|
||||||
"""
|
"""
|
||||||
|
|
||||||
serverURL = Attribute('URL of the server the resources will be '
|
serverURL = Attribute('URL of the server the resources will be '
|
||||||
'transferred to. The URL also determines the '
|
'transferred to. The URL also determines the '
|
||||||
'transfer protocol, e.g. HTTP or FTP.')
|
'transfer protocol, e.g. HTTP or FTP.')
|
||||||
method = Attribute('Transport method, e.g. PUT.')
|
method = Attribute('Transport method, e.g. PUT.')
|
||||||
|
machineName = Attribute('Name under which the local machine is '
|
||||||
|
'known to the server.')
|
||||||
|
userName = Attribute('User name for logging in to the server.')
|
||||||
|
password = Attribute('Password for logging in to the server.')
|
||||||
|
|
||||||
def transfer(resource, resourceType=file):
|
def transfer(resource, metadata=None, resourceType=file):
|
||||||
""" Transfer the resource (typically just a file that may
|
""" Transfer the resource (typically just a file that may
|
||||||
be read) to the server.
|
be read) to the server.
|
||||||
"""
|
|
||||||
|
|
||||||
def commit():
|
The resource may be associated with a metadata set.
|
||||||
""" Transfer all resources not yet transferred.
|
|
||||||
"""
|
"""
|
||||||
|
|
||||||
|
|
||||||
|
@ -122,7 +146,52 @@ class IConfigurator(Interface):
|
||||||
""" Find the configuration settings and load them.
|
""" Find the configuration settings and load them.
|
||||||
"""
|
"""
|
||||||
|
|
||||||
|
def setConfigOption(key, value):
|
||||||
|
""" Directly set a certain configuration option.
|
||||||
|
"""
|
||||||
|
|
||||||
def getConfigOption(key):
|
def getConfigOption(key):
|
||||||
""" Return the value for the configuration option identified
|
""" Return the value for the configuration option identified
|
||||||
by the key given.
|
by the key given.
|
||||||
|
|
||||||
|
In addition config options must be directly accessible
|
||||||
|
via attribute notation.
|
||||||
"""
|
"""
|
||||||
|
|
||||||
|
|
||||||
|
class IPackageManager(Interface):
|
||||||
|
""" Allows installing, updating, or removing software packages (plugins,
|
||||||
|
typically as Python eggs) from a server.
|
||||||
|
"""
|
||||||
|
|
||||||
|
sources = Attribute('A list of URLs that provide software packages. ')
|
||||||
|
|
||||||
|
def getInstalledPackages():
|
||||||
|
""" Return a list of dictionaries, format:
|
||||||
|
[{'name': name, 'version': version,
|
||||||
|
'date': date_time_of_installation,}, ...]
|
||||||
|
"""
|
||||||
|
|
||||||
|
def getUpdateCandidates():
|
||||||
|
""" Return a list of dictionaries with information about updateable
|
||||||
|
packages.
|
||||||
|
"""
|
||||||
|
|
||||||
|
def installPackage(name, version=None, source=None):
|
||||||
|
""" Install a package.
|
||||||
|
If version is not given try to get the most recent one.
|
||||||
|
If source is not given search the sources attribute for the
|
||||||
|
first fit.
|
||||||
|
"""
|
||||||
|
|
||||||
|
def updatePackage(name, version=None, source=None):
|
||||||
|
""" Update a package.
|
||||||
|
If version is not given try to get the most recent one.
|
||||||
|
If source is not given search the sources attribute for the
|
||||||
|
first fit.
|
||||||
|
"""
|
||||||
|
|
||||||
|
def removePackage(name):
|
||||||
|
""" Remove a package from this agent.
|
||||||
|
"""
|
||||||
|
|
||||||
|
|
|
@ -1,10 +1,11 @@
|
||||||
from twisted.application import internet, service
|
from twisted.application import internet, service
|
||||||
from nevow import appserver
|
from nevow import appserver
|
||||||
|
|
||||||
from loops.agent.agent import startAgent
|
from loops.agent.core import startAgent
|
||||||
from loops.agent.ui.web import AgentHome
|
from loops.agent.ui.web import AgentHome
|
||||||
|
from loops.agent.config import conf
|
||||||
|
|
||||||
port = 10095
|
port = conf.ui.web.port or 10095
|
||||||
|
|
||||||
application = service.Application('LoopsAgent')
|
application = service.Application('LoopsAgent')
|
||||||
|
|
||||||
|
|