Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Discover bots based on the entry points #2413

Merged
merged 14 commits into from
Nov 14, 2023
Merged
Show file tree
Hide file tree
Changes from 5 commits
Commits
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
2 changes: 1 addition & 1 deletion .gitignore
Original file line number Diff line number Diff line change
Expand Up @@ -11,7 +11,7 @@
*.profile
.vscode/
.profile
intelmq.egg-info
*.egg-info
build
dist
*.old
Expand Down
2 changes: 2 additions & 0 deletions CHANGELOG.md
Original file line number Diff line number Diff line change
Expand Up @@ -15,6 +15,7 @@ CHANGELOG
### Core
- `intelmq.lib.message`: For invalid message keys, add a hint on the failure to the exception: not allowed by configuration or not matching regular expression (PR#2398 by Sebastian Wagner).
- `intelmq.lib.exceptions.InvalidKey`: Add optional parameter `additional_text` (PR#2398 by Sebastian Wagner).
- Change the way we discover bots to allow easy extending based on the entry point name. (PR by Kamil Mankowski)
kamil-certat marked this conversation as resolved.
Show resolved Hide resolved

### Development

Expand All @@ -31,6 +32,7 @@ CHANGELOG

### Documentation
- Add a readthedocs configuration file to fix the build fail (PR#2403 by Sebastian Wagner).
- Add a guide of developing extensions packages (PR by Kamil Mankowski)
kamil-certat marked this conversation as resolved.
Show resolved Hide resolved

### Packaging

Expand Down
Empty file.
Empty file.
Empty file.
Original file line number Diff line number Diff line change
@@ -0,0 +1,21 @@
"""
SPDX-FileCopyrightText: 2023 CERT.at GmbH <https://cert.at/>
SPDX-License-Identifier: AGPL-3.0-or-later
"""

from intelmq.lib.bot import CollectorBot
kamil-certat marked this conversation as resolved.
Show resolved Hide resolved


class ExampleAdditionalCollectorBot(CollectorBot):
"""
This is an example bot provided by extension package
kamil-certat marked this conversation as resolved.
Show resolved Hide resolved
"""

def process(self):
report = self.new_report()
if self.raw: # noqa: Set as parameter
report['raw'] = 'test'
self.send_message(report)


BOT = ExampleAdditionalCollectorBot
44 changes: 44 additions & 0 deletions contrib/example-extension-package/setup.py
Original file line number Diff line number Diff line change
@@ -0,0 +1,44 @@
"""Example IntelMQ extension package

SPDX-FileCopyrightText: 2023 CERT.at GmbH <https://cert.at/>
SPDX-License-Identifier: AGPL-3.0-or-later
"""

from pathlib import Path
from setuptools import find_packages, setup


# Instead of the bot-autodiscovery below, you can also just manually declare entrypoints
# (regardless of packaging solution, even in pyproject.toml etc.), e.g.:
#
# intelmq.bots.collectors.custom.collector = mybots.bots.collectors.custom.collector:BOT.run
kamil-certat marked this conversation as resolved.
Show resolved Hide resolved
#
# Important is:
# - entry point has to start with `intelmq.bots.{type}` (type: collectors, experts, parsers, outputs)
# - target has to end with `:BOT.run`
# - entry points have to be in `console_scripts` group


BOTS = []

base_path = Path(__file__).parent / 'mybots/bots'
botfiles = [botfile for botfile in Path(base_path).glob('**/*.py') if botfile.is_file() and not botfile.name.startswith('_')]
for file in botfiles:
file = Path(str(file).replace(str(base_path), 'intelmq/bots'))
entry_point = '.'.join(file.with_suffix('').parts)
file = Path(str(file).replace('intelmq/bots', 'mybots/bots'))
module = '.'.join(file.with_suffix('').parts)
BOTS.append('{0} = {1}:BOT.run'.format(entry_point, module))

setup(
name='intelmq-example-extension',
version='1.0.0', # noqa: F821
maintainer='Your Name',
maintainer_email='[email protected]',
packages=find_packages(),
license='AGPLv3',
description='This is an example package to demonstrate how ones can extend IntelMQ.',
entry_points={
'console_scripts': BOTS
},
)
60 changes: 60 additions & 0 deletions docs/dev/guide.rst
Original file line number Diff line number Diff line change
Expand Up @@ -891,6 +891,66 @@ The databases `<` 10 are reserved for the IntelMQ core:
* 3: statistics
* 4: tests

****************************
Creating extensions packages
****************************

IntelMQ supports adding additional bots using your own independent packages. You can use this to
add a new integration that is special to you, or cannot be integrated for some reason
into the main IntelMQ repository.
kamil-certat marked this conversation as resolved.
Show resolved Hide resolved

Building an extension package
=============================

A simple example of the package can be found in ``contrib/example-extension-package``. In order to
bots to work with IntelMQ, you need to ensure that
kamil-certat marked this conversation as resolved.
Show resolved Hide resolved

- your bot's module exposes a ``BOT`` object of the class inherited from ``intelmq.lib.bot.Bot``
or its subclasses,
- your package registers an `entry point <https://packaging.python.org/en/latest/specifications/entry-points/>`_
in the ``console_scripts`` group with a name starting with ``intelmq.bots.`` followed by
the name of the group (collectors, experts, outputs, parsers), and then your original name.
The entry point must point to the ``BOT.run`` method,
- the module in which the bot resides must be importable by IntelMQ (e.g. installed in the same
virtualv).
kamil-certat marked this conversation as resolved.
Show resolved Hide resolved

Apart from these requirements, your package can use any of the usual package features. We strongly
recommend following the same principles and main guidelines as the official bots. This will ensure
experience when using official and additional bots.
kamil-certat marked this conversation as resolved.
Show resolved Hide resolved

Naming convention
=================

Building your own extensions gives you a lot of freedom, but it's important to know that if your
bot's entry point uses the same name as another bot, it may not be possible to use it, or to
determine which one is being used. For this reason, we recommend that you start the name of your
bot with an with an organization identifier and then the bot name.

For example, if I create a collector bot for feed source ``Special`` and run it on behalf of the
organization ``Awesome``, the suggested entry point might be ``intelmq.bots.collectors.awesome.special``.
Note that the structure of your package doesn't matter, as long as it can be imported properly.

For example, I could create a package called ``awesome-bots`` with the following file structure

.. code-block:: text

awesome_bots
├── pyproject.toml
└── awesome_bots
├── __init__.py
└── special.py

The `pyproject.toml <https://packaging.python.org/en/latest/specifications/declaring-project-metadata/#entry-points>`_
file would then have the following section:

.. code-block:: ini

[project.scripts]
intelmq.bots.collectors.awesome.special = "awesome_bots.special:BOT.run"

Once you installed your package, you can run ``intelmqctl list bots`` to check if your bot was
kamil-certat marked this conversation as resolved.
Show resolved Hide resolved
properly registered.

*************
Documentation
*************
Expand Down
29 changes: 20 additions & 9 deletions intelmq/lib/utils.py
Original file line number Diff line number Diff line change
Expand Up @@ -34,7 +34,7 @@
import textwrap
import traceback
import zipfile
from pathlib import Path
from sys import version_info
from typing import (Any, Callable, Dict, Generator, Iterator, Optional,
Sequence, Union)

Expand All @@ -43,7 +43,6 @@
import dns.version
import requests
from dateutil.relativedelta import relativedelta
from pkg_resources import resource_filename
from ruamel.yaml import YAML
from ruamel.yaml.scanner import ScannerError
from termstyle import red
Expand All @@ -52,6 +51,12 @@
from intelmq import RUNTIME_CONF_FILE
from intelmq.lib.exceptions import DecodingError

try:
from importlib.metadata import entry_points
except ImportError:
from importlib_metadata import entry_points


__all__ = ['base64_decode', 'base64_encode', 'decode', 'encode',
'load_configuration', 'load_parameters', 'log', 'parse_logline',
'reverse_readline', 'error_message_from_exc', 'parse_relative',
Expand Down Expand Up @@ -839,6 +844,14 @@ def file_name_from_response(response: requests.Response) -> str:
return file_name


def _get_console_entry_points():
# Select interface was introduced in Python 3.10 and newer importlib_metadata
entries = entry_points()
if hasattr(entries, "select"):
return entries.select(group="console_scripts")
return entries.get("console_scripts", []) # it's a dict


def list_all_bots() -> dict:
"""
Compile a dictionary with all bots and their parameters.
Expand All @@ -860,13 +873,11 @@ def list_all_bots() -> dict:
from intelmq.lib.bot import Bot # noqa: prevents circular import
bot_parameters = dir(Bot)

base_path = resource_filename('intelmq', 'bots')

botfiles = [botfile for botfile in pathlib.Path(base_path).glob('**/*.py') if botfile.is_file() and botfile.name != '__init__.py']
for file in botfiles:
file = Path(file.as_posix().replace(base_path, 'intelmq/bots'))
bot_entrypoints = filter(lambda entry: entry.name.startswith("intelmq.bots."), _get_console_entry_points())
kamil-certat marked this conversation as resolved.
Show resolved Hide resolved
for bot in bot_entrypoints:
try:
mod = importlib.import_module('.'.join(file.with_suffix('').parts))
module_name = bot.value.replace(":BOT.run", '')
mod = importlib.import_module(module_name)
except SyntaxError:
# Skip invalid bots
continue
Expand All @@ -884,7 +895,7 @@ def list_all_bots() -> dict:
for bot_type in ['CollectorBot', 'ParserBot', 'ExpertBot', 'OutputBot', 'Bot']:
name = name.replace(bot_type, '')

bots[file.parts[2].capitalize()[:-1]][name] = {
bots[module_name.split('.')[2].capitalize()[:-1]][name] = {
"module": mod.__name__,
"description": "Missing description" if not getattr(mod.BOT, '__doc__', None) else textwrap.dedent(mod.BOT.__doc__).strip(),
"parameters": keys,
Expand Down
1 change: 1 addition & 0 deletions setup.py
Original file line number Diff line number Diff line change
Expand Up @@ -17,6 +17,7 @@
'redis>=2.10',
'requests>=2.2.0',
'ruamel.yaml',
'importlib-metadata; python_version < "3.8"'
kamil-certat marked this conversation as resolved.
Show resolved Hide resolved
]

TESTS_REQUIRES = [
Expand Down