Hash generator for HTML inline styles and scripts
Go to file
dependabot[bot] 11b6bb3caa
Bump pygments from 2.14.0 to 2.15.0 (#3)
Bumps [pygments](https://github.com/pygments/pygments) from 2.14.0 to 2.15.0.
- [Release notes](https://github.com/pygments/pygments/releases)
- [Changelog](https://github.com/pygments/pygments/blob/master/CHANGES)
- [Commits](https://github.com/pygments/pygments/compare/2.14.0...2.15.0)

- dependency-name: pygments
  dependency-type: indirect

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-09-29 16:32:53 +01:00
.github/workflows add linter job to the CI 2023-01-22 17:55:53 +00:00
inlinehashes bump version and update readme 2023-01-25 22:37:30 +00:00
tests bump version and update readme 2023-01-25 22:37:30 +00:00
.gitignore Add hash implementation and basic html parsing. The cli app should be usable. 2022-03-13 13:18:34 +00:00
LICENSE Initial commit 2022-03-11 19:01:01 +00:00
README.rst bump version and update readme 2023-01-25 22:37:30 +00:00
poetry.lock Bump pygments from 2.14.0 to 2.15.0 (#3) 2023-09-29 16:32:53 +01:00
pyproject.toml bump version and update readme 2023-01-25 22:37:30 +00:00



A small tool and library to generate the hashes of inline content that needs to be whitelisted when serving an HTML document
with a `Content-Security-Policy <https://developer.mozilla.org/en-US/docs/Web/HTTP/CSP>`_ (because, as the name indicates,
using ``unsafe-inline`` is not recommended).

You provide the HTML content (directly or through a file path/URL) then ``inlinehashes`` will parse the document and provide
you with a list of elements that need to be explicitly added to the CSP header/tag.

The tool can be specially useful for scenarios where you use/include external software solutions in your website or application
(such as a 3rd party CMS, etc), since it will allow you to detect changes after updates and edit you CSP accordingly.

*Quick note: Always verify the content you are whitelisting and be careful when fetching live website data, since any existing
XSS code will be included in the results.*

**At the moment this package is still in a very early stage, so it still doesn't detect all possible items and the current API
might change with future releases.**

Inline content that is currently detected:

* ``<script></script>`` tags
* ``<style></style>`` tags
* Many event handlers defined in element/tag attributes
* Styles defined directly in the element/tag using the ``style`` attribute


Using pip you just need to ``pip install inlinehashes``


The package can be used through 2 different ways, either by using the CLI interface or programmatically in your python project.
Bellow you can find a quick summary of the available functionality.

CLI app

This is the available functionality:

.. code::

  usage: inlinehashes [-h] [-a {sha256,sha384,sha512}] [-o {table,json,plain}] [-t {all,script-src,style-src}] source

  positional arguments:
    source                URL or local HTML file to check

    -h, --help            show this help message and exit
    -a {sha256,sha384,sha512}, --alg {sha256,sha384,sha512}
                          Hash algorithm to use (default: sha256)
    -o {table,json,plain}, --output {table,json,plain}
                          Format used to write the output (default: table)
    -t {all,script-src,style-src}, --target {all,script-src,style-src}
                          Target inline content to look for (default: all)

Here is an example of the output:

.. code::

    $inlinehashes https://ovalerio.net -a sha384 -o json
        "content": "\n      html {\n        height: 100%;\n      }\n      ",
        "hash": "sha384-Ku20lQH5qbr4EDPzXD2rf25rEHJNswNYRUNMPjYl7jCe0eHJYDe0gFdQpnKkFUTv",
        "directive": "style-src",
        "line": 12,
        "position": 0


Here is the same example, but using python's shell:

.. code:: python

    >>> import requests
    >>> import inlinehashes
    >>> content = requests.get("https://ovalerio.net").text
    >>> inlines = inlinehashes.parse(content)
    >>> inlines
    [Inline(line='17', position='0')]
    >>> first = inlines[0]
    >>> first.short_content
    '\n      html {\n        height: 100%;\n      }\n      '
    >>> first.sha256
    >>> first.sha384
    >>> first.sha512
    >>> first.content
    '\n      html {\n        height: 100%;\n      }\n      body {\n        background-image: url("...'


All contributions and improvements are welcome.