annotate .hgignore @ 6532:33b71a130b16

templates: properly escape inline JavaScript values

TLDR: Kallithea has issues with escaping values for use in inline JS. Despite judicious poking of the code, no actual security vulnerabilities have been found, just lots of corner-case bugs. This patch fixes those, and hardens the code against actual security issues.

The long version:

To embed a Python value (typically a 'unicode' plain-text value) in a larger file, it must be escaped in a context specific manner. Example:

  >>> s = u'<script>alert("It\'s a trap!");</script>'

1) Escaped for insertion into HTML element context

  >>> print cgi.escape(s)
  &lt;script&gt;alert("It's a trap!");&lt;/script&gt;

2) Escaped for insertion into HTML element or attribute context

  >>> print h.escape(s)
  &lt;script&gt;alert(&#34;It&#39;s a trap!&#34;);&lt;/script&gt;

  This is the default Mako escaping, as usually used by Kallithea.

3) Encoded as JSON

  >>> print json.dumps(s)
  "<script>alert(\"It's a trap!\");</script>"

4) Escaped for insertion into a JavaScript file

  >>> print '(' + json.dumps(s) + ')'
  ("<script>alert(\"It's a trap!\");</script>")

  The parentheses are not actually required for strings, but may be needed to avoid syntax errors if the value is a number or dict (object).

5) Escaped for insertion into a HTML inline <script> element

  >>> print h.js(s)
  ("\x3cscript\x3ealert(\"It's a trap!\");\x3c/script\x3e")

  Here, we need to combine JS and HTML escaping, further complicated by the fact that "<script>" tag contents can either be parsed in XHTML mode (in which case '<', '>' and '&' must additionally be XML escaped) or HTML mode (in which case '</script>' must be escaped, but not using HTML escaping, which is not available in HTML "<script>" tags). Therefore, the XML special characters (which can only occur in string literals) are escaped using JavaScript string literal escape sequences.

  (This, incidentally, is why modern web security best practices ban all use of inline JavaScript...)
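
The combination described in (5) can be sketched in Python 3 roughly as follows. The name `js_literal` is hypothetical, and the body is only an assumption about one way to produce the output shown above; it is not Kallithea's actual `h.js` code.

```python
import json


def js_literal(value):
    """Hypothetical helper: encode a Python value as a JavaScript expression
    that can be placed inside an inline <script> element.

    Assumption: JSON encoding followed by JS escape sequences for the XML
    special characters, as the text describes. Not the real h.js.
    """
    encoded = json.dumps(value)
    # In JSON output, '<', '>' and '&' can only appear inside string
    # literals, so swapping them for \xNN escapes keeps the value intact
    # while hiding '</script>' and XML markup from the HTML/XHTML parser.
    encoded = (encoded
               .replace('<', r'\x3c')
               .replace('>', r'\x3e')
               .replace('&', r'\x26'))
    # Parentheses avoid syntax errors when the value is a number or object.
    return '(' + encoded + ')'


s = '<script>alert("It\'s a trap!");</script>'
print(js_literal(s))
```

For the sample string this prints the same text as the `h.js(s)` line in the example above. Note that the result is not HTML escaped, so a script that later inserts the value into the page as HTML would still need HTML escaping on top.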

Unsurprisingly, Kallithea does not do (5) correctly. In most cases, Kallithea might slap a pair of single quotes around the HTML escaped Python value. A typical benign example:

  $('#child_link').html('${_('No revisions')}');

This works in English, but if a localized version of the string contains an apostrophe, the result will be broken JavaScript. In the more severe cases, where the text is user controllable, it leaves the door open to injections.

In this example, the script inserts the string as HTML, so Mako's implicit HTML escaping makes sense; but in many other cases, HTML escaping is actually an error, because the value is not used by the script in an HTML context.

The good news is that the HTML escaping thwarts attempts at XSS, since it's impossible to inject syntactically valid JavaScript of any useful complexity. It does allow JavaScript errors and gibberish to appear on the page, though.

In these cases, the escaping has been fixed to use either the new 'h.js' helper, which does JavaScript escaping (but not HTML escaping), OR the new 'h.jshtml' helper (which does both), in those cases where it was unclear if the value might be used (by the script) in an HTML context. Some of these can probably be "relaxed" from h.jshtml to h.js later, but for now, using h.jshtml fixes escaping and doesn't introduce new errors.

In a few places, Kallithea JSON encodes values in the controller, then inserts the JSON (without any further escaping) into <script> tags. This is also wrong, and carries actual risk of XSS vulnerabilities. However, in all cases, security vulnerabilities were narrowly avoided due to other filtering in Kallithea. (E.g. many special characters are banned from appearing in usernames.)

In these cases, the escaping has been fixed and moved to the template, making it immediately visible that proper escaping has been performed.

Mini-FAQ (frequently anticipated questions):

Q: Why do everything in one big, hard to review patch?
Q: Why add escaping in specific case FOO, it doesn't seem needed?

Because the goal here is to have "escape everywhere" as the default policy, rather than identifying individual bugs and fixing them one by one by adding escaping where needed. As such, this patch surely introduces a lot of needless escaping. This is no different from how Mako/Pylons HTML escape everything by default, even when not needed: it errs on the side of needless work, to prevent erring on the side of skipping required (and security critical) work.

As for reviewability, the most important thing to notice is not where escaping has been introduced, but any places where it might have been missed (or where h.jshtml is needed, but h.js is used).

Q: The added escaping is kinda verbose/ugly.

That is not a question, but yes, I agree. Hopefully it'll encourage us to move away from inline JavaScript altogether. That's a significantly larger job, though; with luck this patch will keep us safe and secure until such a time as we can implement the real fix.

Q: Why not use Mako filter syntax ("${val|h.js}")?

Because of long-standing Mako bug #140, preventing use of 'h' in filters.

Q: Why not work around bug #140, or even use straight "${val|js}"?

Because Mako still applies the default h.escape filter before the explicitly specified filters.

Q: Where do we go from here?

Longer term, we should stop doing variable expansions in script blocks, and instead pass data to JS via e.g. data attributes, or asynchronously using AJAX calls. Once we've done that, we can remove inline JavaScript altogether in favor of separate script files, and set a strict Content Security Policy explicitly blocking inline scripting, and thus also the most common kind of cross-site scripting attack.
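
As a rough illustration of the data-attribute route mentioned in the last answer, a value could be JSON encoded and then HTML attribute escaped on the server. This is a minimal Python 3 sketch: `data_attribute` and the `message` attribute name are hypothetical and not part of Kallithea, and the attribute name is assumed to be a fixed, valid identifier chosen by the developer.

```python
import html
import json


def data_attribute(name, value):
    """Hypothetical helper: render a Python value as an HTML data-* attribute.

    The value is JSON encoded first, then HTML escaped for use inside a
    double-quoted attribute, so only ordinary HTML escaping is involved.
    """
    encoded = html.escape(json.dumps(value), quote=True)
    return 'data-%s="%s"' % (name, encoded)


value = "It's a <trap> & more"
print('<div id="child_link" %s></div>' % data_attribute('message', value))
```

A script in a separate file could then read the value with `JSON.parse(element.dataset.message)`, so that no Python value is expanded inside a script block at all.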
author Søren Løvborg <sorenl@unity3d.com>
date Tue, 28 Feb 2017 17:19:00 +0100
parents 9358211ee144
children 42718729687e
rev   line source

848   a555d8345105 Small hgignore update
      Lukasz Balcerzak <lukasz.balcerzak@python-center.org>
      parents: 554
         1 syntax: glob
         2 *.pyc
         3 *.swp

1971  3dd89d30cd28 ignore sqlite files
      Marcin Kuzminski <marcin@python-works.com>
      parents: 1893
         4 *.sqlite

2555  b5c57e2176dc ignore tox stuff
      Marcin Kuzminski <marcin@python-works.com>
      parents: 2379
         5 *.tox

1236  a28bd9cb6549 added egg-info,and .egg in hgignore
      Marcin Kuzminski <marcin@python-works.com>
      parents: 848
         6 *.egg-info
         7 *.egg

4986  7d6c4bd58abd i18n: don't include .mo files in the repo - build them on demand
      Mads Kiilerich <madski@unity3d.com>
      parents: 4902
         8 *.mo

4997  9885bbacf99c .hgignore: Add .eggs/ and tarballcache/
      Marc Abramowitz <marc@marc-abramowitz.com>
      parents: 4986
         9 .eggs/
        10 tarballcache/

0     564e40829f80 initial commit.
      Marcin Kuzminski
      parents:
        11
        12 syntax: regexp

2106  95f1ed68cac1 ignore rcextensions
      Marcin Kuzminski <marcin@python-works.com>
      parents: 2007
        13 ^rcextensions

848   a555d8345105 Small hgignore update
      Lukasz Balcerzak <lukasz.balcerzak@python-center.org>
      parents: 554
        14 ^build

4096  9496c047ea4d added dist dir to .hgignore
      Marcin Kuzminski <marcin@python-works.com>
      parents: 3960
        15 ^dist/

848   a555d8345105 Small hgignore update
      Lukasz Balcerzak <lukasz.balcerzak@python-center.org>
      parents: 554
        16 ^docs/build/
        17 ^docs/_build/

9     058f63b6c2ff added hgignore
      Marcin Kuzminski
      parents: 0
        18 ^data$

3908  b619d9eef67a ignore sql_dumps for testing schema migrations
      Marcin Kuzminski <marcin@python-works.com>
      parents: 3831
        19 ^sql_dumps/

9     058f63b6c2ff added hgignore
      Marcin Kuzminski
      parents: 0
        20 ^\.settings$
        21 ^\.project$

72    596eb21f61d5 ignore db file
      Marcin Kuzminski <marcin@python-blog.com>
      parents: 9
        22 ^\.pydevproject$

1893  27c8836e6356 ignore coverage and my local dev.ini file
      Marcin Kuzminski <marcin@python-works.com>
      parents: 1833
        23 ^\.coverage$

4210  277684f23146 Rename some strings in defaults found in config files
      Bradley M. Kuhn <bkuhn@sfconservancy.org>
      parents: 4181
        24 ^kallithea\.db$

460   92cacbcb5272 ignore test db
      Marcin Kuzminski <marcin@python-works.com>
      parents: 327
        25 ^test\.db$

4212  24c0d584ba86 General renaming to Kallithea
      Bradley M. Kuhn <bkuhn@sfconservancy.org>
      parents: 4210
        26 ^Kallithea\.egg-info$

4902  03bbd33bc084 docs: rework stuff
      Mads Kiilerich <madski@unity3d.com>
      parents: 4212
        27 ^my\.ini$

2007  324ac367a4da Added VCS into rhodecode core for faster and easier deployments of new versions
      Marcin Kuzminski <marcin@python-works.com>
      parents: 1971
        28 ^fabfile.py

3831  bd39c1f70e35 Updated hgignore to skip .idea config files
      Marcin Kuzminski <marcin@python-works.com>
      parents: 3051
        29 ^\.idea$

5516  bfa66e8887d7 hgignore: ignore `.cache/`, created by pytest 2.8 when writing `.cache/v/cache/lastfailed`
      Mads Kiilerich <madski@unity3d.com>
      parents: 5416
        30 ^\.cache$

6038  9358211ee144 tests: move test data to top level 'data' folder - make grepping in kallithea/ easier
      Mads Kiilerich <madski@unity3d.com>
      parents: 5516
        31 /__pycache__$