html-conversion-procs.tcl

  • last updated 2 hours ago
Constraints
Constraints: committers
 
Constraints: files
Constraints: dates
Use "ns_parsehtml" for speed and correctness when available

The new NaviServer built-in command "ns_parsehtml" is used on

several occasions in the code when available, namely in

- util_close_html_tags

- escaping disallowed HTML tags in ad_text_to_html

- util_close_html_tags

The command "util_close_html_tags" has a long history of troubles,

since it was implemented originally based on regular expressions. It

was reported in Jan 2009, that the function took on certain forums

entries 6 to 9 hours (e.g. /forums/message-view?message_id=357753) to

finish. Therefore, at this time a quick version based on tdom was

introduced, which reduced at that time the performance to 7.3

seconds. However, the tdom-based version tend to fail on invalid HTML,

so it could cover only a few cases. The new "ns_parsehtml" is more

robust and works very stable on openacs.org with the know problem

cases. Furthermore, this version is also significantly faster than the

tdom version.

  1. … 1 more file in changeset.
improve spelling

  1. … 6 more files in changeset.
Deprecate ad_parameter_all_values_as_list: it does not provide much value and was also often use improperly

  1. … 4 more files in changeset.
Fixed bug in util_convert_line_breaks_to_html

The code deleted spaces around certain tags, while - according to the

documentation, only line breaks should be removed from there.

Extended regression test.

  1. … 1 more file in changeset.
Fix quote_unquote_html test

Make ad_html_security_check configurable

ad_html_security_check has now three optional attributes

to make it configurable for different situations

-allowed_tags

-allowed_attributes

-allowed_protocols

If these attributes are not specified, the behavior is exactly like

before. This change makes it also easier to regression test this

function, since the behavior does not necessarily depend on a site's

parameter settings.

The function was also modernized, new regression tests were added.

  1. … 1 more file in changeset.
improve listing of test coverage

  1. … 17 more files in changeset.
improve spelling

  1. … 16 more files in changeset.
improve comments and spelling

  1. … 2 more files in changeset.
Rename string_truncate and string_truncate_middle to comply with OpenACS naming convention, create deprecated wrappers, replace occurrences

As string_truncate_middle was defined in xotcl-request-monitor, bring it to acs-tcl instead

  1. … 11 more files in changeset.
provide a better name for link in test case

add "blob" to the checked attributes

  1. … 1 more file in changeset.
add checking for javascript and data protocols (fixes issue #3413)

  1. … 3 more files in changeset.
improve spelling

  1. … 14 more files in changeset.
whitespace cleanup

  1. … 11 more files in changeset.
improving ad_html_to_text:

- handle src specs with "data:" (skip data)

- handle anchors with empty hrefs and internal links by not adding these to the text rendering

- add test cases to regression test

  1. … 1 more file in changeset.
Overhaul of test cases:

- make tested API calls explict

- reduce dependency on twt

  1. … 18 more files in changeset.
prefer "aa_equals" over "aa_true"+"string equal"

  1. … 5 more files in changeset.
Whitespace cleanup

Add 'Markdown' tests for 'ad_html_text_convert'

Add trivial tests for the other different kinds of space

- provide test case for html-to-text conversion

- provide a version of html-to-text based on ns_reflow_text

  1. … 2 more files in changeset.
Bug fix: avoid confusion between text argument and option, when text starts with "-"

  1. … 36 more files in changeset.
merged changes from the oacs-5-9 branch and resolved conflicts

  1. … 7834 more files in changeset.
Fix test cases for text conversion

  1. … 1 more file in changeset.
- increase robsutness of apm_package_installed_p_not_cached

- make ad_unquote capable to unquote ns_qutehtml

- fix test cases

  1. … 4 more files in changeset.
- prefer ns_quotehtml over ad_quotehtml

  1. … 36 more files in changeset.
- add editor hints to keep spaces/tabs in the furture more consistent

  1. … 754 more files in changeset.
Non-info file merge results, with conflicts fixed

  1. … 726 more files in changeset.
Fixed tests to match the tDOM-based tag closing code.