diff --git a/testdata/AUTHORS.rst b/testdata/AUTHORS.rst
new file mode 100644
index 0000000..4a7de17
--- /dev/null
+++ b/testdata/AUTHORS.rst
@@ -0,0 +1,34 @@
+Credits
+=======
+
+The ``html5lib`` test data is maintained by:
+
+- James Graham
+- Geoffrey Sneddon
+
+
+Contributors
+------------
+
+- Adam Barth
+- Andi Sidwell
+- Anne van Kesteren
+- David Flanagan
+- Edward Z. Yang
+- Geoffrey Sneddon
+- Henri Sivonen
+- Ian Hickson
+- Jacques Distler
+- James Graham
+- Lachlan Hunt
+- lantis63
+- Mark Pilgrim
+- Mats Palmgren
+- Ms2ger
+- Nolan Waite
+- Philip Taylor
+- Rafael Weinstein
+- Ryan King
+- Sam Ruby
+- Simon Pieters
+- Thomas Broyer
diff --git a/testdata/LICENSE b/testdata/LICENSE
new file mode 100644
index 0000000..8812371
--- /dev/null
+++ b/testdata/LICENSE
@@ -0,0 +1,21 @@
+Copyright (c) 2006-2013 James Graham, Geoffrey Sneddon, and
+other contributors
+
+Permission is hereby granted, free of charge, to any person obtaining
+a copy of this software and associated documentation files (the
+"Software"), to deal in the Software without restriction, including
+without limitation the rights to use, copy, modify, merge, publish,
+distribute, sublicense, and/or sell copies of the Software, and to
+permit persons to whom the Software is furnished to do so, subject to
+the following conditions:
+
+The above copyright notice and this permission notice shall be
+included in all copies or substantial portions of the Software.
+
+THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND,
+EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF
+MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND
+NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE
+LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION
+OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION
+WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.
diff --git a/testdata/encoding/chardet/test_big5.txt b/testdata/encoding/chardet/test_big5.txt
new file mode 100644
index 0000000..91074c9
--- /dev/null
+++ b/testdata/encoding/chardet/test_big5.txt
@@ -0,0 +1,51 @@
+老子《道德經》 第一~四十章
+
+老子道經
+
+第一章
+
+道可道,非常道。名可名,非常名。無,名天地之始﹔有,名萬物之母。
+故常無,欲以觀其妙;常有,欲以觀其徼。此兩者,同出而異名,同謂之
+玄。玄之又玄,眾妙之門。
+
+第二章
+
+天下皆知美之為美,斯惡矣﹔皆知善之為善,斯不善矣。故有無相生,難
+易相成,長短相形,高下相傾,音聲相和,前後相隨。是以聖人處「無為
+」之事,行「不言」之教。萬物作焉而不辭,生而不有,為而不恃,功成
+而弗居。夫唯弗居,是以不去。
+
+第三章
+
+不尚賢,使民不爭﹔不貴難得之貨,使民不為盜﹔不見可欲,使民心不亂
+。是以「聖人」之治,虛其心,實其腹,弱其志,強其骨。常使民無知無
+欲。使夫智者不敢為也。為「無為」,則無不治。
+
+第四章
+
+「道」沖,而用之或不盈。淵兮,似萬物之宗﹔挫其銳,解其紛,和其光
+,同其塵﹔湛兮似或存。吾不知誰之子?象帝之先。
+
+第五章
+
+天地不仁,以萬物為芻狗﹔聖人不仁,以百姓為芻狗。天地之間,其猶橐
+蘥乎?虛而不屈,動而愈出。多言數窮,不如守中。
+
+第六章
+
+谷神不死,是謂玄牝。玄牝之門,是謂天地根。綿綿若存,用之不勤。
+
+第七章
+
+天長地久。天地所以能長且久者,以其不自生,故能長久。是以聖人後其
+身而身先,外其身而身存。非以其無私邪?故能成其私。
+
+第八章
+
+上善若水。水善利萬物而不爭。處眾人之所惡,故幾於道。居善地,心善
+淵,與善仁,言善信,政善治,事善能,動善時。夫唯不爭,故無尤。
+
+第九章
+
+持而盈之,不如其已﹔揣而銳之,不可長保。金玉滿堂,莫之能守﹔富貴
+而驕,自遺其咎。功遂身退,天之道。
diff --git a/testdata/encoding/scripted/tests1.dat b/testdata/encoding/scripted/tests1.dat
new file mode 100644
index 0000000..04d18bb
--- /dev/null
+++ b/testdata/encoding/scripted/tests1.dat
@@ -0,0 +1,5 @@
+#data
+
+
+#encoding
+iso-8859-2
diff --git a/testdata/encoding/test-yahoo-jp.dat b/testdata/encoding/test-yahoo-jp.dat
new file mode 100644
index 0000000..3629278
--- /dev/null
+++ b/testdata/encoding/test-yahoo-jp.dat
@@ -0,0 +1,10 @@
+#data
+
+
+ + +One
Two + #errors + 3: Missing document type declaration + #document + | + |
+ | + |+ | "One" + |
+ | "Two" diff --git a/testdata/tree-construction/adoption01.dat b/testdata/tree-construction/adoption01.dat new file mode 100644 index 0000000..38f98ef --- /dev/null +++ b/testdata/tree-construction/adoption01.dat @@ -0,0 +1,354 @@ +#data +
+#errors +(1,3): expected-doctype-but-got-start-tag +(1,10): adoption-agency-1.3 +#document +| +| +| +| +|
+|
+
+#data
+1 2
+| +| "2" +| "3" + +#data +13 +#errors +(1,3): expected-doctype-but-got-start-tag +(1,17): adoption-agency-1.3 +#document +| +|
+| +| +| "1" +| +#errors +(1,3): expected-doctype-but-got-start-tag +(1,12): adoption-agency-1.3 +#document +| +| +| +| +| "1" +| +| "2" +| +| "3" + +#data +1