1. 2a18039 Update copyright notice to new format by Matt Jones · 2 years, 7 months ago main
  2. 36c7178 Migrate base::Value::GetList() to base::Value::GetListDeprecated(): 3/N. by Daniel Cheng · 3 years, 3 months ago
  3. b81fa59 Add scheduled GitHub Actions by Wei-Yin Chen (陳威尹) · 3 years, 8 months ago
  4. 6092ab0 Add support for GitHub Actions by Wei-Yin Chen (陳威尹) · 3 years, 9 months ago
  5. 166ccff Remove Travis CI integration by Wei-Yin Chen (陳威尹) · 3 years, 9 months ago
  6. 9652bdc Update protoc_plugins to match what's in chromium by Andrew Grieve · 4 years ago
  7. 3eef8a0 Strip all unsafe HTML attributes using an allowlist. by Dominic Mazzoni · 4 years, 11 months ago
  8. a1612e0 Update and improve readability of README.md. by Aran Gilman · 5 years ago
  9. 56b365d Use goog:loggingPrefs in get_screenshots.py by Wei-Yin Chen (陳威尹) · 6 years ago
  10. 4072533 Bump selenium version to 3.141.0 by Wei-Yin Chen (陳威尹) · 6 years ago
  11. 5438a7b Use --enable-distillability-service for native features by Wei-Yin Chen (陳威尹) · 6 years ago
  12. 45fe4a6 Update Travis CI environment from Ubuntu 14.04 to 18.04 by Wei-Yin Chen (陳威尹) · 6 years ago
  13. ba2c5d9 Adapt new version scheme of Chrome Driver by Wei-Yin Chen (陳威尹) · 6 years ago
  14. c38fd83 Use Open JDK 8 by Wei-Yin Chen (陳威尹) · 6 years ago
  15. ccfe233 Fix EmbedExtractorTest.testDivCaption by Wei-Yin Chen (陳威尹) · 7 years ago
  16. 8825eaf In tests, render elements before getting their innerText by Wei-Yin Chen (陳威尹) · 7 years ago
  17. 8c9af2c Make generateOutput(textOnly=true) standard compliant by Wei-Yin Chen (陳威尹) · 7 years ago
  18. 9596033 Skip elements with zero area when finding article element by Wei-Yin Chen (陳威尹) · 7 years ago
  19. 38037b3 Update docs after GoB/gerrit migration by Wei-Yin Chen (陳威尹) · 8 years ago
  20. 0bde315 Fix false positive of next page detection by Wei-Yin Chen (陳威尹) · 8 years ago
  21. f1d9b2d Upload dom-distiller changes to Gerrit by default by Aaron Gable · 8 years ago
  22. 4405dfd Fix build status badge image on GitHub by Wei-Yin Chen (陳威尹) · 8 years ago
  23. 8de0cac Handle image lazy loading on Wikipedia by Wei-Yin Chen (陳威尹) · 8 years ago
  24. 9c2021c Ignore the "Edit" links in wiki pages by Wei-Yin Chen (陳威尹) · 8 years ago
  25. cd538d1 Unhide folded sections in wiki pages by Wei-Yin Chen (陳威尹) · 8 years ago
  26. 4d66ebb Make sure images are loaded before running tests by Wei-Yin Chen (陳威尹) · 8 years ago
  27. 2d7b629 Make sure lazily-loaded images have absolute URL by Wei-Yin Chen (陳威尹) · 8 years ago
  28. 489c660 Skip some social sharing elements by Wei-Yin Chen (陳威尹) · 8 years ago
  29. 5da4878 Skip non-text elements in <a> without href in <figcaption> by Wei-Yin Chen (陳威尹) · 8 years ago
  30. 6bd2ebb Fix more partially hidden articles by Wei-Yin Chen (陳威尹) · 8 years ago
  31. 4540f35 Support <picture> in image extraction by Wei-Yin Chen (陳威尹) · 8 years ago
  32. aab1a1b Revert "Replace create_standalone_js.py with a custom GWT linker" by Wei-Yin Chen (陳威尹) · 8 years ago
  33. ac91168 Use stricter comment-detecting heuristics by Wei-Yin Chen (陳威尹) · 8 years ago
  34. 936bafa Skip <svg> elements when scanning the DOM by Wei-Yin Chen (陳威尹) · 8 years ago
  35. 315e460 Skip lead image finder when no text content by Wei-Yin Chen (陳威尹) · 8 years ago
  36. 072fe57 Recognize H4 to H6 as headings as well by Wei-Yin Chen (陳威尹) · 9 years ago
  37. 52047b4 Avoid using getClassName() to avoid issues with <svg> by Wei-Yin Chen (陳威尹) · 9 years ago
  38. 8cf93ce Bump ChromeDriver version to 2.24 by Wei-Yin Chen (陳威尹) · 9 years ago
  39. d876125 Add gen_mhtml_corpus.py to convert MHTML to eval corpus by Wei-Yin Chen (陳威尹) · 9 years ago
  40. 8b33c8b Amend "Fix partially hidden article" by Wei-Yin Chen (陳威尹) · 9 years ago
  41. 3fd2017 Strip unwanted classNames from all nodes by Wei-Yin Chen (陳威尹) · 9 years ago
  42. d16a68c Fix partially hidden article by Wei-Yin Chen (陳威尹) · 9 years ago
  43. 85447bd Ignore exceptions in embed extractors by Wei-Yin Chen (陳威尹) · 9 years ago
  44. 50efabe Skip unrecognized iframes by Wei-Yin Chen (陳威尹) · 9 years ago
  45. 23830cf Use .innerText in text-only mode by Wei-Yin Chen (陳威尹) · 9 years ago
  46. 7306201 Avoid creating container when possible by Wei-Yin Chen (陳威尹) · 9 years ago
  47. 5323f43 Forbid usage of GWT's Element.getInnerText() by Wei-Yin Chen (陳威尹) · 9 years ago
  48. b21e6a3 Remove //base dependency by Wei-Yin Chen (陳威尹) · 9 years ago
  49. 91f9f01 Fix figcaption generation by Wei-Yin Chen (陳威尹) · 9 years ago
  50. 365c44e Add support for figure element by marcelorcorrea · 9 years ago
  51. f8f3308 Update distillability modeling scripts to predict long articles by Wei-Yin Chen (陳威尹) · 9 years ago
  52. 8a12e18 Decrease mismatches in feature extraction by Wei-Yin Chen (陳威尹) · 9 years ago
  53. 4d7ab13 Extract image URLs in WebTables by Wei-Yin Chen (陳威尹) · 9 years ago
  54. 8d8063a Extract image URLs in srcset as well by Wei-Yin Chen (陳威尹) · 9 years ago
  55. 34c4a18 Re-enable tests containing <track> in CI by Wei-Yin Chen (陳威尹) · 9 years ago
  56. 0d4286b The display style of WebText root element should never be inline by Wei-Yin Chen (陳威尹) · 9 years ago
  57. 6c16f14 Warm up image cache for data URI used in tests by Wei-Yin Chen (陳威尹) · 9 years ago
  58. 301ea1e Fix malformed test filter by Wei-Yin Chen (陳威尹) · 9 years ago
  59. 9b606cb Remove files for drone.io by Wei-Yin Chen (陳威尹) · 9 years ago
  60. 7e3b63e Fix schema.org article matching by Wei-Yin Chen (陳威尹) · 9 years ago
  61. 54d05ba Discard hidden articles when using fast path by marcelorcorrea · 9 years ago
  62. f6d2dc1 Support extraction of lazily-loaded images by Wei-Yin Chen (陳威尹) · 9 years ago
  63. 11fdddc Strip "target" attribute from anchor elements by Wei-Yin Chen (陳威尹) · 9 years ago
  64. 65c0b6d Fix for LeadImage getting images after last relevant content by vinicius.suzuki · 9 years ago
  65. 0455a46 Support deprecated <object> API of Youtube embeds by Wei-Yin Chen (陳威尹) · 9 years ago
  66. 8637690 Fix some warnings in Eclipse by Wei-Yin Chen (陳威尹) · 9 years ago
  67. 0adf24a <track> should not have end tags by Wei-Yin Chen (陳威尹) · 9 years ago
  68. 2b5538e Remove scoped_ptr use from json_values_converter.py. by Daniel Cheng · 9 years ago
  69. c3edb38 Add travis-ci support by Wei-Yin Chen (陳威尹) · 9 years ago
  70. adbdbec Fix for Schema.org title extraction. by mlongaray · 9 years ago
  71. f40e521 Lazy evaluation of MarkupParsers by marcelorcorrea · 9 years ago
  72. 84cfcd2 Retain image sizes by dalmirsilva · 9 years ago
  73. 3b3253f Make long strings easier to compare in assertion failure by Wei-Yin Chen (陳威尹) · 9 years ago
  74. fc19f02 Clean attributes of mRoot between tests properly by Wei-Yin Chen (陳威尹) · 9 years ago
  75. ccefff9 Handle drop cap letters by Wei-Yin Chen (陳威尹) · 9 years ago
  76. 9101ca4 Fix spelling in comments by Wei-Yin Chen (陳威尹) · 9 years ago
  77. cfb7588 Update README.md by Wei-Yin Chen (陳威尹) · 9 years ago
  78. 4d0809f Reformat README.md to Google style by Wei-Yin Chen (陳威尹) · 9 years ago
  79. 72998ef Add scripts for distillability modeling by Wei-Yin Chen (陳威尹) · 9 years ago
  80. 7310b0f Process the root element in various DomUtil functions by Wei-Yin Chen (陳威尹) · 9 years ago
  81. 349b932 Remove DOM Distiller dependency on scoped_ptr<T>::Pass(). by dcheng · 9 years ago
  82. 8fa737b Add <pre> to nesting tags by Wei-Yin Chen (陳威尹) · 9 years ago
  83. 7991208 Clean up attributes of image elements by Wei-Yin Chen (陳威尹) · 9 years ago
  84. 6e47a67 dom_distiller: add option to select pagination algorithm by Kuan Tan · 9 years ago
  85. 5eaf86d implement fixes for new multi-page dataset by Kuan Tan · 9 years ago
  86. e55d6a4 Fix iOS crash related to window.performance by gnachman · 10 years ago
  87. 36a509c implement parser for new pagination algorithm by Kuan Tan · 10 years ago
  88. 6c7b59e Amend "Strip style attributes from all tags" by Wei-Yin Chen (陳威尹) · 10 years ago
  89. 21145c3 Add blockquote tag to the nestingTags in WebTag class. by marcelorcorrea · 10 years ago
  90. a4e94dd Fix for keeping lists structure by marcelorcorrea · 10 years ago
  91. 152a23e Make URLs in srcset absolute by Wei-Yin Chen (陳威尹) · 10 years ago
  92. 85fab03 Forbid "next article" in Chinese in pagination detection by Wei-Yin Chen (陳威尹) · 10 years ago
  93. b83e819 Refactor attribute stripping by Wei-Yin Chen (陳威尹) · 10 years ago
  94. bd686e8 Strip style attributes from all tags by Wei-Yin Chen (陳威尹) · 10 years ago
  95. b270b64 Update document about verbose logging by Wei-Yin Chen (陳威尹) · 10 years ago
  96. 78b3d20 Forbid "next post" links in pagination detection by Wei-Yin Chen (陳威尹) · 10 years ago
  97. 9c50bfe implement entry point for detection of pagination URLs by Kuan Tan · 10 years ago
  98. 6300dcf Leverage semantic elements to find the main article by Wei-Yin Chen (陳威尹) · 10 years ago
  99. 61e7226 Add Vagrant support by Wei-Yin Chen (陳威尹) · 10 years ago
  100. bf64df9 Make TableClassifierTest independent of browser window dimention by Wei-Yin Chen (陳威尹) · 10 years ago