  1. 36c7178 Migrate base::Value::GetList() to base::Value::GetListDeprecated(): 3/N. by Daniel Cheng · 5 months ago main
  2. b81fa59 Add scheduled GitHub Actions by Wei-Yin Chen (陳威尹) · 10 months ago master
  3. 6092ab0 Add support for GitHub Actions by Wei-Yin Chen (陳威尹) · 10 months ago
  4. 166ccff Remove Travis CI integration by Wei-Yin Chen (陳威尹) · 10 months ago
  5. 9652bdc Update protoc_plugins to match what's in chromium by Andrew Grieve · 1 year, 2 months ago
  6. 3eef8a0 Strip all unsafe HTML attributes using an allowlist. by Dominic Mazzoni · 2 years ago
  7. a1612e0 Update and improve readability of README.md. by Aran Gilman · 2 years, 3 months ago
  8. 56b365d Use goog:loggingPrefs in get_screenshots.py by Wei-Yin Chen (陳威尹) · 2 years, 10 months ago
  9. 4072533 Bump selenium version to 3.141.0 by Wei-Yin Chen (陳威尹) · 3 years ago
  10. 5438a7b Use --enable-distillability-service for native features by Wei-Yin Chen (陳威尹) · 3 years ago
  11. 45fe4a6 Update Travis CI environment from Ubuntu 14.04 to 18.04 by Wei-Yin Chen (陳威尹) · 3 years ago
  12. ba2c5d9 Adapt new version scheme of Chrome Driver by Wei-Yin Chen (陳威尹) · 3 years ago
  13. c38fd83 Use Open JDK 8 by Wei-Yin Chen (陳威尹) · 3 years ago
  14. ccfe233 Fix EmbedExtractorTest.testDivCaption by Wei-Yin Chen (陳威尹) · 4 years ago
  15. 8825eaf In tests, render elements before getting their innerText by Wei-Yin Chen (陳威尹) · 4 years ago
  16. 8c9af2c Make generateOutput(textOnly=true) standard compliant by Wei-Yin Chen (陳威尹) · 4 years ago
  17. 9596033 Skip elements with zero area when finding article element by Wei-Yin Chen (陳威尹) · 4 years, 7 months ago
  18. 38037b3 Update docs after GoB/gerrit migration by Wei-Yin Chen (陳威尹) · 4 years, 9 months ago
  19. 0bde315 Fix false positive of next page detection by Wei-Yin Chen (陳威尹) · 4 years, 9 months ago
  20. f1d9b2d Upload dom-distiller changes to Gerrit by default by Aaron Gable · 4 years, 9 months ago
  21. 4405dfd Fix build status badge image on GitHub by Wei-Yin Chen (陳威尹) · 5 years ago
  22. 8de0cac Handle image lazy loading on Wikipedia by Wei-Yin Chen (陳威尹) · 5 years ago
  23. 9c2021c Ignore the "Edit" links in wiki pages by Wei-Yin Chen (陳威尹) · 5 years ago
  24. cd538d1 Unhide folded sections in wiki pages by Wei-Yin Chen (陳威尹) · 5 years ago
  25. 4d66ebb Make sure images are loaded before running tests by Wei-Yin Chen (陳威尹) · 5 years ago
  26. 2d7b629 Make sure lazily-loaded images have absolute URL by Wei-Yin Chen (陳威尹) · 5 years ago
  27. 489c660 Skip some social sharing elements by Wei-Yin Chen (陳威尹) · 5 years ago
  28. 5da4878 Skip non-text elements in <a> without href in <figcaption> by Wei-Yin Chen (陳威尹) · 5 years ago
  29. 6bd2ebb Fix more partially hidden articles by Wei-Yin Chen (陳威尹) · 5 years ago
  30. 4540f35 Support <picture> in image extraction by Wei-Yin Chen (陳威尹) · 5 years ago
  31. aab1a1b Revert "Replace create_standalone_js.py with a custom GWT linker" by Wei-Yin Chen (陳威尹) · 5 years ago
  32. ac91168 Use stricter comment-detecting heuristics by Wei-Yin Chen (陳威尹) · 5 years ago
  33. 936bafa Skip <svg> elements when scanning the DOM by Wei-Yin Chen (陳威尹) · 6 years ago
  34. 315e460 Skip lead image finder when no text content by Wei-Yin Chen (陳威尹) · 6 years ago
  35. 072fe57 Recognize H4 to H6 as headings as well by Wei-Yin Chen (陳威尹) · 6 years ago
  36. 52047b4 Avoid using getClassName() to avoid issues with <svg> by Wei-Yin Chen (陳威尹) · 6 years ago
  37. 8cf93ce Bump ChromeDriver version to 2.24 by Wei-Yin Chen (陳威尹) · 6 years ago
  38. d876125 Add gen_mhtml_corpus.py to convert MHTML to eval corpus by Wei-Yin Chen (陳威尹) · 6 years ago
  39. 8b33c8b Amend "Fix partially hidden article" by Wei-Yin Chen (陳威尹) · 6 years ago
  40. 3fd2017 Strip unwanted classNames from all nodes by Wei-Yin Chen (陳威尹) · 6 years ago
  41. d16a68c Fix partially hidden article by Wei-Yin Chen (陳威尹) · 6 years ago
  42. 85447bd Ignore exceptions in embed extractors by Wei-Yin Chen (陳威尹) · 6 years ago
  43. 50efabe Skip unrecognized iframes by Wei-Yin Chen (陳威尹) · 6 years ago
  44. 23830cf Use .innerText in text-only mode by Wei-Yin Chen (陳威尹) · 6 years ago
  45. 7306201 Avoid creating container when possible by Wei-Yin Chen (陳威尹) · 6 years ago
  46. 5323f43 Forbid usage of GWT's Element.getInnerText() by Wei-Yin Chen (陳威尹) · 6 years ago
  47. b21e6a3 Remove //base dependency by Wei-Yin Chen (陳威尹) · 6 years ago
  48. 91f9f01 Fix figcaption generation by Wei-Yin Chen (陳威尹) · 6 years ago
  49. 365c44e Add support for figure element by marcelorcorrea · 6 years ago
  50. f8f3308 Update distillability modeling scripts to predict long articles by Wei-Yin Chen (陳威尹) · 6 years ago
  51. 8a12e18 Decrease mismatches in feature extraction by Wei-Yin Chen (陳威尹) · 6 years ago
  52. 4d7ab13 Extract image URLs in WebTables by Wei-Yin Chen (陳威尹) · 6 years ago
  53. 8d8063a Extract image URLs in srcset as well by Wei-Yin Chen (陳威尹) · 6 years ago
  54. 34c4a18 Re-enable tests containing <track> in CI by Wei-Yin Chen (陳威尹) · 6 years ago
  55. 0d4286b The display style of WebText root element should never be inline by Wei-Yin Chen (陳威尹) · 6 years ago
  56. 6c16f14 Warm up image cache for data URI used in tests by Wei-Yin Chen (陳威尹) · 6 years ago
  57. 301ea1e Fix malformed test filter by Wei-Yin Chen (陳威尹) · 6 years ago
  58. 9b606cb Remove files for drone.io by Wei-Yin Chen (陳威尹) · 6 years ago
  59. 7e3b63e Fix schema.org article matching by Wei-Yin Chen (陳威尹) · 6 years ago
  60. 54d05ba Discard hidden articles when using fast path by marcelorcorrea · 6 years ago
  61. f6d2dc1 Support extraction of lazily-loaded images by Wei-Yin Chen (陳威尹) · 6 years ago
  62. 11fdddc Strip "target" attribute from anchor elements by Wei-Yin Chen (陳威尹) · 6 years ago
  63. 65c0b6d Fix for LeadImage getting images after last relevant content by vinicius.suzuki · 6 years ago
  64. 0455a46 Support deprecated <object> API of Youtube embeds by Wei-Yin Chen (陳威尹) · 6 years ago
  65. 8637690 Fix some warnings in Eclipse by Wei-Yin Chen (陳威尹) · 6 years ago
  66. 0adf24a <track> should not have end tags by Wei-Yin Chen (陳威尹) · 6 years ago
  67. 2b5538e Remove scoped_ptr use from json_values_converter.py. by Daniel Cheng · 6 years ago
  68. c3edb38 Add travis-ci support by Wei-Yin Chen (陳威尹) · 6 years ago
  69. adbdbec Fix for Schema.org title extraction. by mlongaray · 6 years ago
  70. f40e521 Lazy evaluation of MarkupParsers by marcelorcorrea · 6 years ago
  71. 84cfcd2 Retain image sizes by dalmirsilva · 6 years ago
  72. 3b3253f Make long strings easier to compare in assertion failure by Wei-Yin Chen (陳威尹) · 6 years ago
  73. fc19f02 Clean attributes of mRoot between tests properly by Wei-Yin Chen (陳威尹) · 6 years ago
  74. ccefff9 Handle drop cap letters by Wei-Yin Chen (陳威尹) · 6 years ago
  75. 9101ca4 Fix spelling in comments by Wei-Yin Chen (陳威尹) · 6 years ago
  76. cfb7588 Update README.md by Wei-Yin Chen (陳威尹) · 6 years ago
  77. 4d0809f Reformat README.md to Google style by Wei-Yin Chen (陳威尹) · 6 years ago
  78. 72998ef Add scripts for distillability modeling by Wei-Yin Chen (陳威尹) · 6 years ago
  79. 7310b0f Process the root element in various DomUtil functions by Wei-Yin Chen (陳威尹) · 6 years ago
  80. 349b932 Remove DOM Distiller dependency on scoped_ptr<T>::Pass(). by dcheng · 7 years ago
  81. 8fa737b Add <pre> to nesting tags by Wei-Yin Chen (陳威尹) · 7 years ago
  82. 7991208 Clean up attributes of image elements by Wei-Yin Chen (陳威尹) · 7 years ago
  83. 6e47a67 dom_distiller: add option to select pagination algorithm by Kuan Tan · 7 years ago
  84. 5eaf86d implement fixes for new multi-page dataset by Kuan Tan · 7 years ago
  85. e55d6a4 Fix iOS crash related to window.performance by gnachman · 7 years ago
  86. 36a509c implement parser for new pagination algorithm by Kuan Tan · 7 years ago
  87. 6c7b59e Amend "Strip style attributes from all tags" by Wei-Yin Chen (陳威尹) · 7 years ago
  88. 21145c3 Add blockquote tag to the nestingTags in WebTag class. by marcelorcorrea · 7 years ago
  89. a4e94dd Fix for keeping lists structure by marcelorcorrea · 7 years ago
  90. 152a23e Make URLs in srcset absolute by Wei-Yin Chen (陳威尹) · 7 years ago
  91. 85fab03 Forbid "next article" in Chinese in pagination detection by Wei-Yin Chen (陳威尹) · 7 years ago
  92. b83e819 Refactor attribute stripping by Wei-Yin Chen (陳威尹) · 7 years ago
  93. bd686e8 Strip style attributes from all tags by Wei-Yin Chen (陳威尹) · 7 years ago
  94. b270b64 Update document about verbose logging by Wei-Yin Chen (陳威尹) · 7 years ago
  95. 78b3d20 Forbid "next post" links in pagination detection by Wei-Yin Chen (陳威尹) · 7 years ago
  96. 9c50bfe implement entry point for detection of pagination URLs by Kuan Tan · 7 years ago
  97. 6300dcf Leverage semantic elements to find the main article by Wei-Yin Chen (陳威尹) · 7 years ago
  98. 61e7226 Add Vagrant support by Wei-Yin Chen (陳威尹) · 7 years ago
  99. bf64df9 Make TableClassifierTest independent of browser window dimention by Wei-Yin Chen (陳威尹) · 7 years ago
  100. 7a10992 Fix word count issue for Chinese and Japanese by Wei-Yin Chen (陳威尹) · 7 years ago