Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Odd behaviour with post content #93

Open
himynamesdave opened this issue Dec 6, 2024 · 2 comments
Open

Odd behaviour with post content #93

himynamesdave opened this issue Dec 6, 2024 · 2 comments
Assignees
Labels
bug Something isn't working

Comments

@himynamesdave
Copy link
Member

found running this test

#91 (comment)

using

feeds/bf07ffa3-4099-5fbc-90c7-681f850744d3/

see how all descriptions are the same

{
  "page_size": 50,
  "page_number": 1,
  "page_results_count": 25,
  "total_results_count": 25,
  "posts": [
    {
      "id": "edcfe7db-443a-54ab-9d97-07c0fd7ee50d",
      "datetime_added": "2024-12-06T14:21:28Z",
      "datetime_updated": "2024-12-06T14:21:30Z",
      "title": "aniother new post",
      "description": "<html><body><div><div class=\"popular-posts-snippet snippet-container r-snippet-container\">\n<p class=\"snippet-item r-snippetized\">\n7.7.7.7 seven.com United Kingdom  https://www.yesme.com\n</p>\n<a class=\"snippet-fade r-snippet-fade hidden\" href=\"https://throw-away123456.blogspot.com/2021/06/my-seventh-post.html\"></a>\n</div>\n</div></body></html>",
      "link": "https://throw-away123456.blogspot.com/2024/12/aniother-new-post.html",
      "pubdate": "2024-12-06T14:13:00Z",
      "author": "Throwawy",
      "is_full_text": true,
      "content_type": "text/html; charset=utf-8",
      "added_manually": false,
      "categories": [],
      "profile_id": "bcf09ec5-d124-528a-bb21-480114231795"
    },
    {
      "id": "4252fcd6-3449-59d8-b5e7-cb980791ade0",
      "datetime_added": "2024-12-06T14:21:28Z",
      "datetime_updated": "2024-12-06T14:21:31Z",
      "title": "my new post",
      "description": "<html><body><div><div class=\"popular-posts-snippet snippet-container r-snippet-container\">\n<p class=\"snippet-item r-snippetized\">\n7.7.7.7 seven.com United Kingdom  https://www.yesme.com\n</p>\n<a class=\"snippet-fade r-snippet-fade hidden\" href=\"https://throw-away123456.blogspot.com/2021/06/my-seventh-post.html\"></a>\n</div>\n</div></body></html>",
      "link": "https://throw-away123456.blogspot.com/2024/12/my-new-post.html",
      "pubdate": "2024-12-06T14:06:00Z",
      "author": "Throwawy",
      "is_full_text": true,
      "content_type": "text/html; charset=utf-8",
      "added_manually": false,
      "categories": [],
      "profile_id": "bcf09ec5-d124-528a-bb21-480114231795"
    },
    {
      "id": "ea1af607-43d9-56dd-8068-c10b74e527c4",
      "datetime_added": "2024-12-06T14:21:28Z",
      "datetime_updated": "2024-12-06T14:21:33Z",
      "title": "second friday lunchtime post",
      "description": "<html><body><div><div class=\"popular-posts-snippet snippet-container r-snippet-container\">\n<p class=\"snippet-item r-snippetized\">\n7.7.7.7 seven.com United Kingdom  https://www.yesme.com\n</p>\n<a class=\"snippet-fade r-snippet-fade hidden\" href=\"https://throw-away123456.blogspot.com/2021/06/my-seventh-post.html?m=1\"></a>\n</div>\n</div></body></html>",
      "link": "https://throw-away123456.blogspot.com/2024/12/second-friday-lunchtime-post.html",
      "pubdate": "2024-12-06T12:33:00Z",
      "author": "Throwawy",
      "is_full_text": true,
      "content_type": "text/html; charset=utf-8",
      "added_manually": false,
      "categories": []
    },
    {
      "id": "a59da190-967f-5498-80ca-0b0934a307d4",
      "datetime_added": "2024-12-06T14:21:28Z",
      "datetime_updated": "2024-12-06T14:21:34Z",
      "title": "a friday lunchtime post",
      "description": "<html><body><div><div class=\"popular-posts-snippet snippet-container r-snippet-container\">\n<p class=\"snippet-item r-snippetized\">\n7.7.7.7 seven.com United Kingdom  https://www.yesme.com\n</p>\n<a class=\"snippet-fade r-snippet-fade hidden\" href=\"https://throw-away123456.blogspot.com/2021/06/my-seventh-post.html?m=1\"></a>\n</div>\n</div></body></html>",
      "link": "https://throw-away123456.blogspot.com/2024/12/a-friday-lunchtime-post.html",
      "pubdate": "2024-12-06T12:13:00Z",
      "author": "Throwawy",
      "is_full_text": true,
      "content_type": "text/html; charset=utf-8",
      "added_manually": false,
      "categories": []
    },
    {
      "id": "363eb1c6-6ddf-5e7e-8bcb-ba18c3f4c39a",
      "datetime_added": "2024-12-06T14:21:28Z",
      "datetime_updated": "2024-12-06T14:21:36Z",
      "title": "weds am post",
      "description": "<html><body><div><div class=\"popular-posts-snippet snippet-container r-snippet-container\">\n<p class=\"snippet-item r-snippetized\">\n7.7.7.7 seven.com United Kingdom  https://www.yesme.com\n</p>\n<a class=\"snippet-fade r-snippet-fade hidden\" href=\"https://throw-away123456.blogspot.com/2021/06/my-seventh-post.html\"></a>\n</div>\n</div></body></html>",
      "link": "https://throw-away123456.blogspot.com/2024/12/weds-am-post.html",
      "pubdate": "2024-12-04T06:12:00Z",
      "author": "Throwawy",
      "is_full_text": true,
      "content_type": "text/html; charset=utf-8",
      "added_manually": false,
      "categories": []
    },
    {
      "id": "44d51254-c7c0-5643-adb5-6e488b7defd9",
      "datetime_added": "2024-12-06T14:21:28Z",
      "datetime_updated": "2024-12-06T14:21:38Z",
      "title": "monday evening post",
      "description": "<html><body><div><div class=\"popular-posts-snippet snippet-container r-snippet-container\">\n<p class=\"snippet-item r-snippetized\">\n7.7.7.7 seven.com United Kingdom  https://www.yesme.com\n</p>\n<a class=\"snippet-fade r-snippet-fade hidden\" href=\"https://throw-away123456.blogspot.com/2021/06/my-seventh-post.html?m=1\"></a>\n</div>\n</div></body></html>",
      "link": "https://throw-away123456.blogspot.com/2024/12/monday-evening-post.html",
      "pubdate": "2024-12-02T19:09:00Z",
      "author": "Throwawy",
      "is_full_text": true,
      "content_type": "text/html; charset=utf-8",
      "added_manually": false,
      "categories": []
    },
    {
      "id": "ed0fa5f3-cf68-5f65-9950-522211d741c4",
      "datetime_added": "2024-12-06T14:21:28Z",
      "datetime_updated": "2024-12-06T14:21:39Z",
      "title": "testing the update schedule",
      "description": "<html><body><div><div class=\"post\">\n\n<a name=\"6654924059073820798\"></a>\n<h3 class=\"post-title entry-title\">\ntesting the update schedule\n</h3>\n\n\n<p class=\"post-body entry-content float-container\" id=\"post-body-6654924059073820798\">\na new post ready to be indexed\n\nand some more text for the update added later\n</p>\n\n</div>\n</div></body></html>",
      "link": "https://throw-away123456.blogspot.com/2024/12/testing-update-schedule.html",
      "pubdate": "2024-12-02T10:41:00Z",
      "author": "Throwawy",
      "is_full_text": true,
      "content_type": "text/html; charset=utf-8",
      "added_manually": false,
      "categories": []
    },
    {
      "id": "9a188782-b4a8-505e-a77c-504feea52520",
      "datetime_added": "2024-12-06T14:21:28Z",
      "datetime_updated": "2024-12-06T14:21:41Z",
      "title": "add a new post",
      "description": "<html><body><div><div class=\"popular-posts-snippet snippet-container r-snippet-container\">\n<p class=\"snippet-item r-snippetized\">\n7.7.7.7 seven.com United Kingdom  https://www.yesme.com\n</p>\n<a class=\"snippet-fade r-snippet-fade hidden\" href=\"https://throw-away123456.blogspot.com/2021/06/my-seventh-post.html?m=1\"></a>\n</div>\n</div></body></html>",
      "link": "https://throw-away123456.blogspot.com/2024/11/add-new-post.html",
      "pubdate": "2024-11-29T06:30:00Z",
      "author": "Throwawy",
      "is_full_text": true,
      "content_type": "text/html; charset=utf-8",
      "added_manually": false,
      "categories": []
    },
    {
      "id": "bfd9c37d-5fb7-5867-9f5a-bb593fd13591",
      "datetime_added": "2024-12-06T14:21:28Z",
      "datetime_updated": "2024-12-06T14:21:42Z",
      "title": "Testing Obstracts web update",
      "description": "<html><body><div><div class=\"popular-posts-snippet snippet-container r-snippet-container\">\n<p class=\"snippet-item r-snippetized\">\n7.7.7.7 seven.com United Kingdom  https://www.yesme.com\n</p>\n<a class=\"snippet-fade r-snippet-fade hidden\" href=\"https://throw-away123456.blogspot.com/2021/06/my-seventh-post.html\"></a>\n</div>\n</div></body></html>",
      "link": "https://throw-away123456.blogspot.com/2024/11/testing-obstracts-web-update.html",
      "pubdate": "2024-11-24T12:04:00Z",
      "author": "Throwawy",
      "is_full_text": true,
      "content_type": "text/html; charset=utf-8",
      "added_manually": false,
      "categories": []
    },
    {
      "id": "976f7cad-8eba-57f1-9941-9d57f40a436a",
      "datetime_added": "2024-12-06T14:21:28Z",
      "datetime_updated": "2024-12-06T14:21:44Z",
      "title": "Intel iocs",
      "description": "<html><body><div><div class=\"popular-posts-snippet snippet-container r-snippet-container\">\n<p class=\"snippet-item r-snippetized\">\n7.7.7.7 seven.com United Kingdom  https://www.yesme.com\n</p>\n<a class=\"snippet-fade r-snippet-fade hidden\" href=\"https://throw-away123456.blogspot.com/2021/06/my-seventh-post.html?m=1\"></a>\n</div>\n</div></body></html>",
      "link": "https://throw-away123456.blogspot.com/2022/05/intel-iocs.html",
      "pubdate": "2022-05-05T19:27:00Z",
      "author": "Throwawy",
      "is_full_text": true,
      "content_type": "text/html; charset=utf-8",
      "added_manually": false,
      "categories": []
    },
    {
      "id": "8a744c79-44cf-5fe3-b48e-ec5e7500cd16",
      "datetime_added": "2024-12-06T14:21:28Z",
      "datetime_updated": "2024-12-06T14:21:45Z",
      "title": "Iran post 2",
      "description": "<html><body><div><div class=\"popular-posts-snippet snippet-container r-snippet-container\">\n<p class=\"snippet-item r-snippetized\">\n7.7.7.7 seven.com United Kingdom  https://www.yesme.com\n</p>\n<a class=\"snippet-fade r-snippet-fade hidden\" href=\"https://throw-away123456.blogspot.com/2021/06/my-seventh-post.html\"></a>\n</div>\n</div></body></html>",
      "link": "https://throw-away123456.blogspot.com/2022/04/iran-post-2.html",
      "pubdate": "2022-04-29T16:47:00Z",
      "author": "Throwawy",
      "is_full_text": true,
      "content_type": "text/html; charset=utf-8",
      "added_manually": false,
      "categories": []
    },
    {
      "id": "1361aa62-2e89-5dd4-a0f9-8bd55456ec1f",
      "datetime_added": "2024-12-06T14:21:28Z",
      "datetime_updated": "2024-12-06T14:21:47Z",
      "title": "Iran intel",
      "description": "<html><body><div><div class=\"popular-posts-snippet snippet-container r-snippet-container\">\n<p class=\"snippet-item r-snippetized\">\n7.7.7.7 seven.com United Kingdom  https://www.yesme.com\n</p>\n<a class=\"snippet-fade r-snippet-fade hidden\" href=\"https://throw-away123456.blogspot.com/2021/06/my-seventh-post.html?m=1\"></a>\n</div>\n</div></body></html>",
      "link": "https://throw-away123456.blogspot.com/2021/10/iran-intel.html",
      "pubdate": "2021-10-29T15:50:00Z",
      "author": "Throwawy",
      "is_full_text": true,
      "content_type": "text/html; charset=utf-8",
      "added_manually": false,
      "categories": []
    },
    {
      "id": "b17fc5f6-28ae-513b-ba20-e8d5e2305ad7",
      "datetime_added": "2024-12-06T14:21:28Z",
      "datetime_updated": "2024-12-06T14:21:49Z",
      "title": "my custom indicators",
      "description": "<html><body><div><div class=\"post\">\n\n<a name=\"8824581425969983791\"></a>\n<h3 class=\"post-title entry-title\">\nmy custom indicators\n</h3>\n\n\n<p class=\"post-body entry-content float-container\" id=\"post-body-8824581425969983791\">\nvulnerability\ncampaign\ncourse-of-action\ninfrastructure\nattack-pattern\nmalware\nnote\nthreat-actor\ntool\nsoftware\nintrusion-set\n</p>\n\n</div>\n</div></body></html>",
      "link": "https://throw-away123456.blogspot.com/2021/08/my-custom-indicators.html",
      "pubdate": "2021-08-17T18:39:00Z",
      "author": "Throwawy",
      "is_full_text": true,
      "content_type": "text/html; charset=utf-8",
      "added_manually": false,
      "categories": []
    },
    {
      "id": "04a2d275-49fd-541a-b33f-45e9cf588ea6",
      "datetime_added": "2024-12-06T14:21:28Z",
      "datetime_updated": "2024-12-06T14:21:50Z",
      "title": "DO YOU SEE THIS?",
      "description": "<html><body><div><div class=\"popular-posts-snippet snippet-container r-snippet-container\">\n<p class=\"snippet-item r-snippetized\">\n7.7.7.7 seven.com United Kingdom  https://www.yesme.com\n</p>\n<a class=\"snippet-fade r-snippet-fade hidden\" href=\"https://throw-away123456.blogspot.com/2021/06/my-seventh-post.html?m=1\"></a>\n</div>\n</div></body></html>",
      "link": "https://throw-away123456.blogspot.com/2021/07/do-you-see-this.html",
      "pubdate": "2021-07-08T19:38:00Z",
      "author": "Throwawy",
      "is_full_text": true,
      "content_type": "text/html; charset=utf-8",
      "added_manually": false,
      "categories": []
    },
    {
      "id": "563f7874-f388-5b72-a4ec-647cce189fc3",
      "datetime_added": "2024-12-06T14:21:28Z",
      "datetime_updated": "2024-12-06T14:21:52Z",
      "title": "Testing delete",
      "description": "<html><body><div><div class=\"popular-posts-snippet snippet-container r-snippet-container\">\n<p class=\"snippet-item r-snippetized\">\n7.7.7.7 seven.com United Kingdom  https://www.yesme.com\n</p>\n<a class=\"snippet-fade r-snippet-fade hidden\" href=\"https://throw-away123456.blogspot.com/2021/06/my-seventh-post.html\"></a>\n</div>\n</div></body></html>",
      "link": "https://throw-away123456.blogspot.com/2021/07/testing-delete.html",
      "pubdate": "2021-07-07T19:42:00Z",
      "author": "Throwawy",
      "is_full_text": true,
      "content_type": "text/html; charset=utf-8",
      "added_manually": false,
      "categories": []
    },
    {
      "id": "ced4f166-1554-58d4-9a9a-1ce917beb807",
      "datetime_added": "2024-12-06T14:21:28Z",
      "datetime_updated": "2024-12-06T14:21:54Z",
      "title": "1.1.1.1",
      "description": "<html><body><div><div class=\"popular-posts-snippet snippet-container r-snippet-container\">\n<p class=\"snippet-item r-snippetized\">\n7.7.7.7 seven.com United Kingdom  https://www.yesme.com\n</p>\n<a class=\"snippet-fade r-snippet-fade hidden\" href=\"https://throw-away123456.blogspot.com/2021/06/my-seventh-post.html?m=1\"></a>\n</div>\n</div></body></html>",
      "link": "https://throw-away123456.blogspot.com/2021/07/1111.html",
      "pubdate": "2021-07-07T18:53:00Z",
      "author": "Throwawy",
      "is_full_text": true,
      "content_type": "text/html; charset=utf-8",
      "added_manually": false,
      "categories": []
    },
    {
      "id": "9fd81eab-eeb3-5c16-889b-17e7e3f77aa6",
      "datetime_added": "2024-12-06T14:21:28Z",
      "datetime_updated": "2024-12-06T14:21:55Z",
      "title": "Incident test",
      "description": "<html><body><div><div class=\"popular-posts-snippet snippet-container r-snippet-container\">\n<p class=\"snippet-item r-snippetized\">\n7.7.7.7 seven.com United Kingdom  https://www.yesme.com\n</p>\n<a class=\"snippet-fade r-snippet-fade hidden\" href=\"https://throw-away123456.blogspot.com/2021/06/my-seventh-post.html\"></a>\n</div>\n</div></body></html>",
      "link": "https://throw-away123456.blogspot.com/2021/07/incident-test.html",
      "pubdate": "2021-07-07T18:47:00Z",
      "author": "Throwawy",
      "is_full_text": true,
      "content_type": "text/html; charset=utf-8",
      "added_manually": false,
      "categories": []
    },
    {
      "id": "163a0170-11dc-55de-a2ca-31564d51e347",
      "datetime_added": "2024-12-06T14:21:28Z",
      "datetime_updated": "2024-12-06T14:21:56Z",
      "title": "17 comma",
      "description": "<html><body><div><div class=\"popular-posts-snippet snippet-container r-snippet-container\">\n<p class=\"snippet-item r-snippetized\">\n7.7.7.7 seven.com United Kingdom  https://www.yesme.com\n</p>\n<a class=\"snippet-fade r-snippet-fade hidden\" href=\"https://throw-away123456.blogspot.com/2021/06/my-seventh-post.html\"></a>\n</div>\n</div></body></html>",
      "link": "https://throw-away123456.blogspot.com/2021/06/17-comma.html",
      "pubdate": "2021-06-20T14:03:00Z",
      "author": "Throwawy",
      "is_full_text": true,
      "content_type": "text/html; charset=utf-8",
      "added_manually": false,
      "categories": []
    },
    {
      "id": "860a1745-f40f-575c-ac5a-c41a7b3689ef",
      "datetime_added": "2024-12-06T14:21:28Z",
      "datetime_updated": "2024-12-06T14:21:58Z",
      "title": "Sixteen",
      "description": "<html><body><div><div class=\"popular-posts-snippet snippet-container r-snippet-container\">\n<p class=\"snippet-item r-snippetized\">\n7.7.7.7 seven.com United Kingdom  https://www.yesme.com\n</p>\n<a class=\"snippet-fade r-snippet-fade hidden\" href=\"https://throw-away123456.blogspot.com/2021/06/my-seventh-post.html?m=1\"></a>\n</div>\n</div></body></html>",
      "link": "https://throw-away123456.blogspot.com/2021/06/sixteen.html",
      "pubdate": "2021-06-20T14:02:00Z",
      "author": "Throwawy",
      "is_full_text": true,
      "content_type": "text/html; charset=utf-8",
      "added_manually": false,
      "categories": []
    },
    {
      "id": "5b3a77e4-2282-522d-b40e-5ff9342bfbc7",
      "datetime_added": "2024-12-06T14:21:28Z",
      "datetime_updated": "2024-12-06T14:21:59Z",
      "title": "Fifteen",
      "description": "<html><body><div><div class=\"popular-posts-snippet snippet-container r-snippet-container\">\n<p class=\"snippet-item r-snippetized\">\n7.7.7.7 seven.com United Kingdom  https://www.yesme.com\n</p>\n<a class=\"snippet-fade r-snippet-fade hidden\" href=\"https://throw-away123456.blogspot.com/2021/06/my-seventh-post.html?m=1\"></a>\n</div>\n</div></body></html>",
      "link": "https://throw-away123456.blogspot.com/2021/06/fifteen.html",
      "pubdate": "2021-06-20T06:28:00Z",
      "author": "Throwawy",
      "is_full_text": true,
      "content_type": "text/html; charset=utf-8",
      "added_manually": false,
      "categories": []
    },
    {
      "id": "846ec377-a3c7-5a19-bd8e-4e5f32c48168",
      "datetime_added": "2024-12-06T14:21:28Z",
      "datetime_updated": "2024-12-06T14:22:01Z",
      "title": "Fourteen",
      "description": "<html><body><div><div class=\"popular-posts-snippet snippet-container r-snippet-container\">\n<p class=\"snippet-item r-snippetized\">\n7.7.7.7 seven.com United Kingdom  https://www.yesme.com\n</p>\n<a class=\"snippet-fade r-snippet-fade hidden\" href=\"https://throw-away123456.blogspot.com/2021/06/my-seventh-post.html?m=1\"></a>\n</div>\n</div></body></html>",
      "link": "https://throw-away123456.blogspot.com/2021/06/fourteen.html",
      "pubdate": "2021-06-20T06:28:00Z",
      "author": "Throwawy",
      "is_full_text": true,
      "content_type": "text/html; charset=utf-8",
      "added_manually": false,
      "categories": []
    },
    {
      "id": "8b5824bb-6b69-5e78-8544-a00a57b61edb",
      "datetime_added": "2024-12-06T14:21:28Z",
      "datetime_updated": "2024-12-06T14:22:02Z",
      "title": "Lucky Thirteen",
      "description": "<html><body><div><div class=\"post\">\n\n<a name=\"7571581621736067253\"></a>\n<h3 class=\"post-title entry-title\">\nLucky Thirteen\n</h3>\n\n\n<p class=\"post-body entry-content float-container\" id=\"post-body-7571581621736067253\">\nThis blog post was created at 20:14 UK time\n\n13.13.13.13 China CN Shenzen thirteen.com James\n</p>\n\n</div>\n</div></body></html>",
      "link": "https://throw-away123456.blogspot.com/2021/06/lucky-thirteen.html",
      "pubdate": "2021-06-19T17:42:00Z",
      "author": "Throwawy",
      "is_full_text": true,
      "content_type": "text/html; charset=utf-8",
      "added_manually": false,
      "categories": []
    },
    {
      "id": "e1b31485-c2f0-5ce3-b8dd-999bd78c70c2",
      "datetime_added": "2024-12-06T14:21:28Z",
      "datetime_updated": "2024-12-06T14:22:04Z",
      "title": "Eleven before 12",
      "description": "<html><body><div><div class=\"post\">\n\n<a name=\"4807343893447323047\"></a>\n<h3 class=\"post-title entry-title\">\nEleven before 12\n</h3>\n\n\n<p class=\"post-body entry-content float-container\" id=\"post-body-4807343893447323047\">\nThis blog post was created at 21:05 UK time\n\n11.11.11.11\n\nGermany\n\neleven.com\n</p>\n\n</div>\n</div></body></html>",
      "link": "https://throw-away123456.blogspot.com/2021/06/eleven-before-12.html",
      "pubdate": "2021-06-07T20:05:00Z",
      "author": "Throwawy",
      "is_full_text": true,
      "content_type": "text/html; charset=utf-8",
      "added_manually": false,
      "categories": []
    },
    {
      "id": "24b629b5-6313-55c4-b603-b1a1f21ba380",
      "datetime_added": "2024-12-06T14:21:28Z",
      "datetime_updated": "2024-12-06T14:22:05Z",
      "title": "Twelve",
      "description": "<html><body><div><div class=\"post\">\n\n<a name=\"7145658940051214268\"></a>\n<h3 class=\"post-title entry-title\">\nTwelve\n</h3>\n\n\n<p class=\"post-body entry-content float-container\" id=\"post-body-7145658940051214268\">\nThis blog post was created at 20:14 UK time\n\n12.12.12.12\n\nFrance\n\ntwelve.com\n</p>\n\n</div>\n</div></body></html>",
      "link": "https://throw-away123456.blogspot.com/2021/06/twelve.html",
      "pubdate": "2021-06-07T19:15:00Z",
      "author": "Throwawy",
      "is_full_text": true,
      "content_type": "text/html; charset=utf-8",
      "added_manually": false,
      "categories": [
        "label"
      ],
      "profile_id": "bcf09ec5-d124-528a-bb21-480114231795"
    },
    {
      "id": "8b5f1f46-84ca-5869-8fcf-2082907406a7",
      "datetime_added": "2024-12-06T14:21:28Z",
      "datetime_updated": "2024-12-06T14:22:07Z",
      "title": "Number 10",
      "description": "<html><body></body></html>",
      "link": "https://throw-away123456.blogspot.com/2021/06/number-10.html",
      "pubdate": "2021-06-07T17:52:00Z",
      "author": "Throwawy",
      "is_full_text": true,
      "content_type": "text/html; charset=utf-8",
      "added_manually": false,
      "categories": [
        "elevencom"
      ]
    }
  ]
}
@himynamesdave himynamesdave added the bug Something isn't working label Dec 6, 2024
@github-project-automation github-project-automation bot moved this to Todo in Roadmap Dec 6, 2024
@himynamesdave himynamesdave changed the title Odd behaviour with update posts Odd behaviour with post content Dec 6, 2024
@fqrious
Copy link
Contributor

fqrious commented Dec 6, 2024

well, they are not at all the same.

Image

There's also some very clear differences at the end of the list

Image

@fqrious
Copy link
Contributor

fqrious commented Dec 6, 2024

more investigation points to readability-lxml choosing the wrong article.

Image

It appears to have skipped the main article and used the second

on the page

Image

@fqrious fqrious moved this from Todo to Need Info in Roadmap Dec 6, 2024
@fqrious fqrious moved this from Need Info to Discovery in Roadmap Dec 12, 2024
@himynamesdave himynamesdave self-assigned this Dec 17, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
Status: Discovery
Development

No branches or pull requests

2 participants