Skip to content

Conversation

@vimscientist69
Copy link
Contributor

@vimscientist69 vimscientist69 commented Oct 16, 2025

Reason

AutomaWebCore provides a way for scraping the HTML of websites. We need to use AutomaWebCore API to scrape the HTML of an article, parse the HTML to get the text, and use an LLM to format that text into a dictionary. This is needed, because having an article as a dictionary will be used to generate Tweets.

Tech Details List

  1. Implement AricleContentScraperService with proper error handling, metrics and logs
  2. Refactor/clean scraper service code
  3. Write a simple integration test for the service, testing the only public method

Tasks:

  • Implement AricleContentScraperService with proper error handling, metrics and logs
  • Refactor/clean scraper service code
  • Write a simple integration test for the service, testing the only public method

Un-important tasks:

Links:

Testing

Steps:

  1. Run required commands to test all the code modified by this PR
# unit tests
# integration tests
swift test --filter "ArticleContentScraperServiceIntegrationTests"
# procs / testing logic
  1. Further testing can be done by following the notes in TESTING-QA

Output:

Testing QA

@vimscientist69 vimscientist69 self-assigned this Oct 16, 2025
@vimscientist69 vimscientist69 merged commit ec87189 into develop Oct 16, 2025
3 of 4 checks passed
@vimscientist69 vimscientist69 deleted the article-content-scraper branch October 16, 2025 04:01
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant