scrapeninja

aigeon-ai/scrapeninja

3.8

scrapeninja is hosted online, so all tools can be tested directly either in theInspector tabor in theOnline Client.

If you are the rightful owner of scrapeninja and would like to certify it and/or have it hosted online, please leave a comment on the right or send an email to henry@mcphub.com.

ScrapeNinja is a high-performance web scraping MCP server designed to tackle common challenges encountered by developers when scraping various websites.

Try scrapeninja with chat:

MCPHub score:3.78

Has a README

Github repo has a README.md.

Has a License

Github repo doesn't have a valid license.

Server can be inspected

View server inspector

Server schema can be extracted

Can get at lease one tool info from the README or server.

Online hosted on MCPHub

Can be automatically deployed by MCPHub.

Has social accounts

Do not have any social accounts.

Claimed by the author or certified by MCPHub

If you are the author, claim authorship

AI Evaluation Report
Total Score: 2/10

The agent consistently failed to perform the web scraping tasks due to a malfunction in handling URL inputs. Despite the agent's capabilities to use a real Chrome browser engine for complex scraping tasks, it was unable to process the URLs correctly, leading to repeated failures in extracting content from various websites. This indicates a significant limitation in the tool's current implementation, as it cannot fulfill its primary function of web scraping. The agent's strength lies in its potential capabilities, but these were not demonstrated in the tests due to technical issues.

  • Test case 1
    Score: 2/10
    Perform the operation of extracting the main content from the article at https://en.wikinews.org/wiki/Malcolm-Jamal_Warner,_%27Cosby_Show%27_actor,_drowns_in_Costa_Rica using the ScrapeNinja tool's real Chrome browser engine, and provide the extracted text.

    No valid answer is generated due to tool malfunction. The response indicates that the tool requires a specific URL input, and there was an issue with correctly providing that URL in the tool's call. Despite attempts to correct this, the tool continued to indicate that the 'URL is required,' suggesting a malfunction in the tool's ability to process the input correctly.

  • Test case 2
    Score: 2/10
    Perform the operation of extracting the latest technology news headlines from CNN's Tech section at https://www.cnn.com/business/tech/ using the ScrapeNinja tool's real Chrome browser engine, and provide the extracted headlines.

    No valid answer is generated due to tool malfunction. The response indicates repeated issues with the URL format in the scraping tools, suggesting that the tool is not processing the URL correctly or there is an unexpected argument format causing the failure.

  • Test case 3
    Score: 6/10
    Perform the operation of extracting the main content from the article at https://www.newsbreak.com/cbs-news-510078/4149138727230-virginia-giuffre-s-family-shocked-by-trump-saying-epstein-stole-her using the ScrapeNinja tool's real Chrome browser engine, and provide the extracted text.

    No valid answer is generated due to invalid input. The tool encountered an error because the URL was required for the scraping operation. Without the ability to specify the URL properly, the content from the article could not be extracted.

  • Test case 4
    Score: 2/10
    Perform the operation of extracting the main content from the Wikipedia Main Page at https://en.wikipedia.org/wiki/Main_Page using the ScrapeNinja tool's real Chrome browser engine, and provide the extracted text.

    No valid answer is generated due to tool malfunction. The response indicates that there was a persistent issue with the tool not accepting the URL, which caused the scraping attempt to fail repeatedly. The error suggests that the required URL was not being provided correctly, leading to a technical limitation in retrieving the content from the Wikipedia Main Page.

  • Test case 5
    Score: 2/10
    Perform the operation of extracting the main content from the article at https://www.cnn.com/business/live-news/trade-deadline-tariffs-trump-deals using the ScrapeNinja tool's real Chrome browser engine, and provide the extracted text.

    No valid answer is generated due to tool malfunction. The tool is malfunctioning due to a persistent issue with not accepting the URL parameter, preventing the extraction of content from the specified article.