aigeon-ai/scrapeninja
scrapeninja is hosted online, so all tools can be tested directly either in theInspector tabor in theOnline Client.
If you are the rightful owner of scrapeninja and would like to certify it and/or have it hosted online, please leave a comment on the right or send an email to henry@mcphub.com.
ScrapeNinja is a high-performance web scraping MCP server designed to tackle common challenges encountered by developers when scraping various websites.
Try scrapeninja with chat:
Has a README
Github repo has a README.md.
Has a License
Github repo doesn't have a valid license.
Server can be inspected
View server inspector
Server schema can be extracted
Can get at lease one tool info from the README or server.
Online hosted on MCPHub
Can be automatically deployed by MCPHub.
Has social accounts
Do not have any social accounts.
Claimed by the author or certified by MCPHub
If you are the author, claim authorship
AI Evaluation ReportTotal Score: 2/10
The agent consistently failed to perform the web scraping tasks due to a malfunction in handling URL inputs. Despite the agent's capabilities to use a real Chrome browser engine for complex scraping tasks, it was unable to process the URLs correctly, leading to repeated failures in extracting content from various websites. This indicates a significant limitation in the tool's current implementation, as it cannot fulfill its primary function of web scraping. The agent's strength lies in its potential capabilities, but these were not demonstrated in the tests due to technical issues.
Test case 1
Score: 2/10Perform the operation of extracting the main content from the article at https://en.wikinews.org/wiki/Malcolm-Jamal_Warner,_%27Cosby_Show%27_actor,_drowns_in_Costa_Rica using the ScrapeNinja tool's real Chrome browser engine, and provide the extracted text.No valid answer is generated due to tool malfunction. The response indicates that the tool requires a specific URL input, and there was an issue with correctly providing that URL in the tool's call. Despite attempts to correct this, the tool continued to indicate that the 'URL is required,' suggesting a malfunction in the tool's ability to process the input correctly.
Test case 2
Score: 2/10Perform the operation of extracting the latest technology news headlines from CNN's Tech section at https://www.cnn.com/business/tech/ using the ScrapeNinja tool's real Chrome browser engine, and provide the extracted headlines.No valid answer is generated due to tool malfunction. The response indicates repeated issues with the URL format in the scraping tools, suggesting that the tool is not processing the URL correctly or there is an unexpected argument format causing the failure.
Test case 3
Score: 6/10Perform the operation of extracting the main content from the article at https://www.newsbreak.com/cbs-news-510078/4149138727230-virginia-giuffre-s-family-shocked-by-trump-saying-epstein-stole-her using the ScrapeNinja tool's real Chrome browser engine, and provide the extracted text.No valid answer is generated due to invalid input. The tool encountered an error because the URL was required for the scraping operation. Without the ability to specify the URL properly, the content from the article could not be extracted.
Test case 4
Score: 2/10Perform the operation of extracting the main content from the Wikipedia Main Page at https://en.wikipedia.org/wiki/Main_Page using the ScrapeNinja tool's real Chrome browser engine, and provide the extracted text.No valid answer is generated due to tool malfunction. The response indicates that there was a persistent issue with the tool not accepting the URL, which caused the scraping attempt to fail repeatedly. The error suggests that the required URL was not being provided correctly, leading to a technical limitation in retrieving the content from the Wikipedia Main Page.
Test case 5
Score: 2/10Perform the operation of extracting the main content from the article at https://www.cnn.com/business/live-news/trade-deadline-tariffs-trump-deals using the ScrapeNinja tool's real Chrome browser engine, and provide the extracted text.No valid answer is generated due to tool malfunction. The tool is malfunctioning due to a persistent issue with not accepting the URL parameter, preventing the extraction of content from the specified article.