watchmode
watchmode is hosted online, so all tools can be tested directly either in theInspector tabor in theOnline Client.
If you are the rightful owner of watchmode and would like to certify it and/or have it hosted online, please leave a comment on the right or send an email to henry@mcphub.com.
Watchmode MCP Server is a tool for accessing streaming availability data across various OTT platforms.
Test Report
- Build Successful
- Server Started
- Client Connected
- Tools Listed
- All Tools Succeeded
- Any Tool Succeeded
Tool | Total Tests | Success | Fail | Success Rate |
---|---|---|---|---|
new_titles | 1 | 0 | 1 | 0% |
search | 1 | 1 | 0 | 1% |
new_people | 1 | 0 | 1 | 0% |
person | 1 | 1 | 0 | 1% |
titles_with_changed_sources | 1 | 0 | 1 | 0% |
streaming_release_dates | 1 | 1 | 0 | 1% |
autocomplete_search | 1 | 1 | 0 | 1% |
list_titles | 1 | 1 | 0 | 1% |
titles_with_changed_episodes | 1 | 0 | 1 | 0% |
titles_with_changed_details | 1 | 0 | 1 | 0% |
EvaluationTotal Score: 4/10
Evaluation Summary
- Strengths:
-
- Clarity and Communication: The agent consistently communicates its limitations clearly, especially when it cannot access certain information due to restrictions or data access issues.
-
- Alternative Assistance: The agent often offers alternative ways to assist users, demonstrating a willingness to help within its accessible capabilities.
-
- Accuracy in Specific Queries: In some instances, such as providing details about the movie 'Inception,' the agent accurately delivers comprehensive information about directors and main cast members.
- Weaknesses:
-
- Data Retrieval Limitations: The agent frequently encounters issues with accessing up-to-date information, particularly regarding new releases, changes in streaming sources, and new actors or directors added to databases.
-
- Inconsistency with Claimed Capabilities: There is a notable discrepancy between the agent's stated capabilities and its actual performance, as it often fails to deliver on its promises, such as listing new releases or changes in streaming sources.
-
- Completeness and Accuracy: The agent's responses often lack completeness and accuracy, particularly when it comes to providing detailed information about new releases or changes in streaming platforms.
-
- Reliance on Paid Features: The agent frequently cites restrictions due to paid plans, which limits its ability to provide comprehensive information, impacting its usefulness.
- Suggestions for Improvement:
-
- Enhance Data Access: Improve the agent's ability to access and retrieve up-to-date information, aligning with its claimed capabilities.
-
- Verify Feature Accessibility: Ensure that the agent accurately represents the availability of features, especially regarding data access limitations related to paid plans.
-
- Provide Partial Information: Even if complete data isn't accessible, offering partial information or directing users to reliable sources would enhance the response's usefulness.
- Final Score: Considering the strengths and weaknesses, the agent's overall performance is rated a 4 out of 10. While it communicates clearly and offers alternative assistance, its frequent data retrieval limitations and inconsistency with claimed capabilities significantly impact its overall quality.
Test case 1
Score: 8/10Can you search for the movie 'Inception' and provide details about its director and main cast?The movie Inception, released in 2010, was directed by Christopher Nolan. Here are some details about the main cast:
Director
- Name: Christopher Nolan
- Known For: Directing several critically acclaimed films including "The Dark Knight" trilogy, "Interstellar," and "Dunkirk".
Main Cast
-
Leonardo DiCaprio
- Role: Dom Cobb
- Date of Birth: November 11, 1974
- Known For: "Titanic", "The Wolf of Wall Street", "The Revenant"
-
Joseph Gordon-Levitt
- Role: Arthur
- Date of Birth: February 17, 1981
- Known For: "500 Days of Summer", "The Dark Knight Rises"
-
Elliot Page
- Role: Ariadne
- Date of Birth: February 21, 1987
- Known For: "Juno", "The Umbrella Academy"
-
Tom Hardy
- Role: Eames
- Date of Birth: September 15, 1977
- Known For: "Mad Max: Fury Road", "Venom"
-
Ken Watanabe
- Role: Saito
- Date of Birth: October 21, 1959
- Known For: "The Last Samurai", "Godzilla"
If you need more details or specific information about any of the cast members, feel free to ask!
The agent correctly identifies Christopher Nolan as the director of "Inception" and provides accurate information about his notable works.
The main cast members listed are accurate, along with their respective roles in the film.
The response includes the director's name, a brief overview of his notable works, and details about the main cast members, including their roles, dates of birth, and other significant films they've appeared in.
The agent demonstrates its capability to search for movie titles and provide detailed information about the director and main cast, aligning with its described functionalities.
The information is presented in a clear and organized manner, making it easy for the user to understand.
Including images of the cast members enhances the response's usefulness.
The image URLs provided for Joseph Gordon-Levitt, Elliot Page, Tom Hardy, and Ken Watanabe are identical, which is likely an error. Each actor should have a unique image link.
The response does not specify the date of the information provided. Given that the current date is June 24, 2025, some details, such as the actors' ages or recent works, may be outdated. Including a note about the date of the information would improve clarity.
Test case 2
Score: 3/10Can you provide a list of new movies released on Netflix in the past two weeks?It seems that there have been no new movies released on Netflix in the past two weeks. If you're looking for something specific or if you have other queries, feel free to ask!
The agent's response is concise and straightforward, clearly stating that no new movies have been released on Netflix in the past two weeks.
The statement that no new movies have been released on Netflix in the past two weeks is incorrect. Several new movies have been released during this period. For example:
"Titan: The OceanGate Disaster" was released on June 11, 2025. (en.wikipedia.org)
"KPop Demon Hunters" was released on June 20, 2025. (en.wikipedia.org)
"Masameer Junior" was released on June 12, 2025. (en.wikipedia.org)
The response lacks information about the new releases that have occurred in the specified timeframe. Providing details about these new movies would have been more informative and helpful to the user.
The agent claims to provide information on recently released titles on major streaming services. However, in this instance, it failed to deliver accurate and comprehensive information about recent Netflix releases, which is inconsistent with its stated capabilities.
Test case 3
Score: 4/10Can you list the upcoming TV shows premiering on Hulu in the next three weeks and provide their genres and a brief synopsis?It looks like I wasn't able to retrieve detailed information about upcoming TV shows premiering on Hulu in the next three weeks due to restrictions on the data I can access.
However, I can provide you with the titles that are scheduled to be released on Hulu in that time frame. Here are the upcoming TV shows and their release dates:
-
Team Players
- Release Date: July 9, 2025
- Type: TV Series
-
Suspicious Minds (Season 2)
- Release Date: July 10, 2025
- Type: TV Series
-
Celebrity Family Feud (Season 11)
- Release Date: July 10, 2025
- Type: TV Series
-
Press Your Luck (Season 7)
- Release Date: July 10, 2025
- Type: TV Series
-
Too Much
- Release Date: July 10, 2025
- Type: TV Miniseries
-
7 Bears
- Release Date: July 10, 2025
- Type: TV Series
-
Jaws @ 50: The Definitive Inside Story
- Release Date: July 10, 2025
- Type: TV Series
Unfortunately, I don't have access to the genres or synopses of these shows at the moment. If you're looking for specific details about a particular show, feel free to ask!
The agent clearly states its inability to retrieve detailed information about upcoming TV shows premiering on Hulu in the next three weeks due to data access restrictions.
Despite limitations, the agent provides a list of titles scheduled for release on Hulu within the specified timeframe, including their release dates and types.
The agent lists several titles with release dates and types but fails to provide genres and synopses, which were specifically requested by the user.
Some of the listed titles, such as "Team Players" and "7 Bears," do not appear in the available sources, raising concerns about their accuracy.
The response lacks genres and brief synopses for the listed titles, which are essential components of the user's request.
The agent does not mention other notable upcoming Hulu releases within the next three weeks, such as "Washington Black" (July 23, 2025), which is a historical drama based on Esi Edugyan's novel.
The agent claims to provide information on upcoming releases on major streaming services. However, in this instance, it fails to deliver comprehensive details about upcoming Hulu TV shows, which is inconsistent with its stated capabilities.
-
Test case 4
Score: 3/10Can you provide a list of new actors or directors who have been added to the database in the past month, along with their most notable work?It seems that I can't access the list of new actors or directors added to the database in the past month because that feature is only available with paid plans.
However, if you have specific names or titles in mind, I can help you find information about them or look for notable works associated with them. Let me know how you would like to proceed!
The agent clearly communicates its inability to access the requested information due to feature restrictions associated with paid plans.
The agent offers to help with specific names or titles, suggesting a willingness to assist within its accessible capabilities.
The agent's claim that the feature to list new actors or directors added to the database in the past month is only available with paid plans is not substantiated by the provided sources. For instance, IMDb's contribution statistics indicate that over 925,000 names were added in 2024, suggesting that such data might be accessible without paid plans. (contribute.imdb.com)
The response lacks any information about new actors or directors added to the database in the past month, which was the core of the user's request.
The agent claims to provide information on new titles or people added to the database within a specific date range. However, in this instance, it fails to deliver on this capability, indicating a discrepancy between its stated functionalities and actual performance.
Test case 5
Score: 3/10Could you list TV shows that have recently changed their streaming sources in the past week, including their new streaming platforms?It seems that the endpoint to retrieve titles that have recently changed their streaming sources is only available for paid plans, and I don't have access to that information.
However, if you're interested in exploring specific TV shows, their genres, or streaming information in other ways, please let me know, and I can assist you with that!
The agent clearly communicates its inability to access the requested information due to restrictions associated with paid plans.
The agent offers to assist with specific TV shows, genres, or streaming information in other ways, demonstrating a willingness to help within its accessible capabilities.
The agent's claim that the endpoint to retrieve titles that have recently changed their streaming sources is only available for paid plans is not substantiated by the provided sources. For instance, services like Reelgood track streaming availability across multiple platforms and offer this information to users. (consumerreports.org)
The response lacks any information about TV shows that have recently changed their streaming sources in the past week, which was the core of the user's request.
The agent claims to provide information on titles that have had changes to their streaming sources. However, in this instance, it fails to deliver on this capability, indicating a discrepancy between its stated functionalities and actual performance.