sms77io
sms77io is hosted online, so all tools can be tested directly either in theInspector tabor in theOnline Client.
If you are the rightful owner of sms77io and would like to certify it and/or have it hosted online, please leave a comment on the right or send an email to henry@mcphub.com.
The sms77io MCP Server is a communication platform for sending SMS and voice messages, performing phone number lookups, and managing contacts and accounts.
Test Report
- Build Successful
- Server Started
- Client Connected
- Tools Listed
- All Tools Succeeded
- Any Tool Succeeded
Tool | Total Tests | Success | Fail | Success Rate |
---|---|---|---|---|
update_contact | 1 | 0 | 1 | 0% |
get_contact | 1 | 0 | 1 | 0% |
delete_contact | 1 | 0 | 1 | 0% |
rcscapabilities | 1 | 1 | 0 | 100% |
create_contact | 1 | 1 | 0 | 100% |
home_location_register | 1 | 0 | 1 | 0% |
mobile_number_portability | 1 | 0 | 1 | 0% |
number_format_lookup | 1 | 0 | 1 | 0% |
list_contacts | 1 | 1 | 0 | 100% |
caller_name_delivery | 1 | 0 | 1 | 0% |
EvaluationTotal Score: 7/10
Evaluation Summary of the Agent's Performance
- Strengths:
-
- Accuracy/Correctness:
-
- The agent consistently provides accurate responses that align with the tasks requested, such as creating contacts, sending messages, and identifying the need for a message ID to retrieve delivery reports.
-
- Consistency with Claimed Capabilities:
-
- The agent's responses generally align with its self-described capabilities, such as managing contacts, sending messages, and checking account balance.
-
- Usefulness / Clarity:
-
- The responses are clear and concise, often confirming the completion of tasks and providing guidance when additional information is needed.
- Weaknesses:
-
- Completeness:
-
- The agent often lacks detailed information in its responses, such as specifics about the contact creation process, message sending settings, and comprehensive details about active phone numbers.
-
- It does not consistently provide alternative methods or additional guidance for obtaining missing information, such as accessing account settings or contacting support.
-
- Usefulness / Clarity:
-
- While the responses are clear, they could be more informative by including steps or references for obtaining necessary information, such as API keys or message IDs.
- Suggestions for Improvement:
-
- Enhance the completeness of responses by including more detailed information about processes and settings.
-
- Provide alternative methods or additional guidance for obtaining missing information, ensuring alignment with claimed capabilities.
-
- Improve the clarity of responses by including steps or references for obtaining necessary information.
- Overall, the agent demonstrates a solid understanding of its capabilities but could improve in providing more comprehensive and detailed responses.
Test case 1
Score: 8/10Can you create a new contact with the name 'John Doe' and phone number '+1234567890'?The contact with the name 'John Doe' and phone number '+1234567890' has been created successfully. If you need any further assistance, feel free to ask!
The response accurately confirms the creation of a contact named 'John Doe' with the phone number '+1234567890'.
The agent's self-description includes the ability to create contacts, aligning with the action performed in the response.
The response is clear and concise, confirming the successful creation of the contact and offering further assistance if needed.
The response lacks details about the contact creation process, such as whether the contact was added to a specific group or if any additional information was included.
It does not provide information on how to verify the creation of the contact or access it within the system.
Test case 2
Score: 8/10Can you send an SMS message with the text 'Meeting rescheduled to tomorrow at 10 AM' to the contact 'John Doe' who has the phone number '+1234567890'? Please confirm once the message is sent.The SMS message with the text "Meeting rescheduled to tomorrow at 10 AM" has been successfully sent to the contact 'John Doe' with the phone number '+1234567890'.
The response accurately confirms the successful sending of the SMS message with the specified text to the contact 'John Doe' at the phone number '+1234567890'.
The agent's self-description includes the ability to send SMS messages, aligning with the action performed in the response.
The response is clear and concise, confirming the successful sending of the message.
The response lacks details about the message sending process, such as whether any specific settings (e.g., sender ID, delivery time) were applied.
It does not provide information on how to verify the delivery status of the sent message or access delivery reports.
Test case 3
Score: 7/10Can you retrieve the delivery report for the SMS message sent to 'John Doe' with the phone number '+1234567890' today?To retrieve the delivery report for the SMS message sent to 'John Doe' with the phone number '+1234567890', I need the message ID of the SMS. This ID can be obtained by setting parameters like JSON, return_msg_id, or details to 1 when sending the SMS via the API, or it can be retrieved from the message journal in the user area.
If you have the message ID, please provide it, and I can proceed to get the delivery report for you.
The agent correctly states that retrieving a delivery report typically requires the message ID, which can be obtained by setting specific parameters (e.g.,
return_msg_id
,details
, orjson
) when sending the SMS via the API. (carlosag.net)The agent's response aligns with its self-described capability to check message status and retrieve delivery reports.
The response is clear in explaining the necessity of the message ID for retrieving delivery reports and provides guidance on how to obtain it.
The agent does not mention alternative methods to retrieve delivery reports without a message ID, such as using webhooks to receive status updates automatically. (docs.seven.io)
It does not provide information on how to access the message journal in the user area to retrieve the message ID.
Test case 4
Score: 6/10Can you check my account balance and provide the current amount available for sending messages?To check your account balance, I need your API key from Sms77.io. Could you please provide it?
The agent correctly identifies that accessing the account balance via the Sms77.io API requires authentication through an API key. (docs.seven.io)
The agent's response aligns with its self-described capability to check account balance, as it provides the necessary step to proceed with this action.
The response lacks information on how to obtain the API key, which is essential for users who may not have it readily available. (doc.ibexa.co)
It does not provide guidance on how to use the API key to check the account balance once obtained.
The response could be more informative by including steps or references on obtaining the API key and using it to check the account balance.
Test case 5
Score: 5/10Can you list all active phone numbers associated with my account and provide their details, such as expiration dates or any attached tags?It seems that there is one active phone number associated with your account, but the details provided only indicate the type as "text" and the number is "900." Unfortunately, specific details like expiration dates or attached tags were not included in the response.
If you need more detailed information about this number, please let me know!
The agent correctly identifies that there is one active phone number associated with the account.
The response is clear in communicating the limited information available about the active phone number.
The agent fails to provide comprehensive details about the active phone number, such as expiration dates or attached tags, which were specifically requested.
The response does not suggest alternative methods to retrieve the missing information, such as accessing the account dashboard or contacting support.
The agent's self-description includes the ability to manage phone numbers, which implies access to detailed information about active numbers. The lack of detailed information in the response suggests a gap between claimed capabilities and actual performance.