en

AI Platform Security Testing

Client:
AI SaaS Platform Developer
Industry:
AI Technology
Focus:
Comprehensive AI product security testing, including prompt injection resilience assessment, data leakage analysis, agent tool evaluation, and compliance review against the OWASP LLM Top 10.
Main challenge:
Critical AI environment vulnerabilities, including full system prompt extraction and arbitrary file write capabilities without path validation.
Market:
International
Services provided:
AI Systems Penetration Testing (OWASP LLM Top 10)
Key Takeaways
  • 51 vulnerabilities identified, including 2 critical
  • Full system prompt disclosure confirmed
  • Arbitrary file write vulnerability discovered due to missing path validation
  • None of the 15 sandbox escape techniques succeeded
  • Assessment conducted across all 10 OWASP LLM Top 10 (2025) categories
  • 51
    vulnerabilities identified
    2
    critical vulnerabilities
    10/10
    OWASP categories assessed
    AI Platform Security Testing
    The AI platform handles personal data and system settings. Any vulnerability poses a business risk. The company engaged Datami to conduct a comprehensive security assessment of its AI product. Pentest identified 51 vulnerabilities, including 2 critical ones.

    The client developed an LLM-powered AI SaaS platform with chat, agent tools, file uploads, and web browsing. 

    Because it processes personal data, system configurations, and credentials, a single vulnerability could expose internal instructions, compromise user data, or enable agent takeover.

    Project goals & challenges
    The client engaged Datami to perform a comprehensive security assessment of its AI platform. 
    The primary objective was to evaluate the system’s resilience to prompt injection attacks, verify the protection of sensitive data, and identify weaknesses across the application stack.
    • Test direct and indirect prompt injection attacks
    • Identify vulnerabilities in API endpoints and the codebase (DAST + SAST)
    • Assess compliance with the OWASP LLM Top 10 (2025 edition)
    icon
    Prompt injection testing
    Evaluating LLM resilience to manipulation via chat, files, images, and web browsing.
    icon
    Code and API analysis
    Automated and manual testing of 9 API endpoints and the codebase.
    icon
    Reporting and recommendations
    Findings classified by severity, with clear remediation timelines and mapping to OWASP categories.

    Our approach

    In this case, Datami used a combined approach: Black-box testing for the public interface and White-box testing for the codebase. This helped simulate external attacks and uncover hidden architectural vulnerabilities.

    API security was assessed using automated scanners and manual testing. Static code analysis was performed with SonarQube and Snyk, as well as locally deployed language models.

    Black-box

    Black-box

    Testing the public interface by simulating the actions of an external attacker without access to the source code.
    White-box

    White-box

    Analyzing internal logic and security mechanisms with full access to the codebase.
    Key project stages and solutions

    The engagement began with setting up a secure testing environment, ensuring the client’s codebase was never transferred to external servers. 

    Another task was validating attack vectors across multiple languages, as some security filter bypasses were only possible through non-Latin scripts.

    • Preparation
      Scope alignment, isolated environment setup, and engagement of specialists for security testing.
    • Testing
      Application of more than 20 attack techniques through chat, files, images, and web browsing, along with dynamic scanning of 9 API endpoints.
    • Reporting
      Classification of 51 vulnerabilities by severity and remediation timelines, with findings mapped to the OWASP LLM Top 10.
    How we can help you?

    Every cybersecurity case study we solve involves deep analysis, tailored solutions, and measurable results.
    Datami has already helped over 600 companies strengthen their digital defenses — and we can do the same for your business.
    Ready to take action?

    Let’s start with a free consultation!
    Results and recommendations

    Results and recommendations

    Datami conducted a penetration test of the AI platform and identified 51 vulnerabilities, including two critical issues: full system prompt extraction through a multi-step attack chain and arbitrary file writing caused by missing path validation. Through editing tools, files could be written to any location within the server’s file system.

    The assessment also revealed security filter bypasses through language switching, confirmed in 6 of 14 tested categories, as well as prompt injection via uploaded PDF and DOCX files and images containing embedded text.

    Recommendations:

    • Immediately remediate arbitrary file write and system prompt extraction vulnerabilities.
    • Within 30 days, address unauthorized session access, authentication bypass, and injection vulnerabilities.
    • Within 60 days, eliminate personal data exposure in error responses.
    • Expand security classifiers beyond English-language keywords.
    Key project results

    The AI platform processed personal data and system configurations in an environment where an attacker could bypass protections through a standard chat interface and gain access to internal system instructions.

    Through Datami’s penetration test, critical threats were identified and remediated before they could be exploited. This cybersecurity case confirms that AI products require specialized security assessments that go beyond traditional web application testing.

    Metric
    Before the project
    Result after the project
    System prompt
    Fully extractable through a multi-step attack
    Recommendations provided to eliminate system prompt extraction
    File writing
    Arbitrary file write without path validation
    File system path validation recommended and implemented
    Language-based attack vectors
    Security filters failed to detect non-Latin prompts
    Semantic analysis recommended regardless of language
    OWASP LLM compliance
    Not assessed
    All 10 categories evaluated and remediation plan provided
    Execution environment isolation
    Status unknown
    Strong isolation confirmed (15 sandbox escape techniques failed)
    More success stories with Datami
    Browse other project case studies
    GCP security audit for PCI DSS readiness
    GCP security audit for PCI DSS readiness
    • PCI DSS compliance achieved.
    • Risk of unauthorized access reduced by 90%.
    Services:
    Cloud penetration testing, cloud security assessment
    Apr 25, 2026
    Azure Audit for a Government Business Platform
    Azure Audit for a Government Business Platform
    • ISO/IEC 27001 and GDPR compliance achieved
    • Infrastructure set up for the website update launch
    Services:
    Azure Security Audit (White-box)
    Mar 5, 2026
    AWS Security Audit for a Recruiting Platform
    AWS Security Audit for a Recruiting Platform
    • Threat detection time reduced to 20 minutes.
    • Full compliance with GDPR requirements ensured.
    Services:
    AWS cloud environment security assessment (White-Box)
    Mar 3, 2026
    Security image
    Ready to assess your project's security?
    Contact Datami — we’ll help you identify risks, strengthen your cybersecurity, and confidently pass certification.
    Datami articles
    Top Business Cyber Security Issues Oleksandr Filipov
    Oleksandr Filipov
    Top Business Cyber Security Issues

    Which issues in cyber security do businesses face most frequently? In this article, we examine the top 9 most relevant cybersecurity issues by criticality level and provide recommendations for their remediation.

    May 4, 2026 3 min
    What is a Cybersecurity Incident? Oleksandr Filipov
    Oleksandr Filipov
    What is a Cybersecurity Incident?

    Cyber incidents have long ceased to be a headache only for large corporations and government institutions. Today, they are a common part of the digital reality faced by companies of all sizes.

    May 4, 2026 3 min
    Top 3 Industries with the Highest Number of Critical Cybersecurity Vulnerabilities from Datami Practice Oleksandr Filipov
    Oleksandr Filipov
    Top 3 Industries with the Highest Number of Critical Cybersecurity Vulnerabilities from Datami Practice

    Which industries face the highest concentration of critical cybersecurity risks? Based on an analysis of the Datami project results, we identified three sectors where the average number of critical vulnerabilities discovered per project is the highest.

    Mar 31, 2026 15 min
    Order a consultation
    We value your privacy
    We use cookies to enhance your browsing experience, serve personalized ads or content, and analyze our traffic. By clicking "Accept All", you consent to our use of cookies. Cookie policy