Web Security

The protective measures and protocols that organizations adopt to protect the organization from, cyber criminals and threats that use the web channel.

Using Defence In Depth approach: By increasing the number of layers, an attacker's chance of success is decreased. For example an applications which contains:

Sensitive data is encrypted when stored in the database
Application has an extensive input validation
Hosts are patched with the latest security patches
The internal network is segregated into different zones protected by firewall rules
A firewall separates the internal perimeter from the internet
Servers are located in a badge-reader protected room
Finally, security assessments are performed to check adherence

Vulnerabilities and Exploits

Preventing vulnerabilities

At the "Data" layer: Access controls, Encryption, Backup and restore procedures
At the "Application" layer: Authentication, Authorization, Auditing (otherwise known as AAA). Securing coding, and Hardening.
At the "Host" layer: Hardening, Authentication, Patch management, Antivirus
On the "Internal network": Network segmentation, IPsec, TLS, NAT
At the "Perimeter" layer: Firewall, TLS, Denial of service, prevention
And finally, you should consider "Physical Security" including: Guards, Locks, Tracking devices, and Badging systems

Threat Modelling

Threat Modelling is a way to identify, communicate, and understand security threats and mitigations

Different Threat Modelling

STRIDE

STRIDE

Developed by Microsoft, STRIDE is considered one of the most comprehensive ways to identify threats.

STRIDE stands for:

Spoofing: Authentication
- Spoofing is the impersonation of something or someone else
Tampering: Integrity
- Tampering happens when attacker modify data or code
Repudiation: Non-repudiation
- Repudiation occurs when someone claims not to have performed an action
Information disclosure: Confidentiality
- Information disclosure describes when information is exposed to users who are not authorized to see it
Denial of service: Availability
- Denial of service happens when service is denied or degraded
Elevation of privilege: Authorization
- Elevation of privilege means users can gain capabilities when they don't have proper authorization

Security

More complex the application (more ways to interact) more potential bugs: Combinatorial explosion of different ways an API can be used

What is Application Security?
- Security beyond infrastructure
- Integrating security tooling to DevOps a.k.a DevSecOps
- Determine security posture for application
- Understanding technical risks
- Providing technical recommendations
- Application security guidelines and standards for Group
- Knowledge sharing and presentations on security
Why is Application Security Important?
- PCI DSS.
- APRA Prudential Standard CPS 234 & Practice Guide CPG 234.

Common form of attacks:

Spam: Sent from legitimate IP address, less likely to be blocked
Denial of service: attack competitors, or seek ransom
Infect visiting users with malware: Infect one server, use it to infect hundreds of thousands of clients
Data theft: Steal credentials, credit card numbers, intellectual property

Different Parts of Web Security:

Browser security:
- Example: Same-Origin Policy - Isolate sites from each other, while running in the same browser
Server app security:
- Attackers can run arbitrary HTTP clients: can send anything to server
Client app security:
- Prevent user from being attacked while using web app locally
Protect the user:
- From social engineering
- From trackers, private data being leaked

Why is web security hard?

Social engineering is very effective
Extremely ambitious goal: Run untrusted code securely
Different sites interacting in the same tab like ads ("mashups")
Low-level features: hardware access
Desire for high performance
APIs were not designed from first principles: evolved
Strict backwards compatibility requirements
"Don't break the web"

"Modern web applications are built on a tangle of technologies that have been developed over time and then haphazardly pieced together. Every piece of the web application stack, from HTTP requests to browser-side scripts, comes with important yet subtle security consequences. To keep users safe, it is essential for developers to confidently navigate this landscape."
Tangled Web

The browser has a seemingly impossible task:

Sites - even malicious ones can:
Download content from anywhere
Spawn worker processes
Open sockets to a server, or even to another user's browser
Display media in huge number of formats
Run custom code on the GPU
Save/read data from the file-system

"It's all too easy to criticize, lament, and create paranoid scenarios about the 'unsound security foundations' of the web. Truth is, all of that criticism is true, and yet the web has proven to be an incredibly robust platform."
Ilya Grigorik, Google web performance engineer

The Five Stages of Hacking

Reconnaissance - Gathering Information
- Passive Reconnaissance
- Active Reconnaissance
Scanning and Enumeration
Gaining Access - Exploitation, Remote Connection Execution (RCE)
Maintaining Access
Covering Tracks

Reconnaissance

Passive Reconnaissance:
- Target Validation: WHOIS, nslookup, dnsrecon
- Finding Subdomains: Google Fu, dig, Nmap, Sublist3r, Bluto, crt.sh, etc.
- Fingerprinting: Nmap, Wappalyze, WhatWeb, BuiltWith, Netcat
- Data Breaches: HaveIBeenPwned and similar lists

OWASP Top 10

Insecure Direct Object References
Missing Function Level Access Control
Unvalidated Redirects and Forwards
Click Jacking attacks
Insufficient Attack Protection
Under-protected APIs

2021 List

Broken Access Control
Cryptographic Failures (2017 - 3)
Injection
Insecure Design
Security Misconfiguration
Vulnerable and Outdated Components (2017 - 9)
Identification and Authentication Failures (2017 - 2)
Software and Data Integrity Failures
Security Logging and Monitoring Failures
Server-Side Request Forgery (SSRF) (2017 - 10)

2017 List

Injection
Broken Authentication
Sensitive Data Exposure
XML External Entities
Broken Access Control (2013 - 4 & 7)
Security Misconfiguration
Cross-Site Scripting (XSS): Cheat sheet
Insecure De-serialization
Using Components with Known Vulnerabilities
Insufficient Logging & Monitoring

2013 List

Injection
Broken Authentication and Session Management
Cross-Site Scripting (XSS)
Insecure Direct Object References
Security Misconfiguration
Sensitive Data Exposure
Missing Function Level Access Control
Cross-Site Request Forgery (CSFR)
Using Components with Known Vulnerabilities
Unvalidated Redirects and Forwards

All cross-origin calls must return with Access-Control-* headers to indicate white-listed applications

Access-Control-Allow-Origin: List of origins allowed to connect to this API
Access-Control-Allow-Methods: List of methods allowed for the API

Pre-flight question:

Browser sends request with OPTION method set to receive CORS headers from backend
After verification, browser sends the actual request

Attributes	Simple Request	Pre-flighted Request
Methods	`GET`, `HEAD`, `POST`	`DELETE`, `PUT`, `PATCH`, `CONNECT`
Allowed Headers	`Accept`, `Accept-Language`,`Content-Language`, `Content-Type`, etc.	N/A
Non-Standard Headers	Not-allowed, Upgrade to Pre-flighted	N/A

Same-Site Policy

Site vs Origin:

Two URIs are part of the same origin, if they have the same scheme, host and port
Two URIs are part of the same site, if they have the same domain and suffix

Same Site:

text

https://foo.example.org
https://bar.example.org

Cross Site:

text

https://foo.github.io
https://bar.github.io

Same Site:

text

https://foo.bar.example.org
https://bar.example.org

SSP Applicable

Which is blocked?

Web Forms: applies

Scripts, images, etc. which remain constant:
html
```
<script src="https://cross-origin/my.js">
```

Cross-origin web forms:

html

<form action="https://cross-origin/getmyval" method="GET"></form>

JavaScript: applies
- Content operated via XMLHTTPRequest or Fetch (APIs):
  javascript
```
fetch("https://cross-origin/getmyval");
```

Security Headers

Mandatory security headers for apps:

Strict-Transport-Security enforces secure (HTTP over SSL/TLS) connections to the server
X-XSS-Protection enables the Cross-site scripting (XSS) filter built into most recent web browsers
Content-Security-Policy prevents a wide range of attacks, including Cross-site scripting and other cross-site injections
Reduce Fingerprinting:
- X-Powered-By:
MIME Sniffing Protection:
- When content type is:
  - Set: Some browsers (IE) try to verify content type with actual content payload
  - Not Set: Some browsers guess content type while others block loading of page
- Block browser MIME sniffing
- Blocks request if MIME type does not match tag types, e.g. script
- Preferred Header Settings:
http
```
X-Content-Type-Options: nosniff
```
Clickjacking Protection (Legacy):
- AKA UI redressing
- Attackers trick users into clicking hidden actionable content
  - CSS is used to create and manipulate layers
  - iframes are used to create hidden content
- Prevent framing of your site as iframe in another website:
  - Complete deny:
    http
```
X-Frame-Options: deny
```
  - Allow same origin:
    http
```
X-Frame-Options: sameorigin
```
  - Allow from specific origin:
    - Allow from is deprecated in some browsers and should not be relied upon - use in conjunction with CSP
    http
```
X-Frame-Options: allow-from https://normal-website.com
```
NOTE
Clickjacking attacks are not mitigated by CSRF tokens
Cache Headers:
- Properly configured caching header is to avoid having sensitive information stored in browser or proxy:
  - Cache-Control: Header to determine if page should be cached
  - Expires: Not relevant in modern browsers but needed to stay consistent
  - Pragma: This is older header and replaced by Cache-Control
http
```
Cache-Control: private, no-cache, no-store, max-age=0, no-transform
Pragma: no-cache
Expires: 0
```
NOTE
no-cache only works for proxies and no-store only works for browsers

You can cache:

Static or non-personalised content
Only GET request
Only cache HTTP 200 OK response

You can't cache:

Sensitive information/PII
Set cookies request
Cache-Control header that contains private and no-cache and no-store
Pragma header that contains private and no-cache and no-store

Examples of extensions to cache:

Web	Images	Font
css	jpg	woff
js	jpeg	tff
json (static only)	png	eot
	gif
	svg
	jpg

Regular expression Denial of Service - ReDoS

Regular expression Denial of Service

Check if regex is safe:

safe-regex

Unique Identifiers

Unique Identifiers are used to identify any entity uniquely. They are used in various places like:

Session ID
CSRF Token
API Key
User ID
Order ID

You can use sequential identifiers (like auto-incrementing integers) to identify entities. In a database, you can use a primary key to identify entities. But, sequential identifiers are predictable and can be easily guessed

For example, if you are using sequential identifiers for session IDs (sessionID: 123), an attacker can easily guess the next session ID (sessionID: 124) and hijack the session

To avoid this, you can use random identifiers (like UUIDs or GUIDs) to identify entities
They are hard to guess and are more secure

Why do we need unique identifiers?

To identify entities uniquely
To avoid conflicts
Increase security

What makes a good unique identifier?

Secure
Collision-resistant
Horizontally scalable
Offline generation
Identifiable
Unpredictable

Common forms of unique identifiers:

URI (Uniform Resource Identifier)
UUID (Universally Unique Identifier)
GUID (Globally Unique Identifier)
CUID (Collision-resistant Unique Identifier)
BIC (Bank Identifier Code)
UDID (Unique Device Identifier)
SSID (Service Set Identifier)

UUID

Universally Unique Identifier (UUID) is a 128-bit label that can be used across all computers and networks wherever a unique identifier is required. Such an identifier has a very low probability of being duplicated

Odds of collision for UUIDs are supposed to be one in a 1.7-quintillion (1.7 x 10^18)
RFC 4122 defines the UUID
UUIDs are also known as GUIDs (Globally Unique Identifier) as per RFC 4122

Format

A UUID is a 128-bit in size, in which 2-4 bits are used to indicate the format's variant. The most common variant in use, OSF DCE

Different versions of UUID:

V1 and V6 (date-time and MAC address): Based on the current time and MAC address
V2 (date-time and MAC address, DCE Security verion): Based on the POSIX UID/GID
V3 and V5 (namespace name-based): Based on a hash of a name
V4 (random): Generated randomly
V7 (timestamp and random): Designed for keys in high-load databases and distributed systems
V8 (custom): Custom version

Due to some older implementations of V4 UUID can't generate more than 10k ids without generating a collision

GUID

Globally Unique Identifier (GUID) is a 128-bit integer (16 bytes) that can be used across all computers and networks wherever a unique identifier is required. Such an identifier has a very low probability of being duplicated

Odds of collision for GUIDs are supposed to be one in a 5.3-quintillion (5.3 x 10^18)

There are two types of unique identifiers:

Sequential: Generated in sequence
Random: Generated randomly

CUID

Collision-resistant Unique Identifier (CUID) is a 128-bit integer (16 bytes) that can be used across all computers and networks wherever a unique identifier is required. Such an identifier has a very low probability of being duplicated

Snowflake ID

Twitter's Snowflake ID is a 64-bit unique identifier that is used to generate unique IDs at high scale. It is composed of:

41 bits for timestamp
10 bits for machine ID
12 bits for sequence number

Snowflake ID is used to generate unique IDs at high scale

Every snowflake ID has a unique structure, hence called snowflake ID

Example: Tweet by Wikipedia has a snowflake ID 1541815603606036480

Database Unique Identifier

When storing data into a database, you can use a unique identifier as primary key. You can use sequential identifiers (like auto-incrementing integers) or random identifiers (like UUIDs or GUIDs) to identify entities

If we use sequential identifiers, it is good for performance but it is predictable and can be easily guessed. Also in a distributed system, different nodes can generate the same identifier which can cause conflicts

If we use random identifiers, it is secure but it is bad for performance.

Resources

CS-241 and CS-253 (Stanford): YouTube Link
Adopting CSP

Web Security ​

Preventing vulnerabilities ​

Threat Modelling ​

STRIDE ​

Security ​

The Five Stages of Hacking ​

Reconnaissance ​

OWASP Top 10 ​

2021 List ​

2017 List ​

2013 List ​

Cross-Origin Resource Sharing (CORS) ​

Same-Site Policy ​

SSP Applicable ​

Security Headers ​

Regular expression Denial of Service - ReDoS ​

Unique Identifiers ​

UUID ​

Format ​

GUID ​

CUID ​

Snowflake ID ​

Database Unique Identifier ​

Resources ​

Web Security

Preventing vulnerabilities

Threat Modelling

STRIDE

Security

The Five Stages of Hacking

Reconnaissance

OWASP Top 10

2021 List

2017 List

2013 List

Cross-Origin Resource Sharing (CORS)

Same-Site Policy

SSP Applicable

Security Headers

Regular expression Denial of Service - ReDoS

Unique Identifiers

UUID

Format

GUID

CUID

Snowflake ID

Database Unique Identifier

Resources