What term is commonly used to describe malware that changes characteristics or rewrites its code between infections?

In computing, polymorphic code is code that uses a polymorphic engine to mutate while keeping the original algorithm intact - that is, the code changes itself every time it runs, but the function of the code (its semantics) will not change at all. For example, the simple math expressions 3+1 and 6-2 both achieve the same result, yet run with different machine code in a CPU. This technique is sometimes used by computer viruses, shellcodes and computer worms to hide their presence.[1]

Encryption is the most common method to hide code. With encryption, the main body of the code (also called its payload) is encrypted and will appear meaningless. For the code to function as before, a decryption function is added to the code. When the code is executed, this function reads the payload and decrypts it before executing it in turn.

Encryption alone is not polymorphism. To gain polymorphic behavior, the encryptor/decryptor pair is mutated with each copy of the code. This allows different versions of some code which all function the same.[2]

Malicious code[edit]

Most anti-virus software and intrusion detection systems (IDS) attempt to locate malicious code by searching through computer files and data packets sent over a computer network. If the security software finds patterns that correspond to known computer viruses or worms, it takes appropriate steps to neutralize the threat. Polymorphic algorithms make it difficult for such software to recognize the offending code because it constantly mutates.

Malicious programmers have sought to protect their encrypted code from this virus-scanning strategy by rewriting the unencrypted decryption engine (and the resulting encrypted payload) each time the virus or worm is propagated. Anti-virus software uses sophisticated pattern analysis to find underlying patterns within the different mutations of the decryption engine, in hopes of reliably detecting such malware.

Emulation may be used to defeat polymorphic obfuscation by letting the malware demangle itself in a virtual environment before utilizing other methods, such as traditional signature scanning. Such a virtual environment is sometimes called a sandbox. Polymorphism does not protect the virus against such emulation if the decrypted payload remains the same regardless of variation in the decryption algorithm. Metamorphic code techniques may be used to complicate detection further, as the virus may execute without ever having identifiable code blocks in memory that remains constant from infection to infection.

The first known polymorphic virus was written by Mark Washburn. The virus, called 1260, was written in 1990. A better-known polymorphic virus was created in 1992 by the hacker Dark Avenger as a means of avoiding pattern recognition from antivirus software. A common and very virulent polymorphic virus is the file infecter Virut.

Example[edit]

This example is not really a polymorphic code but will serve as an introduction to the world of encryption via the XOR operator. For example, in an algorithm using the variables A and B but not the variable C, there could be a large amount of code that changes C, and it would have no effect on the algorithm itself, allowing it to be changed endlessly and without heed as to what the final product will be.

Start: GOTO Decryption_Code Encrypted: ...lots of encrypted code... Decryption_Code: C = C + 1 A = Encrypted Loop: B = *A C = 3214 * A B = B XOR CryptoKey *A = B C = 1 C = A + B A = A + 1 GOTO Loop IF NOT A = Decryption_Code C = C^2 GOTO Encrypted CryptoKey: some_random_number

The encrypted code is the payload. To make different versions of the code, in each copy the garbage lines which manipulate C will change. The code inside "Encrypted" ("lots of encrypted code") can search the code between Decryption_Code and CryptoKey and each algorithm for new code that does the same thing. Usually, the coder uses a zero key (for example; A xor 0 = A) for the first generation of the virus, making it easier for the coder because with this key the code is not encrypted. The coder then implements an incremental key algorithm or a random one.

Polymorphic encryption[edit]

Polymorphic code can be also used to generate encryption algorithm. This code was generated by the online service StringEncrypt.[3] It takes the string or a file content and encrypts it with random encryption commands and generates polymorphic decryption code in one of the many supported programming languages:

// encrypted with //www.stringencrypt.com (v1.1.0) [C/C++] // szLabel = "Wikipedia" wchar_t szLabel[10] = { 0xB1A8, 0xB12E, 0xB0B4, 0xB03C, 0x33B9, 0xB30C, 0x3295, 0xB260, 0xB5E5, 0x35A2 }; for (unsigned tUTuj = 0, KRspk = 0; tUTuj < 10; tUTuj++) { KRspk = szLabel[tUTuj]; KRspk ^= 0x2622; KRspk = ~KRspk; KRspk --; KRspk += tUTuj; KRspk = (((KRspk & 0xFFFF) >> 3) | (KRspk << 13)) & 0xFFFF; KRspk += tUTuj; KRspk --; KRspk = ((KRspk << 8) | ( (KRspk & 0xFFFF) >> 8)) & 0xFFFF; KRspk ^= 0xE702; KRspk = ((KRspk << 4) | ( (KRspk & 0xFFFF) >> 12)) & 0xFFFF; KRspk ^= tUTuj; KRspk ++; KRspk = (((KRspk & 0xFFFF) >> 8) | (KRspk << 8)) & 0xFFFF; KRspk = ~KRspk; szLabel[tUTuj] = KRspk; } wprintf(szLabel);

As you can see in this C++ example, the string was encrypted and each character was stored in encrypted form using UNICODE widechar format. Different encryption commands were used like bitwise XOR, NOT, addition, subtraction, bit rotations. Everything is randomized, encryption keys, bit rotation counters and encryption commands order as well. Output code can be generated in C/C++, C#, Java, JavaScript, Python, Ruby, Haskell, MASM, FASM and AutoIt. Thanks to the randomization the generated algorithm is different every time. It's not possible to write generic decryption tools and the compiled code with polymorphic encryption code has to be analyzed each time it's re-encrypted.

Some tools use polymorphic encryption for data privacy, running multiple encryption algorithms to break up sensitive data into usable components. This allows users to generate and use large sets of data without decrypting them. [4][5]

See also[edit]

  • Timeline of notable computer viruses and worms
  • Metamorphic code
  • Self-modifying code
  • Alphanumeric shellcode
  • Shellcode
  • Software cracking
  • Security cracking
  • Obfuscated code
  • Oligomorphic code

References[edit]

  1. ^ Raghunathan, Srinivasan (2007). Protecting anti-virus software under viral attacks (M.Sc.). Arizona State University. CiteSeerX 10.1.1.93.796.
  2. ^ Wong, Wing; Stamp, M. (2006). "Hunting for Metamorphic Engines". Journal in Computer Virology. 2 (3): 211–229. CiteSeerX 10.1.1.108.3878. doi:10.1007/s11416-006-0028-7. S2CID 8116065.
  3. ^ Wójcik, Bartosz (2015). String & File Encryption
  4. ^ "Skyflow Polymorphic Encryption Enables Secure PII Data". Developer.com. 2021-07-21. Retrieved 2022-08-11.
  5. ^ "What is Polymorphic Encryption? - Skyflow". www.skyflow.com. Retrieved 2022-08-11.

  • Spinellis, Diomidis (January 2003). "Reliable identification of bounded-length viruses is NP-complete". IEEE Transactions on Information Theory. 49 (1): 280–4. doi:10.1109/TIT.2002.806137.

What is it called when malware changes its code?

A metamorphic virus is a type of malware that is capable of changing its code and signature patterns with each iteration.

What is polymorphism in malware?

A polymorphic virus is a harmful, destructive or intrusive type of malware that can change or "morph," making it difficult to detect with antimalware programs. Evolution of the malicious code can occur in a variety of ways such as filename changes, compression and encryption with variable keys.

Which virus type will rewrite itself after each new infection?

A metamorphic virus is one that can transform based on the ability to translate, edit and rewrite its own code. It is considered the most infectious computer virus, and it can do serious damage to a system if it isn't detected quickly.

Which type of mutational malware changes its code to mask the presence of the malware?

Polymorphic malware pairs a mutation engine with self-propagating code to continually change its “appearance,” and it uses encryption (or other methods) to hide its code. As you've guessed, this ability to change is not so great for you when you are the intended target of a malware-based cyber attack.

Bài Viết Liên Quan

Chủ đề