High-level programming language
A high-level programming language is designed to be easy for humans to write and read. It hides the complex details of how the computer actually works, letting you focus on the program's logic instead of the machine's internals. The amount of abstraction provided defines how "high-level" a programming language is.[1]
High-level refers to a level of abstraction from the hardware details of a processor inherent in machine and assembly code. Rather than dealing with registers, memory addresses, and call stacks, high-level languages deal with variables, arrays, objects, arithmetic and Boolean expressions, functions, loops, threads, locks, and other computer science abstractions, intended to facilitate correctness and maintainability. Unlike low-level assembly languages, high-level languages have few, if any, language elements that translate directly to a machine's native opcodes. Other features, such as string handling, object-oriented programming features, and file input/output, may also be provided. A high-level language allows for source code that is detached and separated from the machine details. That is, unlike low-level languages like assembly and machine code, high-level language code may result in data movements without the programmer's knowledge. Some control of what instructions to execute is handed to the compiler.
History
In the 1960s, the term autocode was commonly used to describe a high-level programming language that relied on a compiler. Notable examples of such autocodes include COBOL and Fortran.
COBOL (Common Business-Oriented Language)
- Developer: Committee, with Grace Hopper's influence (1959)
- Explanation: Like FORTRAN, COBOL is a monumental successor to the autocode idea, but for business data processing. Its defining feature was its syntax, which was designed to read almost like English.
- Example: ADD PRICE TO TOTAL. This made it possible for business professionals (like managers and accountants) to read and understand the program's logic, even if they weren't expert programmers. COBOL's English-like structure is a direct evolution of the "autocode" goal of being more human-readable.
FORTRAN (Formula Translation)
- Developer: John Backus and team at IBM (1957)
- Explanation: While not always labeled an "autocode" today, FORTRAN is the most famous and successful direct descendant of the autocode concept. It was designed specifically for scientists and engineers. Its key innovation was allowing programmers to write instructions using familiar mathematical formulas.
- Example: Instead of writing complex machine instructions, a programmer could simply write X = (A + B) / C. The FORTRAN compiler would then translate this into efficient machine code. Its success proved that high-level languages could be both practical and efficient.
The first high-level language to achieve widespread adoption was Fortran, a machine-independent evolution of IBM’s earlier Autocode systems. Around the same time, the ALGOL family emerged—ALGOL 58 in 1958 and ALGOL 60 in 1960—created by joint committees of European and American computer scientists. ALGOL introduced key innovations such as recursion, nested functions under lexical scope, and a clear distinction between value and name parameters with their respective semantics. It also pioneered several structured programming concepts, including the while-do loop and if-then-else statements, and became the first language whose syntax was formally defined using Backus–Naur Form (BNF).
Meanwhile, COBOL brought the concept of records (also known as structs) into mainstream programming, and Lisp became the first language to implement a fully general lambda abstraction.
Abstraction penalty
A high-level language provides features that standardize common tasks, permit rich debugging, and maintain architectural agnosticism. A low-level language, on the other hand, requires the coder to work at a lower level of abstraction, which is generally more challenging but allows for optimizations that are not possible in a high-level language. This abstraction penalty for using a high-level language instead of a low-level language is real, but in practice low-level optimizations rarely improve performance at the user-experience level.[2][3][4] Nonetheless, code that needs to run quickly and efficiently may require the use of a lower-level language, even if a higher-level language would make the code easier to write and maintain. In many cases, critical portions of a program otherwise written in a high-level language are coded in assembly in order to meet tight timing or memory constraints. A well-designed compiler for a high-level language can produce code comparable in efficiency to what could be coded by hand in assembly, and the higher-level abstractions sometimes allow for optimizations that beat the performance of hand-coded assembly.[5] Since a high-level language is designed independently of a specific computer architecture, a program written in such a language can run in any computing context with a compatible compiler or interpreter.
Unlike a low-level language that is inherently tied to processor hardware, a high-level language can be improved, and new high-level languages can evolve from others with the goal of aggregating the most popular constructs with improved features. For example, Scala maintains backward compatibility with Java: code written in Java continues to be usable even if a developer switches to Scala. This makes the transition easier and extends the lifespan of a codebase. In contrast, low-level programs rarely survive beyond the system architecture for which they were written.
Relative meaning
The terms high-level and low-level are inherently relative, and languages can be compared as higher or lower level than each other. The C language, for example, is considered either high-level or low-level depending on one's perspective; most agree that C is higher level than assembly and lower level than most other languages.
C supports constructs such as expression evaluation, parameterized and recursive functions, and data types and structures, which are generally not supported in assembly or directly by a processor, but it also provides lower-level features such as auto-increment and pointer arithmetic. At the same time, C lacks many higher-level abstractions common in other languages, such as garbage collection and a built-in string type. In the introduction to The C Programming Language (second edition), Brian Kernighan and Dennis Ritchie describe C as "not a very high level" language.[6]
Assembly language is higher-level than machine code, but still highly tied to the processor hardware. However, assembly may provide some higher-level features such as macros, relatively limited expressions, constants, variables, procedures, and data structures.
Machine code is at a slightly higher level abstraction than the microcode or micro-operations used internally in many processors.[7]
Execution modes
The source code of a high-level language may be processed in various ways, such as:
- Compiled
- A compiler transforms source code into other code. In some cases, a compiler generates native machine code that is executed directly by the processor; however, many execution models today involve generating an intermediate representation (i.e., bytecode) that is later interpreted in software or converted to native code at runtime (via JIT compilation).
- Transpiled
- Code may be translated into source code of another language (typically a lower-level one) for which a compiler or interpreter is available. JavaScript and C are common targets for such translators. For example, the EiffelStudio IDE generates C or C++ code from Eiffel source. In Eiffel, this process is referred to as transcompiling, and the Eiffel compiler as a transcompiler or source-to-source compiler.
- Software interpreted
- A software interpreter performs the actions encoded in source code without generating native machine code.
- Hardware interpreted
- Although uncommon, a processor with a high-level language computer architecture can process a high-level language without a compilation step. For example, the Burroughs large systems were target machines for ALGOL 60.[8]
Note that a language is not strictly interpreted or compiled. Rather, an execution model involves a compiler or an interpreter and the same language might be used with different execution models. For example, ALGOL 60 and Fortran have both been interpreted even though they were more typically compiled. Similarly, Java shows the difficulty of trying to apply these labels to languages, rather than to implementations. Java is compiled to bytecode which is then executed by either interpreting in a Java virtual machine (JVM) or JIT compiled.
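CPython illustrates this mixed model concretely: source code is compiled to bytecode, which a software interpreter then executes. A minimal sketch using the standard dis module (the statement mirrors the FORTRAN example above):

```python
import dis

# Compile a high-level statement to CPython bytecode, the intermediate
# representation that the interpreter (not the hardware) executes.
code = compile("x = (a + b) / c", "<example>", "exec")

# Disassemble it to show the stack-machine instructions
# (exact opcode names vary by Python version).
dis.dis(code)
```

The same statement run under a JIT-based implementation such as PyPy would instead be translated to native code at runtime, underscoring that compiled versus interpreted describes the implementation, not the language.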
References
- ^ "HThreads - RD Glossary". Archived from the original on 26 August 2007.
- ^ Surana P (2006). "Meta-Compilation of Language Abstractions" (PDF). Archived (PDF) from the original on 17 February 2015. Retrieved 17 March 2008.
- ^ Kuketayev, Argyn. "The Data Abstraction Penalty (DAP) Benchmark for Small Objects in Java". Application Development Trends. Archived from the original on 11 January 2009. Retrieved 17 March 2008.
- ^ Chatzigeorgiou; Stephanides (2002). "Evaluating Performance and Power Of Object-Oriented Vs. Procedural Programming Languages". In Blieberger; Strohmeier (eds.). Proceedings - 7th International Conference on Reliable Software Technologies - Ada-Europe'2002. Springer. p. 367.
- ^ Manuel Carro; José F. Morales; Henk L. Muller; G. Puebla; M. Hermenegildo (2006). "High-level languages for small devices: a case study" (PDF). Proceedings of the 2006 International Conference on Compilers, Architecture and Synthesis for Embedded Systems. ACM.
- ^ Kernighan, Brian W.; Ritchie, Dennis M. (1988). The C Programming Language: 2nd Edition. Prentice Hall. ISBN 9780131103627. Archived from the original on 25 October 2022. Retrieved 25 October 2022.
- ^ Hyde, Randall (2010). The Art of Assembly Language (2nd ed.). San Francisco: No Starch Press. ISBN 9781593273019. OCLC 635507601.
- ^ Chu, Yaohan (1975), "Concepts of High-Level Language Computer Architecture", High-Level Language Computer Architecture, Elsevier, pp. 1–14, doi:10.1016/b978-0-12-174150-1.50007-0, ISBN 9780121741501
External links
- http://c2.com/cgi/wiki?HighLevelLanguage - The WikiWikiWeb's article on high-level programming languages
Definition and Fundamentals
Core Definition
A high-level programming language is a type of programming language designed to facilitate human-readable code through syntax resembling natural language, symbolic operators, and advanced control structures such as loops and conditionals, thereby abstracting away low-level hardware specifics like registers, memory addresses, and direct processor instructions.[7] This abstraction enables developers to express computational intent without delving into the intricacies of machine architecture, making software development more accessible and efficient.[2] In contrast to low-level languages like assembly, which demand explicit management of hardware resources and result in code that is machine-specific and verbose, high-level languages prioritize expressiveness and portability, allowing a single program to be compiled or interpreted across diverse hardware platforms without modification.[8] For instance, a task that might require dozens of assembly instructions to manipulate memory can often be accomplished in one or two high-level statements, enhancing developer productivity while reducing error-prone details.[7] Core attributes of high-level languages include their emphasis on human readability, which supports easier comprehension and maintenance of code, and a primary focus on algorithmic logic and problem-solving rather than granular hardware manipulation.[7] The concept emerged in the 1950s, with FORTRAN, developed by IBM, marking the advent of the first such language to translate formulaic expressions into machine code, thereby revolutionizing scientific computing by insulating users from binary-level programming.[9]
Key Characteristics
High-level programming languages are distinguished by their syntax, which employs English-like keywords and natural language constructs to enhance readability and reduce the cognitive load on developers. For instance, commands such as print in Python or if statements mimic everyday phrasing, while structural elements like indentation in Python or curly braces {} in languages like C++ delineate code blocks without requiring machine-specific notations such as binary or hexadecimal addressing. This design avoids the intricate register manipulations and memory addressing typical of low-level code, allowing programmers to focus on logic rather than hardware details.[7][10]
These languages incorporate built-in data types, including primitives like integers, strings, and booleans, as well as composite structures such as arrays and lists, which abstract away low-level memory allocation and management. Control structures like loops (for, while) and conditionals (if-else) are predefined to automate repetitive tasks and decision-making, often with automatic handling of bounds checking and type conversions to prevent common errors. Functions and procedures further encapsulate reusable code, promoting efficiency in development.[7][11]
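As a minimal sketch of these conveniences (the inventory data and function name are illustrative, not drawn from any particular source), built-in lists, loops, and conditionals let such a task be written without manual memory or index management:

```python
# Hypothetical inventory tally: built-in tuples and lists hold the data,
# and a for-loop with automatic bounds handling replaces manual indexing.
def total_in_stock(items):
    total = 0
    for name, count in items:   # iteration bounds handled by the language
        if count > 0:           # predefined conditional control structure
            total += count
    return total

inventory = [("bolts", 40), ("nuts", 0), ("washers", 25)]
print(total_in_stock(inventory))  # → 65
```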
Portability is a core trait, enabled by compilers that translate source code into machine-independent intermediate representations or interpreters that execute code directly across diverse hardware platforms without significant modifications. This cross-platform compatibility, exemplified in languages like Java, facilitates deployment on various operating systems and architectures.[12][13]
Support for modularity is achieved through libraries, modules, and import mechanisms that allow reuse of pre-built code, alongside paradigms like object-oriented programming (OOP) in languages such as Python and Java, which use classes, inheritance, and encapsulation to organize code into self-contained units. This fosters collaborative development and maintainability by isolating components and reducing interdependencies.[7][14]
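A short sketch of these OOP mechanisms in Python (the Account classes are hypothetical examples, not a standard API):

```python
# Encapsulation: state lives behind methods rather than being
# manipulated directly. Inheritance: SavingsAccount reuses Account.
class Account:
    def __init__(self, balance=0.0):
        self._balance = balance          # encapsulated state

    def deposit(self, amount):
        self._balance += amount

    def balance(self):
        return self._balance

class SavingsAccount(Account):           # inherits deposit/balance
    def add_interest(self, rate):
        self.deposit(self._balance * rate)

acct = SavingsAccount(100.0)
acct.add_interest(0.05)
print(acct.balance())  # → 105.0
```

The subclass adds behavior without touching the parent's code, which is the isolation of components the paragraph above describes.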
Error handling in high-level languages typically involves structured mechanisms like exceptions, where runtime errors trigger try-catch blocks to gracefully manage issues such as division by zero or file access failures, contrasting with the manual checks required in low-level programming. This approach separates normal execution from error recovery, improving robustness without pervasive debugging code.[15][16]
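For instance, the division-by-zero case can be handled with a try block in Python (the safe_ratio helper is illustrative):

```python
# Normal logic lives in the try block; the except clause recovers
# from one specific failure instead of crashing the program.
def safe_ratio(a, b):
    try:
        return a / b
    except ZeroDivisionError:
        return None                      # graceful recovery

print(safe_ratio(10, 2))  # → 5.0
print(safe_ratio(10, 0))  # → None
```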
Historical Development
Early Innovations
High-level programming languages emerged in the 1950s as a direct response to the limitations of machine code and assembly languages, which required programmers to manage low-level hardware details and resulted in tedious, error-prone coding processes.[9] Early efforts sought to abstract these complexities, allowing developers to express computations more intuitively and portably across machines. This shift was driven by the growing scale of computing applications, particularly in scientific and defense sectors, where manual coding consumed excessive time and resources.[17] A pivotal milestone was the development of FORTRAN (Formula Translation) in 1957 by John Backus and a team at IBM, marking the first widely adopted high-level language designed for scientific computing.[9] FORTRAN introduced the ability to write mathematical expressions directly, such as algebraic formulas, which the compiler would translate into efficient machine code for systems like the IBM 704.[18] This innovation stemmed from Backus's 1954 proposal to drastically reduce the effort required for programming complex numerical problems, addressing the fact that programmers spent up to half their time on debugging assembly code.[9] In 1958, Lisp (LISt Processor), developed by John McCarthy at MIT, became another foundational high-level language, emphasizing symbolic computation and recursion, which laid groundwork for artificial intelligence and functional programming paradigms.[19] Also in 1958, ALGOL (Algorithmic Language) was introduced through an international committee, establishing foundational structured programming concepts like block structures for scoping variables and control flow.[20] ALGOL 58, its initial version, emphasized clarity in algorithm description and influenced subsequent language designs by promoting modular code organization, though it saw limited commercial implementation.[21] This was followed by ALGOL 60 in 1960, which gained wider adoption and further standardized 
structured programming.[22] The 1960s brought further innovations, including BASIC (Beginner's All-purpose Symbolic Instruction Code) in 1964 by John Kemeny and Thomas Kurtz at Dartmouth College, designed for ease of use by non-experts and enabling interactive computing on time-sharing systems.[23] Additionally, Simula, developed between 1962 and 1967 by Ole-Johan Dahl and Kristen Nygaard at the Norwegian Computing Center, introduced class-based object-oriented programming for simulation, influencing later OOP languages.[24] COBOL (Common Business-Oriented Language), released in 1959 under the leadership of Grace Hopper as part of the CODASYL committee, targeted business data processing with an English-like syntax to enhance readability for non-technical users.[25] Hopper's prior work on compilers like FLOW-MATIC informed COBOL's focus on verbose, self-documenting code for applications such as payroll and inventory management. These early languages were motivated by the need to minimize development time and errors in large-scale projects, including military simulations on early computers, where assembly programming proved inadequate for teams of varying expertise.[17]
Modern Evolution
In the 1970s, the C programming language, developed by Dennis Ritchie at Bell Labs between 1969 and 1973 with the most significant advancements occurring in 1972, marked a pivotal advancement by combining high-level abstractions such as structured programming constructs with direct low-level access to hardware, making it ideal for systems programming and the development of the Unix operating system.[26] This design influenced subsequent languages by demonstrating how high-level features could support efficient, portable code without sacrificing control. The 1980s and 1990s saw the emergence of object-oriented programming paradigms in high-level languages, exemplified by Smalltalk-80, released in 1980 by Alan Kay and the team at Xerox PARC, which introduced core concepts like classes, objects, inheritance, and message-passing as a foundational model for encapsulating data and behavior.[27] Building on this, C++, created by Bjarne Stroustrup starting in 1979 and first released in 1985, extended C with object-oriented features including classes, inheritance, and polymorphism, enabling more modular and reusable code for large-scale software development.[28] These innovations shifted focus toward abstraction for managing complexity in growing applications, such as graphical user interfaces and simulations. During the 2000s, scripting languages gained prominence for their emphasis on simplicity and rapid development. 
Python, initiated by Guido van Rossum in late 1989 and first released in 1991, saw widespread adoption in the 2000s due to its readable syntax and extensive libraries, facilitating quick prototyping in areas like web development and data analysis.[29] Similarly, JavaScript, developed by Brendan Eich at Netscape in 1995, became essential for web development by enabling dynamic client-side interactivity through event-driven scripting integrated with HTML.[30] These languages prioritized developer productivity over raw performance, supporting the expansion of the internet and automated scripting tasks. From the 2010s to 2025, high-level languages increasingly incorporated functional programming elements for enhanced safety and expressiveness. Rust, initiated as a Mozilla Research project in 2009 and publicly released in 2010, emphasizes memory and thread safety through ownership and borrowing mechanisms, reducing common errors like data races while maintaining high performance for concurrent systems programming.[31] Julia, developed by Jeff Bezanson, Stefan Karpinski, Viral B. 
Shah, and Alan Edelman and first released in 2012, focuses on high-performance numerical computing with AI and machine learning applications, offering just-in-time compilation and multiple dispatch to bridge the gap between productivity and speed in scientific workflows.[32] Contemporary trends highlight concurrency support, as seen in languages like Go with its goroutines, to handle parallel execution efficiently, and sustainability through code efficiency measures that minimize energy consumption in data centers.[33][34] Advances in hardware, particularly the widespread adoption of multi-core processors since the mid-2000s, have driven language features for parallelism, such as built-in concurrency primitives in languages like Erlang and Scala, enabling high-level abstractions to exploit multiple cores for scalable applications without manual thread management.[35] Execution modes in modern high-level languages have also adapted to cloud computing environments, supporting asynchronous and concurrent execution via frameworks like Python's asyncio.[22]
Abstraction and Design Principles
Levels of Abstraction
High-level programming languages employ hierarchical levels of abstraction to distance programmers from hardware-specific details, enabling focus on problem-solving logic rather than machine intricacies. These layers build upon one another, starting from basic structural simplifications and progressing to specialized conceptual models. The foundational layer, syntactic abstraction, introduces high-level syntax that replaces verbose low-level machine or assembly code with intuitive constructs. For instance, a conditional "if" statement in languages like Fortran or Python encapsulates the equivalent of assembly-level jump instructions (e.g., conditional branches based on flags), allowing programmers to express logic in a readable, English-like form without managing register states or unconditional jumps.[36] This level emerged prominently in early high-level languages from the 1950s, marking a shift from machine-oriented coding to human-centric expression.[36] Building on syntactic simplicity, the semantic abstraction layer conceals operational details of program execution, such as memory allocation and data handling. Automatic garbage collection in languages like Java and Python detects and frees unused memory, eliminating the manual malloc/free calls and pointer arithmetic required in C, which can lead to errors like dangling pointers or leaks. A clear example is file input/output: Python's open('file.txt', 'r').read() provides a straightforward interface that internally manages buffers and streams, contrasting with C's explicit fopen, fread, and pointer-based buffer handling via FILE* structures.
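A brief sketch of that contrast, using only Python's standard library (the file name is arbitrary): the with statement opens the file, buffers I/O, and guarantees the handle is closed, all steps that C code performs explicitly through fopen, fread, and fclose.

```python
import os
import tempfile

# Write and then read a small file; buffering, encoding, and handle
# cleanup are managed by the language runtime, not the programmer.
path = os.path.join(tempfile.gettempdir(), "demo.txt")
with open(path, "w") as f:
    f.write("hello")
with open(path) as f:
    print(f.read())  # → hello
```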
At the highest layer, domain-specific abstraction tailors language features to particular application areas, modeling real-world concepts directly while hiding implementation complexities. SQL, for database management, allows declarative queries like SELECT * FROM users WHERE age > 30 to abstract away indexing strategies, join algorithms, and storage optimization performed by the database engine. These layered abstractions collectively facilitate rapid prototyping and code maintenance by promoting reusability and reducing boilerplate, though they inherently trade off direct hardware control for enhanced productivity.[37]