treelang
) compiler. At the moment this manual is not
incorporated into the main GCC manual as it is incomplete. It
corresponds to the GCC-4.2.1 version of treelang
.
Published by the Free Software Foundation 51 Franklin Street, Fifth Floor Boston, MA 02110-1301 USA
Copyright © 1995,1996,1997,1998,1999,2000,2001,2002,2003,2004,2005 Free Software Foundation, Inc.
Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation; with the Invariant Sections being “GNU General Public License”, the Front-Cover texts being (a) (see below), and with the Back-Cover Texts being (b) (see below). A copy of the license is included in the section entitled “GNU Free Documentation License”.
(a) The FSF's Front-Cover Text is:
A GNU Manual
(b) The FSF's Back-Cover Text is:
You have freedom to copy and modify this GNU Manual, like GNU software. Copies published by the Free Software Foundation raise funds for GNU development.
This manual documents how to run, install and maintain treelang
.
It also documents the features and incompatibilities in the GCC-4.2.1
version of treelang
.
--- The Detailed Node Listing ---
Other Languages
treelang internals
treelang compiler interfaces
treelang main compiler
Reporting Bugs
Copyright © 1989, 1991 Free Software Foundation, Inc. 51 Franklin Street, Fifth Floor, Boston, MA 02110-1301, USA Everyone is permitted to copy and distribute verbatim copies of this license document, but changing it is not allowed.
The licenses for most software are designed to take away your freedom to share and change it. By contrast, the GNU General Public License is intended to guarantee your freedom to share and change free software—to make sure the software is free for all its users. This General Public License applies to most of the Free Software Foundation's software and to any other program whose authors commit to using it. (Some other Free Software Foundation software is covered by the GNU Library General Public License instead.) You can apply it to your programs, too.
When we speak of free software, we are referring to freedom, not price. Our General Public Licenses are designed to make sure that you have the freedom to distribute copies of free software (and charge for this service if you wish), that you receive source code or can get it if you want it, that you can change the software or use pieces of it in new free programs; and that you know you can do these things.
To protect your rights, we need to make restrictions that forbid anyone to deny you these rights or to ask you to surrender the rights. These restrictions translate to certain responsibilities for you if you distribute copies of the software, or if you modify it.
For example, if you distribute copies of such a program, whether gratis or for a fee, you must give the recipients all the rights that you have. You must make sure that they, too, receive or can get the source code. And you must show them these terms so they know their rights.
We protect your rights with two steps: (1) copyright the software, and (2) offer you this license which gives you legal permission to copy, distribute and/or modify the software.
Also, for each author's protection and ours, we want to make certain that everyone understands that there is no warranty for this free software. If the software is modified by someone else and passed on, we want its recipients to know that what they have is not the original, so that any problems introduced by others will not reflect on the original authors' reputations.
Finally, any free program is threatened constantly by software patents. We wish to avoid the danger that redistributors of a free program will individually obtain patent licenses, in effect making the program proprietary. To prevent this, we have made it clear that any patent must be licensed for everyone's free use or not licensed at all.
The precise terms and conditions for copying, distribution and modification follow.
Activities other than copying, distribution and modification are not covered by this License; they are outside its scope. The act of running the Program is not restricted, and the output from the Program is covered only if its contents constitute a work based on the Program (independent of having been made by running the Program). Whether that is true depends on what the Program does.
You may charge a fee for the physical act of transferring a copy, and you may at your option offer warranty protection in exchange for a fee.
These requirements apply to the modified work as a whole. If identifiable sections of that work are not derived from the Program, and can be reasonably considered independent and separate works in themselves, then this License, and its terms, do not apply to those sections when you distribute them as separate works. But when you distribute the same sections as part of a whole which is a work based on the Program, the distribution of the whole must be on the terms of this License, whose permissions for other licensees extend to the entire whole, and thus to each and every part regardless of who wrote it.
Thus, it is not the intent of this section to claim rights or contest your rights to work written entirely by you; rather, the intent is to exercise the right to control the distribution of derivative or collective works based on the Program.
In addition, mere aggregation of another work not based on the Program with the Program (or with a work based on the Program) on a volume of a storage or distribution medium does not bring the other work under the scope of this License.
The source code for a work means the preferred form of the work for making modifications to it. For an executable work, complete source code means all the source code for all modules it contains, plus any associated interface definition files, plus the scripts used to control compilation and installation of the executable. However, as a special exception, the source code distributed need not include anything that is normally distributed (in either source or binary form) with the major components (compiler, kernel, and so on) of the operating system on which the executable runs, unless that component itself accompanies the executable.
If distribution of executable or object code is made by offering access to copy from a designated place, then offering equivalent access to copy the source code from the same place counts as distribution of the source code, even though third parties are not compelled to copy the source along with the object code.
If any portion of this section is held invalid or unenforceable under any particular circumstance, the balance of the section is intended to apply and the section as a whole is intended to apply in other circumstances.
It is not the purpose of this section to induce you to infringe any patents or other property right claims or to contest validity of any such claims; this section has the sole purpose of protecting the integrity of the free software distribution system, which is implemented by public license practices. Many people have made generous contributions to the wide range of software distributed through that system in reliance on consistent application of that system; it is up to the author/donor to decide if he or she is willing to distribute software through any other system and a licensee cannot impose that choice.
This section is intended to make thoroughly clear what is believed to be a consequence of the rest of this License.
Each version is given a distinguishing version number. If the Program specifies a version number of this License which applies to it and “any later version”, you have the option of following the terms and conditions either of that version or of any later version published by the Free Software Foundation. If the Program does not specify a version number of this License, you may choose any version ever published by the Free Software Foundation.
If you develop a new program, and you want it to be of the greatest possible use to the public, the best way to achieve this is to make it free software which everyone can redistribute and change under these terms.
To do so, attach the following notices to the program. It is safest to attach them to the start of each source file to most effectively convey the exclusion of warranty; and each file should have at least the “copyright” line and a pointer to where the full notice is found.
one line to give the program's name and a brief idea of what it does. Copyright (C) year name of author This program is free software; you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation; either version 2 of the License, or (at your option) any later version. This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details. You should have received a copy of the GNU General Public License along with this program; if not, write to the Free Software Foundation, Inc., 51 Franklin Street, Fifth Floor, Boston, MA 02110-1301, USA
Also add information on how to contact you by electronic and paper mail.
If the program is interactive, make it output a short notice like this when it starts in an interactive mode:
Gnomovision version 69, Copyright (C) year name of author Gnomovision comes with ABSOLUTELY NO WARRANTY; for details type `show w'. This is free software, and you are welcome to redistribute it under certain conditions; type `show c' for details.
The hypothetical commands `show w' and `show c' should show the appropriate parts of the General Public License. Of course, the commands you use may be called something other than `show w' and `show c'; they could even be mouse-clicks or menu items—whatever suits your program.
You should also get your employer (if you work as a programmer) or your school, if any, to sign a “copyright disclaimer” for the program, if necessary. Here is a sample; alter the names:
Yoyodyne, Inc., hereby disclaims all copyright interest in the program `Gnomovision' (which makes passes at compilers) written by James Hacker. signature of Ty Coon, 1 April 1989 Ty Coon, President of Vice
This General Public License does not permit incorporating your program into proprietary programs. If your program is a subroutine library, you may consider it more useful to permit linking proprietary applications with the library. If this is what you want to do, use the GNU Library General Public License instead of this License.
Copyright © 2000,2001,2002 Free Software Foundation, Inc. 51 Franklin Street, Fifth Floor, Boston, MA 02110-1301, USA Everyone is permitted to copy and distribute verbatim copies of this license document, but changing it is not allowed.
The purpose of this License is to make a manual, textbook, or other functional and useful document free in the sense of freedom: to assure everyone the effective freedom to copy and redistribute it, with or without modifying it, either commercially or noncommercially. Secondarily, this License preserves for the author and publisher a way to get credit for their work, while not being considered responsible for modifications made by others.
This License is a kind of “copyleft”, which means that derivative works of the document must themselves be free in the same sense. It complements the GNU General Public License, which is a copyleft license designed for free software.
We have designed this License in order to use it for manuals for free software, because free software needs free documentation: a free program should come with manuals providing the same freedoms that the software does. But this License is not limited to software manuals; it can be used for any textual work, regardless of subject matter or whether it is published as a printed book. We recommend this License principally for works whose purpose is instruction or reference.
This License applies to any manual or other work, in any medium, that contains a notice placed by the copyright holder saying it can be distributed under the terms of this License. Such a notice grants a world-wide, royalty-free license, unlimited in duration, to use that work under the conditions stated herein. The “Document”, below, refers to any such manual or work. Any member of the public is a licensee, and is addressed as “you”. You accept the license if you copy, modify or distribute the work in a way requiring permission under copyright law.
A “Modified Version” of the Document means any work containing the Document or a portion of it, either copied verbatim, or with modifications and/or translated into another language.
A “Secondary Section” is a named appendix or a front-matter section of the Document that deals exclusively with the relationship of the publishers or authors of the Document to the Document's overall subject (or to related matters) and contains nothing that could fall directly within that overall subject. (Thus, if the Document is in part a textbook of mathematics, a Secondary Section may not explain any mathematics.) The relationship could be a matter of historical connection with the subject or with related matters, or of legal, commercial, philosophical, ethical or political position regarding them.
The “Invariant Sections” are certain Secondary Sections whose titles are designated, as being those of Invariant Sections, in the notice that says that the Document is released under this License. If a section does not fit the above definition of Secondary then it is not allowed to be designated as Invariant. The Document may contain zero Invariant Sections. If the Document does not identify any Invariant Sections then there are none.
The “Cover Texts” are certain short passages of text that are listed, as Front-Cover Texts or Back-Cover Texts, in the notice that says that the Document is released under this License. A Front-Cover Text may be at most 5 words, and a Back-Cover Text may be at most 25 words.
A “Transparent” copy of the Document means a machine-readable copy, represented in a format whose specification is available to the general public, that is suitable for revising the document straightforwardly with generic text editors or (for images composed of pixels) generic paint programs or (for drawings) some widely available drawing editor, and that is suitable for input to text formatters or for automatic translation to a variety of formats suitable for input to text formatters. A copy made in an otherwise Transparent file format whose markup, or absence of markup, has been arranged to thwart or discourage subsequent modification by readers is not Transparent. An image format is not Transparent if used for any substantial amount of text. A copy that is not “Transparent” is called “Opaque”.
Examples of suitable formats for Transparent copies include plain ascii without markup, Texinfo input format, LaTeX input format, SGML or XML using a publicly available DTD, and standard-conforming simple HTML, PostScript or PDF designed for human modification. Examples of transparent image formats include PNG, XCF and JPG. Opaque formats include proprietary formats that can be read and edited only by proprietary word processors, SGML or XML for which the DTD and/or processing tools are not generally available, and the machine-generated HTML, PostScript or PDF produced by some word processors for output purposes only.
The “Title Page” means, for a printed book, the title page itself, plus such following pages as are needed to hold, legibly, the material this License requires to appear in the title page. For works in formats which do not have any title page as such, “Title Page” means the text near the most prominent appearance of the work's title, preceding the beginning of the body of the text.
A section “Entitled XYZ” means a named subunit of the Document whose title either is precisely XYZ or contains XYZ in parentheses following text that translates XYZ in another language. (Here XYZ stands for a specific section name mentioned below, such as “Acknowledgements”, “Dedications”, “Endorsements”, or “History”.) To “Preserve the Title” of such a section when you modify the Document means that it remains a section “Entitled XYZ” according to this definition.
The Document may include Warranty Disclaimers next to the notice which states that this License applies to the Document. These Warranty Disclaimers are considered to be included by reference in this License, but only as regards disclaiming warranties: any other implication that these Warranty Disclaimers may have is void and has no effect on the meaning of this License.
You may copy and distribute the Document in any medium, either commercially or noncommercially, provided that this License, the copyright notices, and the license notice saying this License applies to the Document are reproduced in all copies, and that you add no other conditions whatsoever to those of this License. You may not use technical measures to obstruct or control the reading or further copying of the copies you make or distribute. However, you may accept compensation in exchange for copies. If you distribute a large enough number of copies you must also follow the conditions in section 3.
You may also lend copies, under the same conditions stated above, and you may publicly display copies.
If you publish printed copies (or copies in media that commonly have printed covers) of the Document, numbering more than 100, and the Document's license notice requires Cover Texts, you must enclose the copies in covers that carry, clearly and legibly, all these Cover Texts: Front-Cover Texts on the front cover, and Back-Cover Texts on the back cover. Both covers must also clearly and legibly identify you as the publisher of these copies. The front cover must present the full title with all words of the title equally prominent and visible. You may add other material on the covers in addition. Copying with changes limited to the covers, as long as they preserve the title of the Document and satisfy these conditions, can be treated as verbatim copying in other respects.
If the required texts for either cover are too voluminous to fit legibly, you should put the first ones listed (as many as fit reasonably) on the actual cover, and continue the rest onto adjacent pages.
If you publish or distribute Opaque copies of the Document numbering more than 100, you must either include a machine-readable Transparent copy along with each Opaque copy, or state in or with each Opaque copy a computer-network location from which the general network-using public has access to download using public-standard network protocols a complete Transparent copy of the Document, free of added material. If you use the latter option, you must take reasonably prudent steps, when you begin distribution of Opaque copies in quantity, to ensure that this Transparent copy will remain thus accessible at the stated location until at least one year after the last time you distribute an Opaque copy (directly or through your agents or retailers) of that edition to the public.
It is requested, but not required, that you contact the authors of the Document well before redistributing any large number of copies, to give them a chance to provide you with an updated version of the Document.
You may copy and distribute a Modified Version of the Document under the conditions of sections 2 and 3 above, provided that you release the Modified Version under precisely this License, with the Modified Version filling the role of the Document, thus licensing distribution and modification of the Modified Version to whoever possesses a copy of it. In addition, you must do these things in the Modified Version:
If the Modified Version includes new front-matter sections or appendices that qualify as Secondary Sections and contain no material copied from the Document, you may at your option designate some or all of these sections as invariant. To do this, add their titles to the list of Invariant Sections in the Modified Version's license notice. These titles must be distinct from any other section titles.
You may add a section Entitled “Endorsements”, provided it contains nothing but endorsements of your Modified Version by various parties—for example, statements of peer review or that the text has been approved by an organization as the authoritative definition of a standard.
You may add a passage of up to five words as a Front-Cover Text, and a passage of up to 25 words as a Back-Cover Text, to the end of the list of Cover Texts in the Modified Version. Only one passage of Front-Cover Text and one of Back-Cover Text may be added by (or through arrangements made by) any one entity. If the Document already includes a cover text for the same cover, previously added by you or by arrangement made by the same entity you are acting on behalf of, you may not add another; but you may replace the old one, on explicit permission from the previous publisher that added the old one.
The author(s) and publisher(s) of the Document do not by this License give permission to use their names for publicity for or to assert or imply endorsement of any Modified Version.
You may combine the Document with other documents released under this License, under the terms defined in section 4 above for modified versions, provided that you include in the combination all of the Invariant Sections of all of the original documents, unmodified, and list them all as Invariant Sections of your combined work in its license notice, and that you preserve all their Warranty Disclaimers.
The combined work need only contain one copy of this License, and multiple identical Invariant Sections may be replaced with a single copy. If there are multiple Invariant Sections with the same name but different contents, make the title of each such section unique by adding at the end of it, in parentheses, the name of the original author or publisher of that section if known, or else a unique number. Make the same adjustment to the section titles in the list of Invariant Sections in the license notice of the combined work.
In the combination, you must combine any sections Entitled “History” in the various original documents, forming one section Entitled “History”; likewise combine any sections Entitled “Acknowledgements”, and any sections Entitled “Dedications”. You must delete all sections Entitled “Endorsements.”
You may make a collection consisting of the Document and other documents released under this License, and replace the individual copies of this License in the various documents with a single copy that is included in the collection, provided that you follow the rules of this License for verbatim copying of each of the documents in all other respects.
You may extract a single document from such a collection, and distribute it individually under this License, provided you insert a copy of this License into the extracted document, and follow this License in all other respects regarding verbatim copying of that document.
A compilation of the Document or its derivatives with other separate and independent documents or works, in or on a volume of a storage or distribution medium, is called an “aggregate” if the copyright resulting from the compilation is not used to limit the legal rights of the compilation's users beyond what the individual works permit. When the Document is included in an aggregate, this License does not apply to the other works in the aggregate which are not themselves derivative works of the Document.
If the Cover Text requirement of section 3 is applicable to these copies of the Document, then if the Document is less than one half of the entire aggregate, the Document's Cover Texts may be placed on covers that bracket the Document within the aggregate, or the electronic equivalent of covers if the Document is in electronic form. Otherwise they must appear on printed covers that bracket the whole aggregate.
Translation is considered a kind of modification, so you may distribute translations of the Document under the terms of section 4. Replacing Invariant Sections with translations requires special permission from their copyright holders, but you may include translations of some or all Invariant Sections in addition to the original versions of these Invariant Sections. You may include a translation of this License, and all the license notices in the Document, and any Warranty Disclaimers, provided that you also include the original English version of this License and the original versions of those notices and disclaimers. In case of a disagreement between the translation and the original version of this License or a notice or disclaimer, the original version will prevail.
If a section in the Document is Entitled “Acknowledgements”, “Dedications”, or “History”, the requirement (section 4) to Preserve its Title (section 1) will typically require changing the actual title.
You may not copy, modify, sublicense, or distribute the Document except as expressly provided for under this License. Any other attempt to copy, modify, sublicense or distribute the Document is void, and will automatically terminate your rights under this License. However, parties who have received copies, or rights, from you under this License will not have their licenses terminated so long as such parties remain in full compliance.
The Free Software Foundation may publish new, revised versions of the GNU Free Documentation License from time to time. Such new versions will be similar in spirit to the present version, but may differ in detail to address new problems or concerns. See http://www.gnu.org/copyleft/.
Each version of the License is given a distinguishing version number. If the Document specifies that a particular numbered version of this License “or any later version” applies to it, you have the option of following the terms and conditions either of that specified version or of any later version that has been published (not as a draft) by the Free Software Foundation. If the Document does not specify a version number of this License, you may choose any version ever published (not as a draft) by the Free Software Foundation.
To use this License in a document you have written, include a copy of the License in the document and put the following copyright and license notices just after the title page:
Copyright (C) year your name. Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation; with no Invariant Sections, no Front-Cover Texts, and no Back-Cover Texts. A copy of the license is included in the section entitled ``GNU Free Documentation License''.
If you have Invariant Sections, Front-Cover Texts and Back-Cover Texts, replace the “with...Texts.” line with this:
with the Invariant Sections being list their titles, with the Front-Cover Texts being list, and with the Back-Cover Texts being list.
If you have Invariant Sections without Cover Texts, or some other combination of the three, merge those two alternatives to suit the situation.
If your document contains nontrivial examples of program code, we recommend releasing these examples in parallel under your choice of free software license, such as the GNU General Public License, to permit their use in free software.
Treelang was based on 'toy' by Richard Kenner, and also uses code from the GCC core code tree. Tim Josling first created the language and documentation, based on the GCC Fortran compiler's documentation framework. Treelang was updated to use the TreeSSA infrastructure by James A. Morrison.
If you want to have more free software a few years from now, it makes sense for you to help encourage people to contribute funds for its development. The most effective approach known is to encourage commercial redistributors to donate.
Users of free software systems can boost the pace of development by encouraging for-a-fee distributors to donate part of their selling price to free software developers—the Free Software Foundation, and others.
The way to convince distributors to do this is to demand it and expect it from them. So when you compare distributors, judge them partly by how much they give to free software development. Show distributors they must compete to be the one who gives the most.
To make this approach work, you must insist on numbers that you can compare, such as, “We will donate ten dollars to the Frobnitz project for each disk sold.” Don't be satisfied with a vague promise, such as “A portion of the profits are donated,” since it doesn't give a basis for comparison.
Even a precise fraction “of the profits from this disk” is not very meaningful, since creative accounting and unrelated business decisions can greatly alter what fraction of the sales price counts as profit. If the price you pay is $50, ten percent of the profit is probably less than a dollar; it might be a few cents, or nothing at all.
Some redistributors do development work themselves. This is useful too; but to keep everyone honest, you need to inquire how much they do, and what kind. Some kinds of development make much more long-term difference than others. For example, maintaining a separate version of a program contributes very little; maintaining the standard version of a program for the whole community contributes much. Easy new ports contribute little, since someone else would surely do them; difficult ports such as adding a new CPU to the GNU Compiler Collection contribute more; major new features or packages contribute the most.
By establishing the idea that supporting further development is “the proper thing to do” when distributing free software for a fee, we can assure a steady flow of resources into making more free software.
Copyright © 1994 Free Software Foundation, Inc. Verbatim copying and redistribution of this section is permitted without royalty; alteration is not permitted.
Treelang is a sample language, useful only to help people understand how to implement a new language front end to GCC. It is not a useful language in itself other than as an example or basis for building a new language. Therefore only language developers are likely to have an interest in it.
This manual assumes familiarity with GCC, which you can obtain by using it and by reading the manuals `Using the GNU Compiler Collection (GCC)' and `GNU Compiler Collection (GCC) Internals'.
To install treelang, follow the GCC installation instructions, taking care to ensure you specify treelang in the configure step by adding treelang to the list of languages specified by --enable-languages, e.g. `--enable-languages=all,treelang'.
If you're generally curious about the future of
treelang
, see Projects.
If you're curious about its past,
see Contributors.
To see a few of the questions maintainers of treelang
have,
and that you might be able to answer,
see Open Questions.
GNU Treelang, or treelang
, is designed initially as a free
replacement for, or alternative to, the 'toy' language, but which is
amenable to inclusion within the GCC source tree.
treelang
is largely a cut down version of C, designed to showcase
the features of the GCC code generation back end. Only those features
that are directly supported by the GCC code generation back end are
implemented. Features are implemented in a manner which is easiest and
clearest to implement. Not all or even most code generation back end
features are implemented. The intention is to add features incrementally
until most features of the GCC back end are implemented in treelang.
The main features missing are structures, arrays and pointers.
A sample program follows:
// function prototypes // function 'add' taking two ints and returning an int external_definition int add(int arg1, int arg2); external_definition int subtract(int arg3, int arg4); external_definition int first_nonzero(int arg5, int arg6); external_definition int double_plus_one(int arg7); // function definition add { // return the sum of arg1 and arg2 return arg1 + arg2; } subtract { return arg3 - arg4; } double_plus_one { // aaa is a variable, of type integer and allocated at the start of // the function automatic int aaa; // set aaa to the value returned from add, when passed arg7 and arg7 as // the two parameters aaa=add(arg7, arg7); aaa=add(aaa, aaa); aaa=subtract(subtract(aaa, arg7), arg7) + 1; return aaa; } first_nonzero { // C-like if statement if (arg5) { return arg5; } else { } return arg6; }
Treelang programs consist of whitespace, comments, keywords and names.
A B
x = 1; // Set X to 1
a A _a a_ IF_X
are all different names.
Declarations are built up from the lexical elements described above. A file may contain one of more declarations.
static int add (int a, int b)
Example:
int temp1 = 1;
A variable declaration can be outside a function, or at the start of a function.
This defines the scope, duration and visibility of a function or variable
This defines the data type of a variable or the return type of a function.
The variable declarations must not have initializations.
Values without a unary plus or minus are considered to be unsigned.
e.g. 1 +2 -3
A function consists of the function name then the declarations (if any) and statements (if any) within one pair of braces.
The details of the function arguments come from the function prototype. The function prototype must precede the function declaration in the file.
The first lot of statements is executed if the expression is nonzero. Otherwise the second lot of statements is executed. Either list of statements may be empty, but both sets of braces and the else must be present.
if (a==b)
{
// nothing
}
else
{
a=b;
}
The expression is executed, including any side effects.
Returns from the function. If the function is void, the expression must be absent, and if the function is not void the expression must be present.
An expression can be a constant or a variable reference or a function_call. Expressions can be combined as a sum of two expressions or the difference of two expressions, or an equality test of two expressions. An assignment is also an expression. Expressions and operator precedence work as in C.
This invokes the function, passing to it the values of the expressions as actual parameters.
treelang is run as part of the GCC compiler.
gdb
).
ld
command. However, the gcc
command, as with most
compiler commands, automatically performs the linking step by calling on
ld
directly, unless asked to not do so by the user.)
The compiler stops after the first error. There are no plans to fix this, ever, as it would vastly complicate the implementation of treelang to little or no benefit.
(Sometimes an incorrect usage of the language leads to a situation where the compiler can not make any sense of what it reads—while a human might be able to—and thus ends up complaining about an incorrect “problem” it encounters that, in fact, reflects a misunderstanding of the programmer's intention.)
treelang
consists of several components:
gcc
command, which also might be
installed as the system's cc
command.
(In many cases, cc
refers to the
system's “native” C compiler, which
might be a non-GNU compiler, or an older version
of GCC
considered more stable or that is
used to build the operating system kernel.)
treelang
command itself.
libc
run-time library. This library contains the machine
code needed to support capabilities of the Treelang language that are
not directly provided by the machine code generated by the
treelang
compilation phase. This is the same library that the
main C compiler uses (libc).
tree1
.
Note that tree1
does not generate machine code directly—it
generates assembly code that is a more readable form
of machine code, leaving the conversion to actual machine code
to an assembler, usually named as
.
GCC
is often thought of as “the C compiler” only,
but it does more than that.
Based on command-line options and the names given for files
on the command line, gcc
determines which actions to perform, including
preprocessing, compiling (in a variety of possible languages), assembling,
and linking.
For example, the command `gcc foo.c' drives the file
foo.c through the preprocessor cpp
, then
the C compiler (internally named
cc1
), then the assembler (usually as
), then the linker
(ld
), producing an executable program named a.out (on
UNIX systems).
As another example, the command `gcc foo.tree' would do much the
same as `gcc foo.c', but instead of using the C compiler named
cc1
, gcc
would use the treelang compiler (named
tree1
). However there is no preprocessor for treelang.
In a GNU Treelang installation, gcc
recognizes Treelang source
files by name just like it does C and C++ source files. It knows to use
the Treelang compiler named tree1
, instead of cc1
or
cc1plus
, to compile Treelang files. If a file's name ends in
.tree
then GCC knows that the program is written in treelang. You
can also manually override the language.
Non-Treelang-related operation of gcc
is generally
unaffected by installing the GNU Treelang version of gcc
.
However, without the installed version of gcc
being the
GNU Treelang version, gcc
will not be able to compile
and link Treelang programs.
The command `gcc -v x.tree' where `x.tree' is a file which must exist but whose contents are ignored, is a quick way to display version information for the various programs used to compile a typical Treelang source file.
The tree1
program represents most of what is unique to GNU
Treelang; tree1
is a combination of two rather large chunks of
code.
One chunk is the so-called GNU Back End, or GBE,
which knows how to generate fast code for a wide variety of processors.
The same GBE is used by the C, C++, and Treelang compiler programs cc1
,
cc1plus
, and tree1
, plus others.
Often the GBE is referred to as the “GCC back end” or
even just “GCC”—in this manual, the term GBE is used
whenever the distinction is important.
The other chunk of tree1
is the majority of what is unique about
GNU Treelang—the code that knows how to interpret Treelang programs to
determine what they are intending to do, and then communicate that
knowledge to the GBE for actual compilation of those programs. This
chunk is called the Treelang Front End (TFE). The cc1
and
cc1plus
programs have their own front ends, for the C and C++
languages, respectively. These fronts ends are responsible for
diagnosing incorrect usage of their respective languages by the programs
the process, and are responsible for most of the warnings about
questionable constructs as well. (The GBE in principle handles
producing some warnings, like those concerning possible references to
undefined variables, but these warnings should not occur in treelang
programs as the front end is meant to pick them up first).
Because so much is shared among the compilers for various languages, much of the behavior and many of the user-selectable options for these compilers are similar. For example, diagnostics (error messages and warnings) are similar in appearance; command-line options like `-Wall' have generally similar effects; and the quality of generated code (in terms of speed and size) is roughly similar (since that work is done by the shared GBE).
A GNU Treelang installation includes a modified version of the gcc
command.
In a non-Treelang installation, gcc
recognizes C, C++,
and Objective-C source files.
In a GNU Treelang installation, gcc
also recognizes Treelang source
files and accepts Treelang-specific command-line options, plus some
command-line options that are designed to cater to Treelang users
but apply to other languages as well.
See Programming Languages Supported by GCC,
for information on the way different languages are handled
by the GCC compiler (gcc
).
You can use this, combined with the output of the `gcc -v x.tree' command to get the options applicable to treelang. Treelang programs must end with the suffix `.tree'.
Treelang programs are not by default run through the C
preprocessor by gcc
. There is no reason why they cannot be run through the
preprocessor manually, but you would need to prevent the preprocessor
from generating #line directives, using the `-P' option, otherwise
tree1 will not accept the input.
The GNU Treelang compiler, treelang
, supports programs written
in the GNU Treelang language.
The output of treelang programs looks like C program code to the linker and everybody else, so you should be able to freely mix treelang and C (and C++) code, with one proviso.
C promotes small integer types to 'int' when used as function parameters and return values in non-prototyped functions. Since treelang has no non-prototyped functions, the treelang compiler does not do this.
To create a compiler that integrates into GCC, you need create many files. Some of the files are integrated into the main GCC makefile, to build the various parts of the compiler and to run the test suite. Others are incorporated into various GCC programs such as gcc.c. Finally you must provide the actual programs comprising your compiler.
This need not be present if the language is incorporated into the main GCC tree, as the main GCC directory has this file.
This need not be present if the language is incorporated into the main GCC tree, as the main GCC directory has this file.
Makefile in turn is the main instruction to actually build everything. The build instructions are held in the main GCC manual and web site so they are not repeated here.
There are some comments at the top which will help you understand what you need to do.
There are make commands to build things, remove generated files with various degrees of thoroughness, count the lines of code (so you know how much progress you are making), build info and html files from the texinfo source, run the tests etc.
The GCC compiler consists of a driver, which then executes the various compiler phases based on the instructions in the specs files.
Typically a program's language will be identified from its suffix (e.g., .tree) for treelang programs.
The driver (gcc.c) will then drive (exec) in turn a preprocessor, the main compiler, the assembler and the link editor. Options to GCC allow you to override all of this. In the case of treelang programs there is no preprocessor, and mostly these days the C preprocessor is run within the main C compiler rather than as a separate process, apparently for reasons of speed.
You will be using the standard assembler and linkage editor so these are ignored from now on.
You have to write your own preprocessor if you want one. This is usually totally language specific. The main point to be aware of is to ensure that you find some way to pass file name and line number information through to the main compiler so that it can tell the back end this information and so the debugger can find the right source line for each piece of code. That is all there is to say about the preprocessor except that the preprocessor will probably not be the slowest part of the compiler and will probably not use the most memory so don't waste too much time tuning it until you know you need to do so.
The main compiler for treelang consists of toplev.c from the main GCC compiler, the parser, lexer and back end interface routines, and the back end routines themselves, of which there are many.
toplev.c does a lot of work for you and you should almost certainly use it.
Writing this code is the hard part of creating a compiler using GCC. The back end interface documentation is incomplete and the interface is complex.
There are three main aspects to interfacing to the other GCC code.
In treelang this is handled mainly in tree1.c and partly in treetree.c. Peruse toplev.c for details of what you need to do.
Interfacing to the garbage collection. In treelang this is mainly in tree1.c.
Memory allocation in the compiler should be done using the ggc_alloc and kindred routines in ggc*.*. At the end of every 'function' in your language, toplev.c calls the garbage collection several times. The garbage collection calls mark routines which go through the memory which is still used, telling the garbage collection not to free it. Then all the memory not used is freed.
What this means is that you need a way to hook into this marking process. This is done by calling ggc_add_root. This provides the address of a callback routine which will be called duing garbage collection and which can call ggc_mark to save the storage. If storage is only used within the parsing of a function, you do not need to provide a way to mark it.
Note that you can also call ggc_mark_tree to mark any of the back end internal 'tree' nodes. This routine will follow the branches of the trees and mark all the subordinate structures. This is useful for example when you have created a variable declaration that will be used across multiple functions, or for a function declaration (from a prototype) that may be used later on. See the next item for more on the tree nodes.
In treelang this is done in treetree.c. A typedef called 'tree' which is defined in tree.h and tree.def in the GCC directory and largely implemented in tree.c and stmt.c forms the basic interface to the compiler back end.
In general you call various tree routines to generate code, either directly or through toplev.c. You build up data structures and expressions in similar ways.
You can read some documentation on this which can be found via the GCC main web page. In particular, the documentation produced by Joachim Nadler and translated by Tim Josling can be quite useful. the C compiler also has documentation in the main GCC manual (particularly the current CVS version) which is useful on a lot of the details.
In time it is hoped to enhance this document to provide a more comprehensive overview of this topic. The main gap is in explaining how it all works together.
If you know GCC well, please consider looking at the file treetree.c and resolving any questions marked "???".
You can report bugs to gcc-bugs@gcc.gnu.org or bug-gcc@gnu.org. Please make sure bugs are real before reporting them. Follow the guidelines in the main GCC manual for submitting bug reports.
If you would like to write bug fixes or improvements for the GNU Treelang compiler, that is very helpful. Send suggested fixes to gcc-patches@gcc.gnu.org.
If you need help installing, using or changing GNU Treelang, there are two ways to find it:
If you want to contribute to treelang
by doing research,
design, specification, documentation, coding, or testing,
the following information should give you some ideas.
Send a message to gcc@gcc.gnu.org if you plan to add a feature.
The main requirement for treelang is to add features and to add documentation. Features are things that the GCC back end can do but which are not reflected in treelang. Examples include structures, unions, pointers, arrays.
as
command: Compiler Overviewas
: Compiler Overviewgcc
: TREELANG and GCCgcc
: Compiler Overviewgdb
: Compiler Overviewld
: Compiler Overviewtreelang
: Compiler Overviewtreelang
: Compiler OverviewGCC
, back end: Compiler Overviewgcc
, command: TREELANG and GCCgcc
, command: Compiler Overviewgcc
, command as driver: Compiler Overviewgcc
, not recognizing Treelang source: Compiler Overviewgdb
, command: Compiler Overviewld
command: Compiler Overviewtree1
: Compiler Overviewtree1
, program: Compiler Overviewtreelang
, command: Compiler Overviewtreelang
, components of: Compiler Overviewtreelang
, front end: Compiler Overview