gforth/doc/vmgen.texi - view

File: [gforth] / gforth / doc / vmgen.texi
Revision 1.28: download - view: text, annotated - select for diffs
Wed Jan 26 21:24:15 2005 UTC (19 years, 3 months ago) by anton
Branches: MAIN
CVS tags: HEAD

EXECUTE and friends are now relocatable again even with the PR15242 workaround.

1: \input texinfo @c -*-texinfo-*- 2: @comment %**start of header 3: @setfilename vmgen.info 4: @include version.texi 5: @settitle Vmgen (Gforth @value{VERSION}) 6: @c @syncodeindex pg cp 7: @comment %**end of header 8: @copying 9: This manual is for Vmgen 10: (version @value{VERSION}, @value{UPDATED}), 11: the virtual machine interpreter generator 12: 13: Copyright @copyright{} 2002,2003,2005 Free Software Foundation, Inc. 14: 15: @quotation 16: Permission is granted to copy, distribute and/or modify this document 17: under the terms of the GNU Free Documentation License, Version 1.1 or 18: any later version published by the Free Software Foundation; with no 19: Invariant Sections, with the Front-Cover texts being ``A GNU Manual,'' 20: and with the Back-Cover Texts as in (a) below. A copy of the 21: license is included in the section entitled ``GNU Free Documentation 22: License.'' 23: 24: (a) The FSF's Back-Cover Text is: ``You have freedom to copy and modify 25: this GNU Manual, like GNU software. Copies published by the Free 26: Software Foundation raise funds for GNU development.'' 27: @end quotation 28: @end copying 29: 30: @dircategory Software development 31: @direntry 32: * Vmgen: (vmgen). Virtual machine interpreter generator 33: @end direntry 34: 35: @titlepage 36: @title Vmgen 37: @subtitle for Gforth version @value{VERSION}, @value{UPDATED} 38: @author M. Anton Ertl (@email{anton@@mips.complang.tuwien.ac.at}) 39: @page 40: @vskip 0pt plus 1filll 41: @insertcopying 42: @end titlepage 43: 44: @contents 45: 46: @ifnottex 47: @node Top, Introduction, (dir), (dir) 48: @top Vmgen 49: 50: @insertcopying 51: @end ifnottex 52: 53: @menu 54: * Introduction:: What can Vmgen do for you? 55: * Why interpreters?:: Advantages and disadvantages 56: * Concepts:: VM interpreter background 57: * Invoking Vmgen:: 58: * Example:: 59: * Input File Format:: 60: * Error messages:: reported by Vmgen 61: * Using the generated code:: 62: * Hints:: VM archictecture, efficiency 63: * The future:: 64: * Changes:: from earlier versions 65: * Contact:: Bug reporting etc. 66: * Copying This Manual:: Manual License 67: * Index:: 68: 69: @detailmenu 70: --- The Detailed Node Listing --- 71: 72: Concepts 73: 74: * Front end and VM interpreter:: Modularizing an interpretive system 75: * Data handling:: Stacks, registers, immediate arguments 76: * Dispatch:: From one VM instruction to the next 77: 78: Example 79: 80: * Example overview:: 81: * Using profiling to create superinstructions:: 82: 83: Input File Format 84: 85: * Input File Grammar:: 86: * Simple instructions:: 87: * Superinstructions:: 88: * Store Optimization:: 89: * Register Machines:: How to define register VM instructions 90: 91: Input File Grammar 92: 93: * Eval escapes:: what follows \E 94: 95: Simple instructions 96: 97: * C Code Macros:: Macros recognized by Vmgen 98: * C Code restrictions:: Vmgen makes assumptions about C code 99: * Stack growth direction:: is configurable per stack 100: 101: Using the generated code 102: 103: * VM engine:: Executing VM code 104: * VM instruction table:: 105: * VM code generation:: Creating VM code (in the front-end) 106: * Peephole optimization:: Creating VM superinstructions 107: * VM disassembler:: for debugging the front end 108: * VM profiler:: for finding worthwhile superinstructions 109: 110: Hints 111: 112: * Floating point:: and stacks 113: 114: Copying This Manual 115: 116: * GNU Free Documentation License:: License for copying this manual. 117: 118: @end detailmenu 119: @end menu 120: 121: @c @ifnottex 122: @c This file documents Vmgen (Gforth @value{VERSION}). 123: 124: @c ************************************************************ 125: @node Introduction, Why interpreters?, Top, Top 126: @chapter Introduction 127: 128: Vmgen is a tool for writing efficient interpreters. It takes a simple 129: virtual machine description and generates efficient C code for dealing 130: with the virtual machine code in various ways (in particular, executing 131: it). The run-time efficiency of the resulting interpreters is usually 132: within a factor of 10 of machine code produced by an optimizing 133: compiler. 134: 135: The interpreter design strategy supported by Vmgen is to divide the 136: interpreter into two parts: 137: 138: @itemize @bullet 139: 140: @item The @emph{front end} takes the source code of the language to be 141: implemented, and translates it into virtual machine code. This is 142: similar to an ordinary compiler front end; typically an interpreter 143: front-end performs no optimization, so it is relatively simple to 144: implement and runs fast. 145: 146: @item The @emph{virtual machine interpreter} executes the virtual 147: machine code. 148: 149: @end itemize 150: 151: Such a division is usually used in interpreters, for modularity as well 152: as for efficiency. The virtual machine code is typically passed between 153: front end and virtual machine interpreter in memory, like in a 154: load-and-go compiler; this avoids the complexity and time cost of 155: writing the code to a file and reading it again. 156: 157: A @emph{virtual machine} (VM) represents the program as a sequence of 158: @emph{VM instructions}, following each other in memory, similar to real 159: machine code. Control flow occurs through VM branch instructions, like 160: in a real machine. 161: 162: @cindex functionality features overview 163: In this setup, Vmgen can generate most of the code dealing with virtual 164: machine instructions from a simple description of the virtual machine 165: instructions (@pxref{Input File Format}), in particular: 166: 167: @table @strong 168: 169: @item VM instruction execution 170: 171: @item VM code generation 172: Useful in the front end. 173: 174: @item VM code decompiler 175: Useful for debugging the front end. 176: 177: @item VM code tracing 178: Useful for debugging the front end and the VM interpreter. You will 179: typically provide other means for debugging the user's programs at the 180: source level. 181: 182: @item VM code profiling 183: Useful for optimizing the VM interpreter with superinstructions 184: (@pxref{VM profiler}). 185: 186: @end table 187: 188: To create parts of the interpretive system that do not deal with VM 189: instructions, you have to use other tools (e.g., @command{bison}) and/or 190: hand-code them. 191: 192: @cindex efficiency features overview 193: @noindent 194: Vmgen supports efficient interpreters though various optimizations, in 195: particular 196: 197: @itemize @bullet 198: 199: @item Threaded code 200: 201: @item Caching the top-of-stack in a register 202: 203: @item Combining VM instructions into superinstructions 204: 205: @item 206: Replicating VM (super)instructions for better BTB prediction accuracy 207: (not yet in vmgen-ex, but already in Gforth). 208: 209: @end itemize 210: 211: @cindex speed for JVM 212: As a result, Vmgen-based interpreters are only about an order of 213: magnitude slower than native code from an optimizing C compiler on small 214: benchmarks; on large benchmarks, which spend more time in the run-time 215: system, the slowdown is often less (e.g., the slowdown of a 216: Vmgen-generated JVM interpreter over the best JVM JIT compiler we 217: measured is only a factor of 2-3 for large benchmarks; some other JITs 218: and all other interpreters we looked at were slower than our 219: interpreter). 220: 221: VMs are usually designed as stack machines (passing data between VM 222: instructions on a stack), and Vmgen supports such designs especially 223: well; however, you can also use Vmgen for implementing a register VM 224: (@pxref{Register Machines}) and still benefit from most of the advantages 225: offered by Vmgen. 226: 227: There are many potential uses of the instruction descriptions that are 228: not implemented at the moment, but we are open for feature requests, and 229: we will consider new features if someone asks for them; so the feature 230: list above is not exhaustive. 231: 232: @c ********************************************************************* 233: @node Why interpreters?, Concepts, Introduction, Top 234: @chapter Why interpreters? 235: @cindex interpreters, advantages 236: @cindex advantages of interpreters 237: @cindex advantages of vmgen 238: 239: Interpreters are a popular language implementation technique because 240: they combine all three of the following advantages: 241: 242: @itemize @bullet 243: 244: @item Ease of implementation 245: 246: @item Portability 247: 248: @item Fast edit-compile-run cycle 249: 250: @end itemize 251: 252: Vmgen makes it even easier to implement interpreters. 253: 254: @cindex speed of interpreters 255: The main disadvantage of interpreters is their run-time speed. However, 256: there are huge differences between different interpreters in this area: 257: the slowdown over optimized C code on programs consisting of simple 258: operations is typically a factor of 10 for the more efficient 259: interpreters, and a factor of 1000 for the less efficient ones (the 260: slowdown for programs executing complex operations is less, because the 261: time spent in libraries for executing complex operations is the same in 262: all implementation strategies). 263: 264: Vmgen supports techniques for building efficient interpreters. 265: 266: @c ******************************************************************** 267: @node Concepts, Invoking Vmgen, Why interpreters?, Top 268: @chapter Concepts 269: 270: @menu 271: * Front end and VM interpreter:: Modularizing an interpretive system 272: * Data handling:: Stacks, registers, immediate arguments 273: * Dispatch:: From one VM instruction to the next 274: @end menu 275: 276: @c -------------------------------------------------------------------- 277: @node Front end and VM interpreter, Data handling, Concepts, Concepts 278: @section Front end and VM interpreter 279: @cindex modularization of interpreters 280: 281: @cindex front-end 282: Interpretive systems are typically divided into a @emph{front end} that 283: parses the input language and produces an intermediate representation 284: for the program, and an interpreter that executes the intermediate 285: representation of the program. 286: 287: @cindex virtual machine 288: @cindex VM 289: @cindex VM instruction 290: @cindex instruction, VM 291: @cindex VM branch instruction 292: @cindex branch instruction, VM 293: @cindex VM register 294: @cindex register, VM 295: @cindex opcode, VM instruction 296: @cindex immediate argument, VM instruction 297: For efficient interpreters the intermediate representation of choice is 298: virtual machine code (rather than, e.g., an abstract syntax tree). 299: @emph{Virtual machine} (VM) code consists of VM instructions arranged 300: sequentially in memory; they are executed in sequence by the VM 301: interpreter, but VM branch instructions can change the control flow and 302: are used for implementing control structures. The conceptual similarity 303: to real machine code results in the name @emph{virtual machine}. 304: Various terms similar to terms for real machines are used; e.g., there 305: are @emph{VM registers} (like the instruction pointer and stack 306: pointer(s)), and the VM instruction consists of an @emph{opcode} and 307: @emph{immediate arguments}. 308: 309: In this framework, Vmgen supports building the VM interpreter and any 310: other component dealing with VM instructions. It does not have any 311: support for the front end, apart from VM code generation support. The 312: front end can be implemented with classical compiler front-end 313: techniques, supported by tools like @command{flex} and @command{bison}. 314: 315: The intermediate representation is usually just internal to the 316: interpreter, but some systems also support saving it to a file, either 317: as an image file, or in a full-blown linkable file format (e.g., JVM). 318: Vmgen currently has no special support for such features, but the 319: information in the instruction descriptions can be helpful, and we are 320: open to feature requests and suggestions. 321: 322: @c -------------------------------------------------------------------- 323: @node Data handling, Dispatch, Front end and VM interpreter, Concepts 324: @section Data handling 325: 326: @cindex stack machine 327: @cindex register machine 328: Most VMs use one or more stacks for passing temporary data between VM 329: instructions. Another option is to use a register machine architecture 330: for the virtual machine; we believe that using a stack architecture is 331: usually both simpler and faster. 332: 333: however, this option is slower or 334: significantly more complex to implement than a stack machine architecture. 335: 336: Vmgen has special support and optimizations for stack VMs, making their 337: implementation easy and efficient. 338: 339: You can also implement a register VM with Vmgen (@pxref{Register 340: Machines}), and you will still profit from most Vmgen features. 341: 342: @cindex stack item size 343: @cindex size, stack items 344: Stack items all have the same size, so they typically will be as wide as 345: an integer, pointer, or floating-point value. Vmgen supports treating 346: two consecutive stack items as a single value, but anything larger is 347: best kept in some other memory area (e.g., the heap), with pointers to 348: the data on the stack. 349: 350: @cindex instruction stream 351: @cindex immediate arguments 352: Another source of data is immediate arguments VM instructions (in the VM 353: instruction stream). The VM instruction stream is handled similar to a 354: stack in Vmgen. 355: 356: @cindex garbage collection 357: @cindex reference counting 358: Vmgen has no built-in support for, nor restrictions against 359: @emph{garbage collection}. If you need garbage collection, you need to 360: provide it in your run-time libraries. Using @emph{reference counting} 361: is probably harder, but might be possible (contact us if you are 362: interested). 363: @c reference counting might be possible by including counting code in 364: @c the conversion macros. 365: 366: @c -------------------------------------------------------------------- 367: @node Dispatch, , Data handling, Concepts 368: @section Dispatch 369: @cindex Dispatch of VM instructions 370: @cindex main interpreter loop 371: 372: Understanding this section is probably not necessary for using Vmgen, 373: but it may help. You may want to skip it now, and read it if you find statements about dispatch methods confusing. 374: 375: After executing one VM instruction, the VM interpreter has to dispatch 376: the next VM instruction (Vmgen calls the dispatch routine @samp{NEXT}). 377: Vmgen supports two methods of dispatch: 378: 379: @table @strong 380: 381: @item switch dispatch 382: @cindex switch dispatch 383: In this method the VM interpreter contains a giant @code{switch} 384: statement, with one @code{case} for each VM instruction. The VM 385: instruction opcodes are represented by integers (e.g., produced by an 386: @code{enum}) in the VM code, and dispatch occurs by loading the next 387: opcode, @code{switch}ing on it, and continuing at the appropriate 388: @code{case}; after executing the VM instruction, the VM interpreter 389: jumps back to the dispatch code. 390: 391: @item threaded code 392: @cindex threaded code 393: This method represents a VM instruction opcode by the address of the 394: start of the machine code fragment for executing the VM instruction. 395: Dispatch consists of loading this address, jumping to it, and 396: incrementing the VM instruction pointer. Typically the threaded-code 397: dispatch code is appended directly to the code for executing the VM 398: instruction. Threaded code cannot be implemented in ANSI C, but it can 399: be implemented using GNU C's labels-as-values extension (@pxref{Labels 400: as Values, , Labels as Values, gcc.info, GNU C Manual}). 401: 402: @c call threading 403: @end table 404: 405: Threaded code can be twice as fast as switch dispatch, depending on the 406: interpreter, the benchmark, and the machine. 407: 408: @c ************************************************************* 409: @node Invoking Vmgen, Example, Concepts, Top 410: @chapter Invoking Vmgen 411: @cindex Invoking Vmgen 412: 413: The usual way to invoke Vmgen is as follows: 414: 415: @example 416: vmgen @var{inputfile} 417: @end example 418: 419: Here @var{inputfile} is the VM instruction description file, which 420: usually ends in @file{.vmg}. The output filenames are made by taking 421: the basename of @file{inputfile} (i.e., the output files will be created 422: in the current working directory) and replacing @file{.vmg} with 423: @file{-vm.i}, @file{-disasm.i}, @file{-gen.i}, @file{-labels.i}, 424: @file{-profile.i}, and @file{-peephole.i}. E.g., @command{vmgen 425: hack/foo.vmg} will create @file{foo-vm.i}, @file{foo-disasm.i}, 426: @file{foo-gen.i}, @file{foo-labels.i}, @file{foo-profile.i} and 427: @file{foo-peephole.i}. 428: 429: The command-line options supported by Vmgen are 430: 431: @table @option 432: 433: @cindex -h, command-line option 434: @cindex --help, command-line option 435: @item --help 436: @itemx -h 437: Print a message about the command-line options 438: 439: @cindex -v, command-line option 440: @cindex --version, command-line option 441: @item --version 442: @itemx -v 443: Print version and exit 444: @end table 445: 446: @c env vars GFORTHDIR GFORTHDATADIR 447: 448: @c **************************************************************** 449: @node Example, Input File Format, Invoking Vmgen, Top 450: @chapter Example 451: @cindex example of a Vmgen-based interpreter 452: 453: @menu 454: * Example overview:: 455: * Using profiling to create superinstructions:: 456: @end menu 457: 458: @c -------------------------------------------------------------------- 459: @node Example overview, Using profiling to create superinstructions, Example, Example 460: @section Example overview 461: @cindex example overview 462: @cindex @file{vmgen-ex} 463: @cindex @file{vmgen-ex2} 464: 465: There are two versions of the same example for using Vmgen: 466: @file{vmgen-ex} and @file{vmgen-ex2} (you can also see Gforth as 467: example, but it uses additional (undocumented) features, and also 468: differs in some other respects). The example implements @emph{mini}, a 469: tiny Modula-2-like language with a small JavaVM-like virtual machine. 470: 471: The difference between the examples is that @file{vmgen-ex} uses many 472: casts, and @file{vmgen-ex2} tries to avoids most casts and uses unions 473: instead. In the rest of this manual we usually mention just files in 474: @file{vmgen-ex}; if you want to use unions, use the equivalent file in 475: @file{vmgen-ex2}. 476: @cindex unions example 477: @cindex casts example 478: 479: The files provided with each example are: 480: @cindex example files 481: 482: @example 483: Makefile 484: README 485: disasm.c wrapper file 486: engine.c wrapper file 487: peephole.c wrapper file 488: profile.c wrapper file 489: mini-inst.vmg simple VM instructions 490: mini-super.vmg superinstructions (empty at first) 491: mini.h common declarations 492: mini.l scanner 493: mini.y front end (parser, VM code generator) 494: support.c main() and other support functions 495: fib.mini example mini program 496: simple.mini example mini program 497: test.mini example mini program (tests everything) 498: test.out test.mini output 499: stat.awk script for aggregating profile information 500: peephole-blacklist list of instructions not allowed in superinstructions 501: seq2rule.awk script for creating superinstructions 502: @end example 503: 504: For your own interpreter, you would typically copy the following files 505: and change little, if anything: 506: @cindex wrapper files 507: 508: @example 509: disasm.c wrapper file 510: engine.c wrapper file 511: peephole.c wrapper file 512: profile.c wrapper file 513: stat.awk script for aggregating profile information 514: seq2rule.awk script for creating superinstructions 515: @end example 516: 517: @noindent 518: You would typically change much in or replace the following files: 519: 520: @example 521: Makefile 522: mini-inst.vmg simple VM instructions 523: mini.h common declarations 524: mini.l scanner 525: mini.y front end (parser, VM code generator) 526: support.c main() and other support functions 527: peephole-blacklist list of instructions not allowed in superinstructions 528: @end example 529: 530: You can build the example by @code{cd}ing into the example's directory, 531: and then typing @code{make}; you can check that it works with @code{make 532: check}. You can run run mini programs like this: 533: 534: @example 535: ./mini fib.mini 536: @end example 537: 538: To learn about the options, type @code{./mini -h}. 539: 540: @c -------------------------------------------------------------------- 541: @node Using profiling to create superinstructions, , Example overview, Example 542: @section Using profiling to create superinstructions 543: @cindex profiling example 544: @cindex superinstructions example 545: 546: I have not added rules for this in the @file{Makefile} (there are many 547: options for selecting superinstructions, and I did not want to hardcode 548: one into the @file{Makefile}), but there are some supporting scripts, and 549: here's an example: 550: 551: Suppose you want to use @file{fib.mini} and @file{test.mini} as training 552: programs, you get the profiles like this: 553: 554: @example 555: make fib.prof test.prof #takes a few seconds 556: @end example 557: 558: You can aggregate these profiles with @file{stat.awk}: 559: 560: @example 561: awk -f stat.awk fib.prof test.prof 562: @end example 563: 564: The result contains lines like: 565: 566: @example 567: 2 16 36910041 loadlocal lit 568: @end example 569: 570: This means that the sequence @code{loadlocal lit} statically occurs a 571: total of 16 times in 2 profiles, with a dynamic execution count of 572: 36910041. 573: 574: The numbers can be used in various ways to select superinstructions. 575: E.g., if you just want to select all sequences with a dynamic 576: execution count exceeding 10000, you would use the following pipeline: 577: 578: @example 579: awk -f stat.awk fib.prof test.prof| 580: awk '$3>=10000'| #select sequences 581: fgrep -v -f peephole-blacklist| #eliminate wrong instructions 582: awk -f seq2rule.awk| #transform sequences into superinstruction rules 583: sort -k 3 >mini-super.vmg #sort sequences 584: @end example 585: 586: The file @file{peephole-blacklist} contains all instructions that 587: directly access a stack or stack pointer (for mini: @code{call}, 588: @code{return}); the sort step is necessary to ensure that prefixes 589: precede larger superinstructions. 590: 591: Now you can create a version of mini with superinstructions by just 592: saying @samp{make} 593: 594: 595: @c *************************************************************** 596: @node Input File Format, Error messages, Example, Top 597: @chapter Input File Format 598: @cindex input file format 599: @cindex format, input file 600: 601: Vmgen takes as input a file containing specifications of virtual machine 602: instructions. This file usually has a name ending in @file{.vmg}. 603: 604: Most examples are taken from the example in @file{vmgen-ex}. 605: 606: @menu 607: * Input File Grammar:: 608: * Simple instructions:: 609: * Superinstructions:: 610: * Store Optimization:: 611: * Register Machines:: How to define register VM instructions 612: @end menu 613: 614: @c -------------------------------------------------------------------- 615: @node Input File Grammar, Simple instructions, Input File Format, Input File Format 616: @section Input File Grammar 617: @cindex grammar, input file 618: @cindex input file grammar 619: 620: The grammar is in EBNF format, with @code{@var{a}|@var{b}} meaning 621: ``@var{a} or @var{b}'', @code{@{@var{c}@}} meaning 0 or more repetitions 622: of @var{c} and @code{[@var{d}]} meaning 0 or 1 repetitions of @var{d}. 623: 624: @cindex free-format, not 625: @cindex newlines, significance in syntax 626: Vmgen input is not free-format, so you have to take care where you put 627: newlines (and, in a few cases, white space). 628: 629: @example 630: description: @{instruction|comment|eval-escape|c-escape@} 631: 632: instruction: simple-inst|superinst 633: 634: simple-inst: ident '(' stack-effect ')' newline c-code newline newline 635: 636: stack-effect: @{ident@} '--' @{ident@} 637: 638: super-inst: ident '=' ident @{ident@} 639: 640: comment: '\ ' text newline 641: 642: eval-escape: '\E ' text newline 643: 644: c-escape: '\C ' text newline 645: @end example 646: @c \+ \- \g \f \c 647: 648: Note that the @code{\}s in this grammar are meant literally, not as 649: C-style encodings for non-printable characters. 650: 651: There are two ways to delimit the C code in @code{simple-inst}: 652: 653: @itemize @bullet 654: 655: @item 656: If you start it with a @samp{@{} at the start of a line (i.e., not even 657: white space before it), you have to end it with a @samp{@}} at the start 658: of a line (followed by a newline). In this case you may have empty 659: lines within the C code (typically used between variable definitions and 660: statements). 661: 662: @item 663: You do not start it with @samp{@{}. Then the C code ends at the first 664: empty line, so you cannot have empty lines within this code. 665: 666: @end itemize 667: 668: The text in @code{comment}, @code{eval-escape} and @code{c-escape} must 669: not contain a newline. @code{Ident} must conform to the usual 670: conventions of C identifiers (otherwise the C compiler would choke on 671: the Vmgen output), except that idents in @code{stack-effect} may have a 672: stack prefix (for stack prefix syntax, @pxref{Eval escapes}). 673: 674: @cindex C escape 675: @cindex @code{\C} 676: @cindex conditional compilation of Vmgen output 677: The @code{c-escape} passes the text through to each output file (without 678: the @samp{\C}). This is useful mainly for conditional compilation 679: (i.e., you write @samp{\C #if ...} etc.). 680: 681: @cindex sync lines 682: @cindex @code{#line} 683: In addition to the syntax given in the grammer, Vmgen also processes 684: sync lines (lines starting with @samp{#line}), as produced by @samp{m4 685: -s} (@pxref{Invoking m4, , Invoking m4, m4.info, GNU m4}) and similar 686: tools. This allows associating C compiler error messages with the 687: original source of the C code. 688: 689: Vmgen understands a few extensions beyond the grammar given here, but 690: these extensions are only useful for building Gforth. You can find a 691: description of the format used for Gforth in @file{prim}. 692: 693: @menu 694: * Eval escapes:: what follows \E 695: @end menu 696: 697: @node Eval escapes, , Input File Grammar, Input File Grammar 698: @subsection Eval escapes 699: @cindex escape to Forth 700: @cindex eval escape 701: @cindex @code{\E} 702: 703: @c woanders? 704: The text in @code{eval-escape} is Forth code that is evaluated when 705: Vmgen reads the line. You will normally use this feature to define 706: stacks and types. 707: 708: If you do not know (and do not want to learn) Forth, you can build the 709: text according to the following grammar; these rules are normally all 710: Forth you need for using Vmgen: 711: 712: @example 713: text: stack-decl|type-prefix-decl|stack-prefix-decl|set-flag 714: 715: stack-decl: 'stack ' ident ident ident 716: type-prefix-decl: 717: 's" ' string '" ' ('single'|'double') ident 'type-prefix' ident 718: stack-prefix-decl: ident 'stack-prefix' string 719: set-flag: ('store-optimization'|'include-skipped-insts') ('on'|'off') 720: @end example 721: 722: Note that the syntax of this code is not checked thoroughly (there are 723: many other Forth program fragments that could be written in an 724: eval-escape). 725: 726: A stack prefix can contain letters, digits, or @samp{:}, and may start 727: with an @samp{#}; e.g., in Gforth the return stack has the stack prefix 728: @samp{R:}. This restriction is not checked during the stack prefix 729: definition, but it is enforced by the parsing rules for stack items 730: later. 731: 732: If you know Forth, the stack effects of the non-standard words involved 733: are: 734: @findex stack 735: @findex type-prefix 736: @findex single 737: @findex double 738: @findex stack-prefix 739: @findex store-optimization 740: @example 741: stack ( "name" "pointer" "type" -- ) 742: ( name execution: -- stack ) 743: type-prefix ( addr u item-size stack "prefix" -- ) 744: single ( -- item-size ) 745: double ( -- item-size ) 746: stack-prefix ( stack "prefix" -- ) 747: store-optimization ( -- addr ) 748: include-skipped-insts ( -- addr ) 749: @end example 750: 751: An @var{item-size} takes three cells on the stack. 752: 753: @c -------------------------------------------------------------------- 754: @node Simple instructions, Superinstructions, Input File Grammar, Input File Format 755: @section Simple instructions 756: @cindex simple VM instruction 757: @cindex instruction, simple VM 758: 759: We will use the following simple VM instruction description as example: 760: 761: @example 762: sub ( i1 i2 -- i ) 763: i = i1-i2; 764: @end example 765: 766: The first line specifies the name of the VM instruction (@code{sub}) and 767: its stack effect (@code{i1 i2 -- i}). The rest of the description is 768: just plain C code. 769: 770: @cindex stack effect 771: @cindex effect, stack 772: The stack effect specifies that @code{sub} pulls two integers from the 773: data stack and puts them in the C variables @code{i1} and @code{i2} 774: (with the rightmost item (@code{i2}) taken from the top of stack; 775: intuition: if you push @code{i1}, then @code{i2} on the stack, the 776: resulting stack picture is @code{i1 i2}) and later pushes one integer 777: (@code{i}) on the data stack (the rightmost item is on the top 778: afterwards). 779: 780: @cindex prefix, type 781: @cindex type prefix 782: @cindex default stack of a type prefix 783: How do we know the type and stack of the stack items? Vmgen uses 784: prefixes, similar to Fortran; in contrast to Fortran, you have to 785: define the prefix first: 786: 787: @example 788: \E s" Cell" single data-stack type-prefix i 789: @end example 790: 791: This defines the prefix @code{i} to refer to the type @code{Cell} 792: (defined as @code{long} in @file{mini.h}) and, by default, to the 793: @code{data-stack}. It also specifies that this type takes one stack 794: item (@code{single}). The type prefix is part of the variable name. 795: 796: @cindex stack definition 797: @cindex defining a stack 798: Before we can use @code{data-stack} in this way, we have to define it: 799: 800: @example 801: \E stack data-stack sp Cell 802: @end example 803: @c !! use something other than Cell 804: 805: @cindex stack basic type 806: @cindex basic type of a stack 807: @cindex type of a stack, basic 808: This line defines the stack @code{data-stack}, which uses the stack 809: pointer @code{sp}, and each item has the basic type @code{Cell}; other 810: types have to fit into one or two @code{Cell}s (depending on whether the 811: type is @code{single} or @code{double} wide), and are cast from and to 812: Cells on accessing the @code{data-stack} with type cast macros 813: (@pxref{VM engine}). By default, stacks grow towards lower addresses in 814: Vmgen-erated interpreters (@pxref{Stack growth direction}). 815: 816: @cindex stack prefix 817: @cindex prefix, stack 818: We can override the default stack of a stack item by using a stack 819: prefix. E.g., consider the following instruction: 820: 821: @example 822: lit ( #i -- i ) 823: @end example 824: 825: The VM instruction @code{lit} takes the item @code{i} from the 826: instruction stream (indicated by the prefix @code{#}), and pushes it on 827: the (default) data stack. The stack prefix is not part of the variable 828: name. Stack prefixes are defined like this: 829: 830: @example 831: \E inst-stream stack-prefix # 832: @end example 833: 834: This definition defines that the stack prefix @code{#} specifies the 835: ``stack'' @code{inst-stream}. Since the instruction stream behaves a 836: little differently than an ordinary stack, it is predefined, and you do 837: not need to define it. 838: 839: @cindex instruction stream 840: The instruction stream contains instructions and their immediate 841: arguments, so specifying that an argument comes from the instruction 842: stream indicates an immediate argument. Of course, instruction stream 843: arguments can only appear to the left of @code{--} in the stack effect. 844: If there are multiple instruction stream arguments, the leftmost is the 845: first one (just as the intuition suggests). 846: 847: @menu 848: * C Code Macros:: Macros recognized by Vmgen 849: * C Code restrictions:: Vmgen makes assumptions about C code 850: * Stack growth direction:: is configurable per stack 851: @end menu 852: 853: @c -------------------------------------------------------------------- 854: @node C Code Macros, C Code restrictions, Simple instructions, Simple instructions 855: @subsection C Code Macros 856: @cindex macros recognized by Vmgen 857: @cindex basic block, VM level 858: 859: Vmgen recognizes the following strings in the C code part of simple 860: instructions: 861: 862: @table @code 863: 864: @item SET_IP 865: @findex SET_IP 866: As far as Vmgen is concerned, a VM instruction containing this ends a VM 867: basic block (used in profiling to delimit profiled sequences). On the C 868: level, this also sets the instruction pointer. 869: 870: @item SUPER_END 871: @findex SUPER_END 872: This ends a basic block (for profiling), even if the instruction 873: contains no @code{SET_IP}. 874: 875: @item INST_TAIL; 876: @findex INST_TAIL; 877: Vmgen replaces @samp{INST_TAIL;} with code for ending a VM instruction and 878: dispatching the next VM instruction. Even without a @samp{INST_TAIL;} this 879: happens automatically when control reaches the end of the C code. If 880: you want to have this in the middle of the C code, you need to use 881: @samp{INST_TAIL;}. A typical example is a conditional VM branch: 882: 883: @example 884: if (branch_condition) @{ 885: SET_IP(target); INST_TAIL; 886: @} 887: /* implicit tail follows here */ 888: @end example 889: 890: In this example, @samp{INST_TAIL;} is not strictly necessary, because there 891: is another one implicitly after the if-statement, but using it improves 892: branch prediction accuracy slightly and allows other optimizations. 893: 894: @item SUPER_CONTINUE 895: @findex SUPER_CONTINUE 896: This indicates that the implicit tail at the end of the VM instruction 897: dispatches the sequentially next VM instruction even if there is a 898: @code{SET_IP} in the VM instruction. This enables an optimization that 899: is not yet implemented in the vmgen-ex code (but in Gforth). The 900: typical application is in conditional VM branches: 901: 902: @example 903: if (branch_condition) @{ 904: SET_IP(target); INST_TAIL; /* now this INST_TAIL is necessary */ 905: @} 906: SUPER_CONTINUE; 907: @end example 908: 909: @item VM_JUMP 910: @findex VM_JUMP 911: @code{VM_JUMP(target)} is equivalent to @code{goto *(target)}, but 912: allows Vmgen to do dynamic superinstructions and replication. You 913: still need to say @code{SUPER_END}. Also, the goto only happens at 914: the end (wherever the VM_JUMP is). Essentially, this just suppresses 915: much of the ordinary dispatch mechanism. 916: 917: @end table 918: 919: Note that Vmgen is not smart about C-level tokenization, comments, 920: strings, or conditional compilation, so it will interpret even a 921: commented-out SUPER_END as ending a basic block (or, e.g., 922: @samp{RESET_IP;} as @samp{SET_IP;}). Conversely, Vmgen requires the literal 923: presence of these strings; Vmgen will not see them if they are hiding in 924: a C preprocessor macro. 925: 926: 927: @c -------------------------------------------------------------------- 928: @node C Code restrictions, Stack growth direction, C Code Macros, Simple instructions 929: @subsection C Code restrictions 930: @cindex C code restrictions 931: @cindex restrictions on C code 932: @cindex assumptions about C code 933: 934: @cindex accessing stack (pointer) 935: @cindex stack pointer, access 936: @cindex instruction pointer, access 937: Vmgen generates code and performs some optimizations under the 938: assumption that the user-supplied C code does not access the stack 939: pointers or stack items, and that accesses to the instruction pointer 940: only occur through special macros. In general you should heed these 941: restrictions. However, if you need to break these restrictions, read 942: the following. 943: 944: Accessing a stack or stack pointer directly can be a problem for several 945: reasons: 946: @cindex stack caching, restriction on C code 947: @cindex superinstructions, restrictions on components 948: 949: @itemize @bullet 950: 951: @item 952: Vmgen optionally supports caching the top-of-stack item in a local 953: variable (that is allocated to a register). This is the most frequent 954: source of trouble. You can deal with it either by not using 955: top-of-stack caching (slowdown factor 1-1.4, depending on machine), or 956: by inserting flushing code (e.g., @samp{IF_spTOS(sp[...] = spTOS);}) at 957: the start and reloading code (e.g., @samp{IF_spTOS(spTOS = sp[0])}) at 958: the end of problematic C code. Vmgen inserts a stack pointer update 959: before the start of the user-supplied C code, so the flushing code has 960: to use an index that corrects for that. In the future, this flushing 961: may be done automatically by mentioning a special string in the C code. 962: @c sometimes flushing and/or reloading unnecessary 963: 964: @item 965: The Vmgen-erated code loads the stack items from stack-pointer-indexed 966: memory into variables before the user-supplied C code, and stores them 967: from variables to stack-pointer-indexed memory afterwards. If you do 968: any writes to the stack through its stack pointer in your C code, it 969: will not affect the variables, and your write may be overwritten by the 970: stores after the C code. Similarly, a read from a stack using a stack 971: pointer will not reflect computations of stack items in the same VM 972: instruction. 973: 974: @item 975: Superinstructions keep stack items in variables across the whole 976: superinstruction. So you should not include VM instructions, that 977: access a stack or stack pointer, as components of superinstructions 978: (@pxref{VM profiler}). 979: 980: @end itemize 981: 982: You should access the instruction pointer only through its special 983: macros (@samp{IP}, @samp{SET_IP}, @samp{IPTOS}); this ensure that these 984: macros can be implemented in several ways for best performance. 985: @samp{IP} points to the next instruction, and @samp{IPTOS} is its 986: contents. 987: 988: @c -------------------------------------------------------------------- 989: @node Stack growth direction, , C Code restrictions, Simple instructions 990: @subsection Stack growth direction 991: @cindex stack growth direction 992: 993: @cindex @code{stack-access-transform} 994: By default, the stacks grow towards lower addresses. You can change 995: this for a stack by setting the @code{stack-access-transform} field of 996: the stack to an xt @code{( itemnum -- index )} that performs the 997: appropriate index transformation. 998: 999: E.g., if you want to let @code{data-stack} grow towards higher 1000: addresses, with the stack pointer always pointing just beyond the 1001: top-of-stack, use this right after defining @code{data-stack}: 1002: 1003: @example 1004: \E : sp-access-transform ( itemnum -- index ) negate 1- ; 1005: \E ' sp-access-transform ' data-stack >body stack-access-transform ! 1006: @end example 1007: 1008: This means that @code{sp-access-transform} will be used to generate 1009: indexes for accessing @code{data-stack}. The definition of 1010: @code{sp-access-transform} above transforms n into -n-1, e.g, 1 into -2. 1011: This will access the 0th data-stack element (top-of-stack) at sp[-1], 1012: the 1st at sp[-2], etc., which is the typical way upward-growing 1013: stacks are used. If you need a different transform and do not know 1014: enough Forth to program it, let me know. 1015: 1016: @c -------------------------------------------------------------------- 1017: @node Superinstructions, Store Optimization, Simple instructions, Input File Format 1018: @section Superinstructions 1019: @cindex superinstructions, defining 1020: @cindex defining superinstructions 1021: 1022: Note: don't invest too much work in (static) superinstructions; a future 1023: version of Vmgen will support dynamic superinstructions (see Ian 1024: Piumarta and Fabio Riccardi, @cite{Optimizing Direct Threaded Code by 1025: Selective Inlining}, PLDI'98), and static superinstructions have much 1026: less benefit in that context (preliminary results indicate only a factor 1027: 1.1 speedup). 1028: 1029: Here is an example of a superinstruction definition: 1030: 1031: @example 1032: lit_sub = lit sub 1033: @end example 1034: 1035: @code{lit_sub} is the name of the superinstruction, and @code{lit} and 1036: @code{sub} are its components. This superinstruction performs the same 1037: action as the sequence @code{lit} and @code{sub}. It is generated 1038: automatically by the VM code generation functions whenever that sequence 1039: occurs, so if you want to use this superinstruction, you just need to 1040: add this definition (and even that can be partially automatized, 1041: @pxref{VM profiler}). 1042: 1043: @cindex prefixes of superinstructions 1044: Vmgen requires that the component instructions are simple instructions 1045: defined before superinstructions using the components. Currently, Vmgen 1046: also requires that all the subsequences at the start of a 1047: superinstruction (prefixes) must be defined as superinstruction before 1048: the superinstruction. I.e., if you want to define a superinstruction 1049: 1050: @example 1051: foo4 = load add sub mul 1052: @end example 1053: 1054: you first have to define @code{load}, @code{add}, @code{sub} and 1055: @code{mul}, plus 1056: 1057: @example 1058: foo2 = load add 1059: foo3 = load add sub 1060: @end example 1061: 1062: Here, @code{sumof4} is the longest prefix of @code{sumof5}, and @code{sumof3} 1063: is the longest prefix of @code{sumof4}. 1064: 1065: Note that Vmgen assumes that only the code it generates accesses stack 1066: pointers, the instruction pointer, and various stack items, and it 1067: performs optimizations based on this assumption. Therefore, VM 1068: instructions where your C code changes the instruction pointer should 1069: only be used as last component; a VM instruction where your C code 1070: accesses a stack pointer should not be used as component at all. Vmgen 1071: does not check these restrictions, they just result in bugs in your 1072: interpreter. 1073: 1074: @cindex include-skipped-insts 1075: The Vmgen flag @code{include-skipped-insts} influences superinstruction 1076: code generation. Currently there is no support in the peephole 1077: optimizer for both variations, so leave this flag alone for now. 1078: 1079: @c ------------------------------------------------------------------- 1080: @node Store Optimization, Register Machines, Superinstructions, Input File Format 1081: @section Store Optimization 1082: @cindex store optimization 1083: @cindex optimization, stack stores 1084: @cindex stack stores, optimization 1085: @cindex eliminating stack stores 1086: 1087: This minor optimization (0.6\%--0.8\% reduction in executed instructions 1088: for Gforth) puts additional requirements on the instruction descriptions 1089: and is therefore disabled by default. 1090: 1091: What does it do? Consider an instruction like 1092: 1093: @example 1094: dup ( n -- n n ) 1095: @end example 1096: 1097: For simplicity, also assume that we are not caching the top-of-stack in 1098: a register. Now, the C code for dup first loads @code{n} from the 1099: stack, and then stores it twice to the stack, one time to the address 1100: where it came from; that time is unnecessary, but gcc does not optimize 1101: it away, so vmgen can do it instead (if you turn on the store 1102: optimization). 1103: 1104: Vmgen uses the stack item's name to determine if the stack item contains 1105: the same value as it did at the start. Therefore, if you use the store 1106: optimization, you have to ensure that stack items that have the same 1107: name on input and output also have the same value, and are not changed 1108: in the C code you supply. I.e., the following code could fail if you 1109: turn on the store optimization: 1110: 1111: @example 1112: add1 ( n -- n ) 1113: n++; 1114: @end example 1115: 1116: Instead, you have to use different names, i.e.: 1117: 1118: @example 1119: add1 ( n1 -- n2 ) 1120: n2=n1+1; 1121: @end example 1122: 1123: Similarly, the store optimization assumes that the stack pointer is only 1124: changed by Vmgen-erated code. If your C code changes the stack pointer, 1125: use different names in input and output stack items to avoid a (probably 1126: wrong) store optimization, or turn the store optimization off for this 1127: VM instruction. 1128: 1129: To turn on the store optimization, write 1130: 1131: @example 1132: \E store-optimization on 1133: @end example 1134: 1135: at the start of the file. You can turn this optimization on or off 1136: between any two VM instruction descriptions. For turning it off again, 1137: you can use 1138: 1139: @example 1140: \E store-optimization off 1141: @end example 1142: 1143: @c ------------------------------------------------------------------- 1144: @node Register Machines, , Store Optimization, Input File Format 1145: @section Register Machines 1146: @cindex Register VM 1147: @cindex Superinstructions for register VMs 1148: @cindex tracing of register VMs 1149: 1150: If you want to implement a register VM rather than a stack VM with 1151: Vmgen, there are two ways to do it: Directly and through 1152: superinstructions. 1153: 1154: If you use the direct way, you define instructions that take the 1155: register numbers as immediate arguments, like this: 1156: 1157: @example 1158: add3 ( #src1 #src2 #dest -- ) 1159: reg[dest] = reg[src1]+reg[src2]; 1160: @end example 1161: 1162: A disadvantage of this method is that during tracing you only see the 1163: register numbers, but not the register contents. Actually, with an 1164: appropriate definition of @code{printarg_src} (@pxref{VM engine}), you 1165: can print the values of the source registers on entry, but you cannot 1166: print the value of the destination register on exit. 1167: 1168: If you use superinstructions to define a register VM, you define simple 1169: instructions that use a stack, and then define superinstructions that 1170: have no overall stack effect, like this: 1171: 1172: @example 1173: loadreg ( #src -- n ) 1174: n = reg[src]; 1175: 1176: storereg ( n #dest -- ) 1177: reg[dest] = n; 1178: 1179: adds ( n1 n2 -- n ) 1180: n = n1+n2; 1181: 1182: add3 = loadreg loadreg adds storereg 1183: @end example 1184: 1185: An advantage of this method is that you see the values and not just the 1186: register numbers in tracing. A disadvantage of this method is that 1187: currently you cannot generate superinstructions directly, but only 1188: through generating a sequence of simple instructions (we might change 1189: this in the future if there is demand). 1190: 1191: Could the register VM support be improved, apart from the issues 1192: mentioned above? It is hard to see how to do it in a general way, 1193: because there are a number of different designs that different people 1194: mean when they use the term @emph{register machine} in connection with 1195: VM interpreters. However, if you have ideas or requests in that 1196: direction, please let me know (@pxref{Contact}). 1197: 1198: @c ******************************************************************** 1199: @node Error messages, Using the generated code, Input File Format, Top 1200: @chapter Error messages 1201: @cindex error messages 1202: 1203: These error messages are created by Vmgen: 1204: 1205: @table @code 1206: 1207: @cindex @code{# can only be on the input side} error 1208: @item # can only be on the input side 1209: You have used an instruction-stream prefix (usually @samp{#}) after the 1210: @samp{--} (the output side); you can only use it before (the input 1211: side). 1212: 1213: @cindex @code{prefix for this combination must be defined earlier} error 1214: @item the prefix for this superinstruction must be defined earlier 1215: You have defined a superinstruction (e.g. @code{abc = a b c}) without 1216: defining its direct prefix (e.g., @code{ab = a b}), 1217: @xref{Superinstructions}. 1218: 1219: @cindex @code{sync line syntax} error 1220: @item sync line syntax 1221: If you are using a preprocessor (e.g., @command{m4}) to generate Vmgen 1222: input code, you may want to create @code{#line} directives (aka sync 1223: lines). This error indicates that such a line is not in th syntax 1224: expected by Vmgen (this should not happen; please report the offending 1225: line in a bug report). 1226: 1227: @cindex @code{syntax error, wrong char} error 1228: @item syntax error, wrong char 1229: A syntax error. If you do not see right away where the error is, it may 1230: be helpful to check the following: Did you put an empty line in a VM 1231: instruction where the C code is not delimited by braces (then the empty 1232: line ends the VM instruction)? If you used brace-delimited C code, did 1233: you put the delimiting braces (and only those) at the start of the line, 1234: without preceding white space? Did you forget a delimiting brace? 1235: 1236: @cindex @code{too many stacks} error 1237: @item too many stacks 1238: Vmgen currently supports 3 stacks (plus the instruction stream); if you 1239: need more, let us know. 1240: 1241: @cindex @code{unknown prefix} error 1242: @item unknown prefix 1243: The stack item does not match any defined type prefix (after stripping 1244: away any stack prefix). You should either declare the type prefix you 1245: want for that stack item, or use a different type prefix 1246: 1247: @cindex @code{unknown primitive} error 1248: @item unknown primitive 1249: You have used the name of a simple VM instruction in a superinstruction 1250: definition without defining the simple VM instruction first. 1251: 1252: @end table 1253: 1254: In addition, the C compiler can produce errors due to code produced by 1255: Vmgen; e.g., you need to define type cast functions. 1256: 1257: @c ******************************************************************** 1258: @node Using the generated code, Hints, Error messages, Top 1259: @chapter Using the generated code 1260: @cindex generated code, usage 1261: @cindex Using vmgen-erated code 1262: 1263: The easiest way to create a working VM interpreter with Vmgen is 1264: probably to start with @file{vmgen-ex}, and modify it for your purposes. 1265: This chapter explains what the various wrapper and generated files do. 1266: It also contains reference-manual style descriptions of the macros, 1267: variables etc. used by the generated code, and you can skip that on 1268: first reading. 1269: 1270: @menu 1271: * VM engine:: Executing VM code 1272: * VM instruction table:: 1273: * VM code generation:: Creating VM code (in the front-end) 1274: * Peephole optimization:: Creating VM superinstructions 1275: * VM disassembler:: for debugging the front end 1276: * VM profiler:: for finding worthwhile superinstructions 1277: @end menu 1278: 1279: @c -------------------------------------------------------------------- 1280: @node VM engine, VM instruction table, Using the generated code, Using the generated code 1281: @section VM engine 1282: @cindex VM instruction execution 1283: @cindex engine 1284: @cindex executing VM code 1285: @cindex @file{engine.c} 1286: @cindex @file{-vm.i} output file 1287: 1288: The VM engine is the VM interpreter that executes the VM code. It is 1289: essential for an interpretive system. 1290: 1291: Vmgen supports two methods of VM instruction dispatch: @emph{threaded 1292: code} (fast, but gcc-specific), and @emph{switch dispatch} (slow, but 1293: portable across C compilers); you can use conditional compilation 1294: (@samp{defined(__GNUC__)}) to choose between these methods, and our 1295: example does so. 1296: 1297: For both methods, the VM engine is contained in a C-level function. 1298: Vmgen generates most of the contents of the function for you 1299: (@file{@var{name}-vm.i}), but you have to define this function, and 1300: macros and variables used in the engine, and initialize the variables. 1301: In our example the engine function also includes 1302: @file{@var{name}-labels.i} (@pxref{VM instruction table}). 1303: 1304: @cindex tracing VM code 1305: @cindex superinstructions and tracing 1306: In addition to executing the code, the VM engine can optionally also 1307: print out a trace of the executed instructions, their arguments and 1308: results. For superinstructions it prints the trace as if only component 1309: instructions were executed; this allows to introduce new 1310: superinstructions while keeping the traces comparable to old ones 1311: (important for regression tests). 1312: 1313: It costs significant performance to check in each instruction whether to 1314: print tracing code, so we recommend producing two copies of the engine: 1315: one for fast execution, and one for tracing. See the rules for 1316: @file{engine.o} and @file{engine-debug.o} in @file{vmgen-ex/Makefile} 1317: for an example. 1318: 1319: The following macros and variables are used in @file{@var{name}-vm.i}: 1320: 1321: @table @code 1322: 1323: @findex LABEL 1324: @item LABEL(@var{inst_name}) 1325: This is used just before each VM instruction to provide a jump or 1326: @code{switch} label (the @samp{:} is provided by Vmgen). For switch 1327: dispatch this should expand to @samp{case @var{label}:}; for 1328: threaded-code dispatch this should just expand to @samp{@var{label}:}. 1329: In either case @var{label} is usually the @var{inst_name} with some 1330: prefix or suffix to avoid naming conflicts. 1331: 1332: @findex LABEL2 1333: @item LABEL2(@var{inst_name}) 1334: This will be used for dynamic superinstructions; at the moment, this 1335: should expand to nothing. 1336: 1337: @findex NAME 1338: @item NAME(@var{inst_name_string}) 1339: Called on entering a VM instruction with a string containing the name of 1340: the VM instruction as parameter. In normal execution this should be 1341: expand to nothing, but for tracing this usually prints the name, and 1342: possibly other information (several VM registers in our example). 1343: 1344: @findex DEF_CA 1345: @item DEF_CA 1346: Usually empty. Called just inside a new scope at the start of a VM 1347: instruction. Can be used to define variables that should be visible 1348: during every VM instruction. If you define this macro as non-empty, you 1349: have to provide the finishing @samp{;} in the macro. 1350: 1351: @findex NEXT_P0 1352: @findex NEXT_P1 1353: @findex NEXT_P2 1354: @item NEXT_P0 NEXT_P1 NEXT_P2 1355: The three parts of instruction dispatch. They can be defined in 1356: different ways for best performance on various processors (see 1357: @file{engine.c} in the example or @file{engine/threaded.h} in Gforth). 1358: @samp{NEXT_P0} is invoked right at the start of the VM instruction (but 1359: after @samp{DEF_CA}), @samp{NEXT_P1} right after the user-supplied C 1360: code, and @samp{NEXT_P2} at the end. The actual jump has to be 1361: performed by @samp{NEXT_P2} (if you would do it earlier, important parts 1362: of the VM instruction would not be executed). 1363: 1364: The simplest variant is if @samp{NEXT_P2} does everything and the other 1365: macros do nothing. Then also related macros like @samp{IP}, 1366: @samp{SET_IP}, @samp{IP}, @samp{INC_IP} and @samp{IPTOS} are very 1367: straightforward to define. For switch dispatch this code consists just 1368: of a jump to the dispatch code (@samp{goto next_inst;} in our example); 1369: for direct threaded code it consists of something like 1370: @samp{(@{cfa=*ip++; goto *cfa;@})}. 1371: 1372: Pulling code (usually the @samp{cfa=*ip++;}) up into @samp{NEXT_P1} 1373: usually does not cause problems, but pulling things up into 1374: @samp{NEXT_P0} usually requires changing the other macros (and, at least 1375: for Gforth on Alpha, it does not buy much, because the compiler often 1376: manages to schedule the relevant stuff up by itself). An even more 1377: extreme variant is to pull code up even further, into, e.g., NEXT_P1 of 1378: the previous VM instruction (prefetching, useful on PowerPCs). 1379: 1380: @findex INC_IP 1381: @item INC_IP(@var{n}) 1382: This increments @code{IP} by @var{n}. 1383: 1384: @findex SET_IP 1385: @item SET_IP(@var{target}) 1386: This sets @code{IP} to @var{target}. 1387: 1388: @cindex type cast macro 1389: @findex vm_@var{A}2@var{B} 1390: @item vm_@var{A}2@var{B}(a,b) 1391: Type casting macro that assigns @samp{a} (of type @var{A}) to @samp{b} 1392: (of type @var{B}). This is mainly used for getting stack items into 1393: variables and back. So you need to define macros for every combination 1394: of stack basic type (@code{Cell} in our example) and type-prefix types 1395: used with that stack (in both directions). For the type-prefix type, 1396: you use the type-prefix (not the C type string) as type name (e.g., 1397: @samp{vm_Cell2i}, not @samp{vm_Cell2Cell}). In addition, you have to 1398: define a vm_@var{X}2@var{X} macro for the stack's basic type @var{X} 1399: (used in superinstructions). 1400: 1401: @cindex instruction stream, basic type 1402: The stack basic type for the predefined @samp{inst-stream} is 1403: @samp{Cell}. If you want a stack with the same item size, making its 1404: basic type @samp{Cell} usually reduces the number of macros you have to 1405: define. 1406: 1407: @cindex unions in type cast macros 1408: @cindex casts in type cast macros 1409: @cindex type casting between floats and integers 1410: Here our examples differ a lot: @file{vmgen-ex} uses casts in these 1411: macros, whereas @file{vmgen-ex2} uses union-field selection (or 1412: assignment to union fields). Note that casting floats into integers and 1413: vice versa changes the bit pattern (and you do not want that). In this 1414: case your options are to use a (temporary) union, or to take the address 1415: of the value, cast the pointer, and dereference that (not always 1416: possible, and sometimes expensive). 1417: 1418: @findex vm_two@var{A}2@var{B} 1419: @findex vm_@var{B}2two@var{A} 1420: @item vm_two@var{A}2@var{B}(a1,a2,b) 1421: @item vm_@var{B}2two@var{A}(b,a1,a2) 1422: Type casting between two stack items (@code{a1}, @code{a2}) and a 1423: variable @code{b} of a type that takes two stack items. This does not 1424: occur in our small examples, but you can look at Gforth for examples 1425: (see @code{vm_twoCell2d} in @file{engine/forth.h}). 1426: 1427: @cindex stack pointer definition 1428: @cindex instruction pointer definition 1429: @item @var{stackpointer} 1430: For each stack used, the stackpointer name given in the stack 1431: declaration is used. For a regular stack this must be an l-expression; 1432: typically it is a variable declared as a pointer to the stack's basic 1433: type. For @samp{inst-stream}, the name is @samp{IP}, and it can be a 1434: plain r-value; typically it is a macro that abstracts away the 1435: differences between the various implementations of @code{NEXT_P*}. 1436: 1437: @cindex IMM_ARG 1438: @findex IMM_ARG 1439: @item IMM_ARG(access,value) 1440: Define this to expland to ``(access)''. This is just a placeholder for 1441: future extensions. 1442: 1443: @cindex top of stack caching 1444: @cindex stack caching 1445: @cindex TOS 1446: @findex IPTOS 1447: @item @var{stackpointer}TOS 1448: The top-of-stack for the stack pointed to by @var{stackpointer}. If you 1449: are using top-of-stack caching for that stack, this should be defined as 1450: variable; if you are not using top-of-stack caching for that stack, this 1451: should be a macro expanding to @samp{@var{stackpointer}[0]}. The stack 1452: pointer for the predefined @samp{inst-stream} is called @samp{IP}, so 1453: the top-of-stack is called @samp{IPTOS}. 1454: 1455: @findex IF_@var{stackpointer}TOS 1456: @item IF_@var{stackpointer}TOS(@var{expr}) 1457: Macro for executing @var{expr}, if top-of-stack caching is used for the 1458: @var{stackpointer} stack. I.e., this should do @var{expr} if there is 1459: top-of-stack caching for @var{stackpointer}; otherwise it should do 1460: nothing. 1461: 1462: @findex SUPER_END 1463: @item SUPER_END 1464: This is used by the VM profiler (@pxref{VM profiler}); it should not do 1465: anything in normal operation, and call @code{vm_count_block(IP)} for 1466: profiling. 1467: 1468: @findex SUPER_CONTINUE 1469: @item SUPER_CONTINUE 1470: This is just a hint to Vmgen and does nothing at the C level. 1471: 1472: @findex MAYBE_UNUSED 1473: @item MAYBE_UNUSED 1474: This should be defined as @code{__attribute__((unused))} for gcc-2.7 and 1475: higher. It suppresses the warnings about unused variables in the code 1476: for superinstructions. You need to define this only if you are using 1477: superinstructions. 1478: 1479: @findex VM_DEBUG 1480: @item VM_DEBUG 1481: If this is defined, the tracing code will be compiled in (slower 1482: interpretation, but better debugging). Our example compiles two 1483: versions of the engine, a fast-running one that cannot trace, and one 1484: with potential tracing and profiling. 1485: 1486: @findex vm_debug 1487: @item vm_debug 1488: Needed only if @samp{VM_DEBUG} is defined. If this variable contains 1489: true, the VM instructions produce trace output. It can be turned on or 1490: off at any time. 1491: 1492: @findex vm_out 1493: @item vm_out 1494: Needed only if @samp{VM_DEBUG} is defined. Specifies the file on which 1495: to print the trace output (type @samp{FILE *}). 1496: 1497: @findex printarg_@var{type} 1498: @item printarg_@var{type}(@var{value}) 1499: Needed only if @samp{VM_DEBUG} is defined. Macro or function for 1500: printing @var{value} in a way appropriate for the @var{type}. This is 1501: used for printing the values of stack items during tracing. @var{Type} 1502: is normally the type prefix specified in a @code{type-prefix} definition 1503: (e.g., @samp{printarg_i}); in superinstructions it is currently the 1504: basic type of the stack. 1505: 1506: @end table 1507: 1508: 1509: @c -------------------------------------------------------------------- 1510: @node VM instruction table, VM code generation, VM engine, Using the generated code 1511: @section VM instruction table 1512: @cindex instruction table 1513: @cindex opcode definition 1514: @cindex labels for threaded code 1515: @cindex @code{vm_prim}, definition 1516: @cindex @file{-labels.i} output file 1517: 1518: For threaded code we also need to produce a table containing the labels 1519: of all VM instructions. This is needed for VM code generation 1520: (@pxref{VM code generation}), and it has to be done in the engine 1521: function, because the labels are not visible outside. It then has to be 1522: passed outside the function (and assigned to @samp{vm_prim}), to be used 1523: by the VM code generation functions. 1524: 1525: This means that the engine function has to be called first to produce 1526: the VM instruction table, and later, after generating VM code, it has to 1527: be called again to execute the generated VM code (yes, this is ugly). 1528: In our example program, these two modes of calling the engine function 1529: are differentiated by the value of the parameter ip0 (if it equals 0, 1530: then the table is passed out, otherwise the VM code is executed); in our 1531: example, we pass the table out by assigning it to @samp{vm_prim} and 1532: returning from @samp{engine}. 1533: 1534: In our example (@file{vmgen-ex/engine.c}), we also build such a table for 1535: switch dispatch; this is mainly done for uniformity. 1536: 1537: For switch dispatch, we also need to define the VM instruction opcodes 1538: used as case labels in an @code{enum}. 1539: 1540: For both purposes (VM instruction table, and enum), the file 1541: @file{@var{name}-labels.i} is generated by Vmgen. You have to define 1542: the following macro used in this file: 1543: 1544: @table @code 1545: 1546: @findex INST_ADDR 1547: @item INST_ADDR(@var{inst_name}) 1548: For switch dispatch, this is just the name of the switch label (the same 1549: name as used in @samp{LABEL(@var{inst_name})}), for both uses of 1550: @file{@var{name}-labels.i}. For threaded-code dispatch, this is the 1551: address of the label defined in @samp{LABEL(@var{inst_name})}); the 1552: address is taken with @samp{&&} (@pxref{Labels as Values, , Labels as 1553: Values, gcc.info, GNU C Manual}). 1554: 1555: @end table 1556: 1557: 1558: @c -------------------------------------------------------------------- 1559: @node VM code generation, Peephole optimization, VM instruction table, Using the generated code 1560: @section VM code generation 1561: @cindex VM code generation 1562: @cindex code generation, VM 1563: @cindex @file{-gen.i} output file 1564: 1565: Vmgen generates VM code generation functions in @file{@var{name}-gen.i} 1566: that the front end can call to generate VM code. This is essential for 1567: an interpretive system. 1568: 1569: @findex gen_@var{inst} 1570: For a VM instruction @samp{x ( #a b #c -- d )}, Vmgen generates a 1571: function with the prototype 1572: 1573: @example 1574: void gen_x(Inst **ctp, a_type a, c_type c) 1575: @end example 1576: 1577: The @code{ctp} argument points to a pointer to the next instruction. 1578: @code{*ctp} is increased by the generation functions; i.e., you should 1579: allocate memory for the code to be generated beforehand, and start with 1580: *ctp set at the start of this memory area. Before running out of 1581: memory, allocate a new area, and generate a VM-level jump to the new 1582: area (this overflow handling is not implemented in our examples). 1583: 1584: @cindex immediate arguments, VM code generation 1585: The other arguments correspond to the immediate arguments of the VM 1586: instruction (with their appropriate types as defined in the 1587: @code{type_prefix} declaration. 1588: 1589: The following types, variables, and functions are used in 1590: @file{@var{name}-gen.i}: 1591: 1592: @table @code 1593: 1594: @findex Inst 1595: @item Inst 1596: The type of the VM instruction; if you use threaded code, this is 1597: @code{void *}; for switch dispatch this is an integer type. 1598: 1599: @cindex @code{vm_prim}, use 1600: @item vm_prim 1601: The VM instruction table (type: @code{Inst *}, @pxref{VM instruction table}). 1602: 1603: @findex gen_inst 1604: @item gen_inst(Inst **ctp, Inst i) 1605: This function compiles the instruction @code{i}. Take a look at it in 1606: @file{vmgen-ex/peephole.c}. It is trivial when you don't want to use 1607: superinstructions (just the last two lines of the example function), and 1608: slightly more complicated in the example due to its ability to use 1609: superinstructions (@pxref{Peephole optimization}). 1610: 1611: @findex genarg_@var{type_prefix} 1612: @item genarg_@var{type_prefix}(Inst **ctp, @var{type} @var{type_prefix}) 1613: This compiles an immediate argument of @var{type} (as defined in a 1614: @code{type-prefix} definition). These functions are trivial to define 1615: (see @file{vmgen-ex/support.c}). You need one of these functions for 1616: every type that you use as immediate argument. 1617: 1618: @end table 1619: 1620: @findex BB_BOUNDARY 1621: In addition to using these functions to generate code, you should call 1622: @code{BB_BOUNDARY} at every basic block entry point if you ever want to 1623: use superinstructions (or if you want to use the profiling supported by 1624: Vmgen; but this support is also useful mainly for selecting 1625: superinstructions). If you use @code{BB_BOUNDARY}, you should also 1626: define it (take a look at its definition in @file{vmgen-ex/mini.y}). 1627: 1628: You do not need to call @code{BB_BOUNDARY} after branches, because you 1629: will not define superinstructions that contain branches in the middle 1630: (and if you did, and it would work, there would be no reason to end the 1631: superinstruction at the branch), and because the branches announce 1632: themselves to the profiler. 1633: 1634: 1635: @c -------------------------------------------------------------------- 1636: @node Peephole optimization, VM disassembler, VM code generation, Using the generated code 1637: @section Peephole optimization 1638: @cindex peephole optimization 1639: @cindex superinstructions, generating 1640: @cindex @file{peephole.c} 1641: @cindex @file{-peephole.i} output file 1642: 1643: You need peephole optimization only if you want to use 1644: superinstructions. But having the code for it does not hurt much if you 1645: do not use superinstructions. 1646: 1647: A simple greedy peephole optimization algorithm is used for 1648: superinstruction selection: every time @code{gen_inst} compiles a VM 1649: instruction, it checks if it can combine it with the last VM instruction 1650: (which may also be a superinstruction resulting from a previous peephole 1651: optimization); if so, it changes the last instruction to the combined 1652: instruction instead of laying down @code{i} at the current @samp{*ctp}. 1653: 1654: The code for peephole optimization is in @file{vmgen-ex/peephole.c}. 1655: You can use this file almost verbatim. Vmgen generates 1656: @file{@var{file}-peephole.i} which contains data for the peephole 1657: optimizer. 1658: 1659: @findex init_peeptable 1660: You have to call @samp{init_peeptable()} after initializing 1661: @samp{vm_prim}, and before compiling any VM code to initialize data 1662: structures for peephole optimization. After that, compiling with the VM 1663: code generation functions will automatically combine VM instructions 1664: into superinstructions. Since you do not want to combine instructions 1665: across VM branch targets (otherwise there will not be a proper VM 1666: instruction to branch to), you have to call @code{BB_BOUNDARY} 1667: (@pxref{VM code generation}) at branch targets. 1668: 1669: 1670: @c -------------------------------------------------------------------- 1671: @node VM disassembler, VM profiler, Peephole optimization, Using the generated code 1672: @section VM disassembler 1673: @cindex VM disassembler 1674: @cindex disassembler, VM code 1675: @cindex @file{disasm.c} 1676: @cindex @file{-disasm.i} output file 1677: 1678: A VM code disassembler is optional for an interpretive system, but 1679: highly recommended during its development and maintenance, because it is 1680: very useful for detecting bugs in the front end (and for distinguishing 1681: them from VM interpreter bugs). 1682: 1683: Vmgen supports VM code disassembling by generating 1684: @file{@var{file}-disasm.i}. This code has to be wrapped into a 1685: function, as is done in @file{vmgen-ex/disasm.c}. You can use this file 1686: almost verbatim. In addition to @samp{vm_@var{A}2@var{B}(a,b)}, 1687: @samp{vm_out}, @samp{printarg_@var{type}(@var{value})}, which are 1688: explained above, the following macros and variables are used in 1689: @file{@var{file}-disasm.i} (and you have to define them): 1690: 1691: @table @code 1692: 1693: @item ip 1694: This variable points to the opcode of the current VM instruction. 1695: 1696: @cindex @code{IP}, @code{IPTOS} in disassmbler 1697: @item IP IPTOS 1698: @samp{IPTOS} is the first argument of the current VM instruction, and 1699: @samp{IP} points to it; this is just as in the engine, but here 1700: @samp{ip} points to the opcode of the VM instruction (in contrast to the 1701: engine, where @samp{ip} points to the next cell, or even one further). 1702: 1703: @findex VM_IS_INST 1704: @item VM_IS_INST(Inst i, int n) 1705: Tests if the opcode @samp{i} is the same as the @samp{n}th entry in the 1706: VM instruction table. 1707: 1708: @end table 1709: 1710: 1711: @c -------------------------------------------------------------------- 1712: @node VM profiler, , VM disassembler, Using the generated code 1713: @section VM profiler 1714: @cindex VM profiler 1715: @cindex profiling for selecting superinstructions 1716: @cindex superinstructions and profiling 1717: @cindex @file{profile.c} 1718: @cindex @file{-profile.i} output file 1719: 1720: The VM profiler is designed for getting execution and occurence counts 1721: for VM instruction sequences, and these counts can then be used for 1722: selecting sequences as superinstructions. The VM profiler is probably 1723: not useful as profiling tool for the interpretive system. I.e., the VM 1724: profiler is useful for the developers, but not the users of the 1725: interpretive system. 1726: 1727: The output of the profiler is: for each basic block (executed at least 1728: once), it produces the dynamic execution count of that basic block and 1729: all its subsequences; e.g., 1730: 1731: @example 1732: 9227465 lit storelocal 1733: 9227465 storelocal branch 1734: 9227465 lit storelocal branch 1735: @end example 1736: 1737: I.e., a basic block consisting of @samp{lit storelocal branch} is 1738: executed 9227465 times. 1739: 1740: @cindex @file{stat.awk} 1741: @cindex @file{seq2rule.awk} 1742: This output can be combined in various ways. E.g., 1743: @file{vmgen-ex/stat.awk} adds up the occurences of a given sequence wrt 1744: dynamic execution, static occurence, and per-program occurence. E.g., 1745: 1746: @example 1747: 2 16 36910041 loadlocal lit 1748: @end example 1749: 1750: @noindent 1751: indicates that the sequence @samp{loadlocal lit} occurs in 2 programs, 1752: in 16 places, and has been executed 36910041 times. Now you can select 1753: superinstructions in any way you like (note that compile time and space 1754: typically limit the number of superinstructions to 100--1000). After 1755: you have done that, @file{vmgen/seq2rule.awk} turns lines of the form 1756: above into rules for inclusion in a Vmgen input file. Note that this 1757: script does not ensure that all prefixes are defined, so you have to do 1758: that in other ways. So, an overall script for turning profiles into 1759: superinstructions can look like this: 1760: 1761: @example 1762: awk -f stat.awk fib.prof test.prof| 1763: awk '$3>=10000'| #select sequences 1764: fgrep -v -f peephole-blacklist| #eliminate wrong instructions 1765: awk -f seq2rule.awk| #turn into superinstructions 1766: sort -k 3 >mini-super.vmg #sort sequences 1767: @end example 1768: 1769: Here the dynamic count is used for selecting sequences (preliminary 1770: results indicate that the static count gives better results, though); 1771: the third line eliminates sequences containing instructions that must not 1772: occur in a superinstruction, because they access a stack directly. The 1773: dynamic count selection ensures that all subsequences (including 1774: prefixes) of longer sequences occur (because subsequences have at least 1775: the same count as the longer sequences); the sort in the last line 1776: ensures that longer superinstructions occur after their prefixes. 1777: 1778: But before using this, you have to have the profiler. Vmgen supports its 1779: creation by generating @file{@var{file}-profile.i}; you also need the 1780: wrapper file @file{vmgen-ex/profile.c} that you can use almost verbatim. 1781: 1782: @cindex @code{SUPER_END} in profiling 1783: @cindex @code{BB_BOUNDARY} in profiling 1784: The profiler works by recording the targets of all VM control flow 1785: changes (through @code{SUPER_END} during execution, and through 1786: @code{BB_BOUNDARY} in the front end), and counting (through 1787: @code{SUPER_END}) how often they were targeted. After the program run, 1788: the numbers are corrected such that each VM basic block has the correct 1789: count (entering a block without executing a branch does not increase the 1790: count, and the correction fixes that), then the subsequences of all 1791: basic blocks are printed. To get all this, you just have to define 1792: @code{SUPER_END} (and @code{BB_BOUNDARY}) appropriately, and call 1793: @code{vm_print_profile(FILE *file)} when you want to output the profile 1794: on @code{file}. 1795: 1796: @cindex @code{VM_IS_INST} in profiling 1797: The @file{@var{file}-profile.i} is similar to the disassembler file, and 1798: it uses variables and functions defined in @file{vmgen-ex/profile.c}, 1799: plus @code{VM_IS_INST} already defined for the VM disassembler 1800: (@pxref{VM disassembler}). 1801: 1802: @c ********************************************************** 1803: @node Hints, The future, Using the generated code, Top 1804: @chapter Hints 1805: @cindex hints 1806: 1807: @menu 1808: * Floating point:: and stacks 1809: @end menu 1810: 1811: @c -------------------------------------------------------------------- 1812: @node Floating point, , Hints, Hints 1813: @section Floating point 1814: 1815: How should you deal with floating point values? Should you use the same 1816: stack as for integers/pointers, or a different one? This section 1817: discusses this issue with a view on execution speed. 1818: 1819: The simpler approach is to use a separate floating-point stack. This 1820: allows you to choose FP value size without considering the size of the 1821: integers/pointers, and you avoid a number of performance problems. The 1822: main downside is that this needs an FP stack pointer (and that may not 1823: fit in the register file on the 386 arhitecture, costing some 1824: performance, but comparatively little if you take the other option into 1825: account). If you use a separate FP stack (with stack pointer @code{fp}), 1826: using an fpTOS is helpful on most machines, but some spill the fpTOS 1827: register into memory, and fpTOS should not be used there. 1828: 1829: The other approach is to share one stack (pointed to by, say, @code{sp}) 1830: between integer/pointer and floating-point values. This is ok if you do 1831: not use @code{spTOS}. If you do use @code{spTOS}, the compiler has to 1832: decide whether to put that variable into an integer or a floating point 1833: register, and the other type of operation becomes quite expensive on 1834: most machines (because moving values between integer and FP registers is 1835: quite expensive). If a value of one type has to be synthesized out of 1836: two values of the other type (@code{double} types), things are even more 1837: interesting. 1838: 1839: One way around this problem would be to not use the @code{spTOS} 1840: supported by Vmgen, but to use explicit top-of-stack variables (one for 1841: integers, one for FP values), and having a kind of accumulator+stack 1842: architecture (e.g., Ocaml bytecode uses this approach); however, this is 1843: a major change, and it's ramifications are not completely clear. 1844: 1845: @c ********************************************************** 1846: @node The future, Changes, Hints, Top 1847: @chapter The future 1848: @cindex future ideas 1849: 1850: We have a number of ideas for future versions of Vmgen. However, there 1851: are so many possible things to do that we would like some feedback from 1852: you. What are you doing with Vmgen, what features are you missing, and 1853: why? 1854: 1855: One idea we are thinking about is to generate just one @file{.c} file 1856: instead of letting you copy and adapt all the wrapper files (you would 1857: still have to define stuff like the type-specific macros, and stack 1858: pointers etc. somewhere). The advantage would be that, if we change the 1859: wrapper files between versions, you would not need to integrate your 1860: changes and our changes to them; Vmgen would also be easier to use for 1861: beginners. The main disadvantage of that is that it would reduce the 1862: flexibility of Vmgen a little (well, those who like flexibility could 1863: still patch the resulting @file{.c} file, like they are now doing for 1864: the wrapper files). In any case, if you are doing things to the wrapper 1865: files that would cause problems in a generated-@file{.c}-file approach, 1866: please let us know. 1867: 1868: @c ********************************************************** 1869: @node Changes, Contact, The future, Top 1870: @chapter Changes 1871: @cindex Changes from old versions 1872: 1873: User-visible changes between 0.5.9-20020822 and 0.5.9-20020901: 1874: 1875: The store optimization is now disabled by default, but can be enabled by 1876: the user (@pxref{Store Optimization}). Documentation for this 1877: optimization is also new. 1878: 1879: User-visible changes between 0.5.9-20010501 and 0.5.9-20020822: 1880: 1881: There is now a manual (in info, HTML, Postscript, or plain text format). 1882: 1883: There is the vmgen-ex2 variant of the vmgen-ex example; the new 1884: variant uses a union type instead of lots of casting. 1885: 1886: Both variants of the example can now be compiled with an ANSI C compiler 1887: (using switch dispatch and losing quite a bit of performance); tested 1888: with @command{lcc}. 1889: 1890: Users of the gforth-0.5.9-20010501 version of Vmgen need to change 1891: several things in their source code to use the current version. I 1892: recommend keeping the gforth-0.5.9-20010501 version until you have 1893: completed the change (note that you can have several versions of Gforth 1894: installed at the same time). I hope to avoid such incompatible changes 1895: in the future. 1896: 1897: The required changes are: 1898: 1899: @table @code 1900: 1901: @cindex @code{TAIL;}, changes 1902: @item TAIL; 1903: has been renamed into @code{INST_TAIL;} (less chance of an accidental 1904: match). 1905: 1906: @cindex @code{vm_@var{A}2@var{B}}, changes 1907: @item vm_@var{A}2@var{B} 1908: now takes two arguments. 1909: 1910: @cindex @code{vm_two@var{A}2@var{B}}, changes 1911: @item vm_two@var{A}2@var{B}(b,a1,a2); 1912: changed to vm_two@var{A}2@var{B}(a1,a2,b) (note the absence of the @samp{;}). 1913: 1914: @end table 1915: 1916: Also some new macros have to be defined, e.g., @code{INST_ADDR}, and 1917: @code{LABEL}; some macros have to be defined in new contexts, e.g., 1918: @code{VM_IS_INST} is now also needed in the disassembler. 1919: 1920: @c ********************************************************* 1921: @node Contact, Copying This Manual, Changes, Top 1922: @chapter Contact 1923: 1924: To report a bug, use 1925: @url{https://savannah.gnu.org/bugs/?func=addbug&group_id=2672}. 1926: 1927: For discussion on Vmgen (e.g., how to use it), use the mailing list 1928: @email{bug-vmgen@@mail.freesoftware.fsf.org} (use 1929: @url{http://mail.gnu.org/mailman/listinfo/help-vmgen} to subscribe). 1930: 1931: You can find vmgen information at 1932: @url{http://www.complang.tuwien.ac.at/anton/vmgen/}. 1933: 1934: @c *********************************************************** 1935: @node Copying This Manual, Index, Contact, Top 1936: @appendix Copying This Manual 1937: 1938: @menu 1939: * GNU Free Documentation License:: License for copying this manual. 1940: @end menu 1941: 1942: @include fdl.texi 1943: 1944: 1945: @node Index, , Copying This Manual, Top 1946: @unnumbered Index 1947: 1948: @printindex cp 1949: 1950: @bye