[gforth] / gforth / doc / gforth.ds  

gforth: gforth/doc/gforth.ds


1 : anton 1.1 \input texinfo @c -*-texinfo-*-
2 :     @comment The source is gforth.ds, from which gforth.texi is generated
3 : crook 1.28
4 : crook 1.21 @comment TODO: nac29jan99 - a list of things to add in the next edit:
5 : crook 1.28 @comment 1. x-ref all ambiguous or implementation-defined features?
6 :     @comment 2. Describe the use of Auser Avariable AConstant A, etc.
7 :     @comment 3. words in miscellaneous section need a home.
8 :     @comment 4. search for TODO for other minor and major works required.
9 :     @comment 5. [rats] change all @var to @i in Forth source so that info
10 :     @comment file looks decent.
11 : anton 1.36 @c Not an improvement IMO - anton
12 :     @c and anyway, this should be taken up
13 :     @c with Karl Berry (the texinfo guy) - anton
14 : crook 1.29 @comment .. would be useful to have a word that identified all deferred words
15 :     @comment should semantics stuff in intro be moved to another section
16 :    
17 : anton 1.66 @c POSTPONE, COMPILE, [COMPILE], LITERAL should have their own section
18 : crook 1.28
19 : anton 1.1 @comment %**start of header (This is for running Texinfo on a region.)
20 :     @setfilename gforth.info
21 :     @settitle Gforth Manual
22 : anton 1.108 @dircategory Software development
23 : anton 1.1 @direntry
24 :     * Gforth: (gforth). A fast interpreter for the Forth language.
25 :     @end direntry
26 : anton 1.49 @c The Texinfo manual also recommends doing this, but for Gforth it may
27 :     @c not make much sense
28 :     @c @dircategory Individual utilities
29 :     @c @direntry
30 :     @c * Gforth: (gforth)Invoking Gforth. gforth, gforth-fast, gforthmi
31 :     @c @end direntry
32 :    
33 : anton 1.1 @comment @setchapternewpage odd
34 : crook 1.29 @comment TODO this gets left in by HTML converter
35 : anton 1.12 @macro progstyle {}
36 :     Programming style note:
37 : anton 1.3 @end macro
38 : anton 1.48
39 :     @macro assignment {}
40 :     @table @i
41 :     @item Assignment:
42 :     @end macro
43 :     @macro endassignment {}
44 :     @end table
45 :     @end macro
46 :    
47 : anton 1.1 @comment %**end of header (This is for running Texinfo on a region.)
48 :    
49 : crook 1.29
50 :     @comment ----------------------------------------------------------
51 :     @comment macros for beautifying glossary entries
52 :     @comment if these are used, need to strip them out for HTML converter
53 :     @comment else they get repeated verbatim in HTML output.
54 :     @comment .. not working yet.
55 :    
56 :     @macro GLOSS-START {}
57 :     @iftex
58 :     @ninerm
59 :     @end iftex
60 :     @end macro
61 :    
62 :     @macro GLOSS-END {}
63 :     @iftex
64 :     @rm
65 :     @end iftex
66 :     @end macro
67 :    
68 :     @comment ----------------------------------------------------------
69 :    
70 :    
71 : anton 1.10 @include version.texi
72 :    
73 : anton 1.49 @ifnottex
74 : anton 1.11 This file documents Gforth @value{VERSION}
75 : anton 1.1
76 : anton 1.108 Copyright @copyright{} 1995, 1996, 1997, 1998, 2000, 2003 Free Software Foundation, Inc.
77 : anton 1.1
78 :     Permission is granted to make and distribute verbatim copies of
79 :     this manual provided the copyright notice and this permission notice
80 :     are preserved on all copies.
81 :    
82 :     @ignore
83 :     Permission is granted to process this file through TeX and print the
84 :     results, provided the printed document carries a copying permission
85 :     notice identical to this one except for the removal of this paragraph
86 :     (this paragraph not being relevant to the printed manual).
87 :    
88 :     @end ignore
89 :     Permission is granted to copy and distribute modified versions of this
90 :     manual under the conditions for verbatim copying, provided also that the
91 :     sections entitled "Distribution" and "General Public License" are
92 :     included exactly as in the original, and provided that the entire
93 :     resulting derived work is distributed under the terms of a permission
94 :     notice identical to this one.
95 :    
96 :     Permission is granted to copy and distribute translations of this manual
97 :     into another language, under the above conditions for modified versions,
98 :     except that the sections entitled "Distribution" and "General Public
99 :     License" may be included in a translation approved by the author instead
100 :     of in the original English.
101 : anton 1.49 @end ifnottex
102 : anton 1.1
103 :     @finalout
104 :     @titlepage
105 :     @sp 10
106 :     @center @titlefont{Gforth Manual}
107 :     @sp 2
108 : anton 1.11 @center for version @value{VERSION}
109 : anton 1.1 @sp 2
110 : anton 1.34 @center Neal Crook
111 : anton 1.1 @center Anton Ertl
112 : pazsan 1.6 @center Bernd Paysan
113 : anton 1.5 @center Jens Wilke
114 : anton 1.1 @sp 3
115 : crook 1.47 @center This manual is permanently under construction and was last updated on 15-Mar-2000
116 : anton 1.1
117 :     @comment The following two commands start the copyright page.
118 :     @page
119 :     @vskip 0pt plus 1filll
120 : anton 1.108 Copyright @copyright{} 1995, 1996, 1997, 1998, 2000, 2003 Free Software Foundation, Inc.
121 : anton 1.1
122 :     @comment !! Published by ... or You can get a copy of this manual ...
123 :    
124 :     Permission is granted to make and distribute verbatim copies of
125 :     this manual provided the copyright notice and this permission notice
126 :     are preserved on all copies.
127 :    
128 :     Permission is granted to copy and distribute modified versions of this
129 :     manual under the conditions for verbatim copying, provided also that the
130 :     sections entitled "Distribution" and "General Public License" are
131 :     included exactly as in the original, and provided that the entire
132 :     resulting derived work is distributed under the terms of a permission
133 :     notice identical to this one.
134 :    
135 :     Permission is granted to copy and distribute translations of this manual
136 :     into another language, under the above conditions for modified versions,
137 :     except that the sections entitled "Distribution" and "General Public
138 :     License" may be included in a translation approved by the author instead
139 :     of in the original English.
140 :     @end titlepage
141 :    
142 :     @node Top, License, (dir), (dir)
143 : anton 1.49 @ifnottex
144 : anton 1.1 Gforth is a free implementation of ANS Forth available on many
145 : anton 1.11 personal machines. This manual corresponds to version @value{VERSION}.
146 : anton 1.49 @end ifnottex
147 : anton 1.1
148 :     @menu
149 : crook 1.21 * License:: The GPL
150 : crook 1.26 * Goals:: About the Gforth Project
151 : crook 1.29 * Gforth Environment:: Starting (and exiting) Gforth
152 : anton 1.48 * Tutorial:: Hands-on Forth Tutorial
153 : crook 1.21 * Introduction:: An introduction to ANS Forth
154 : anton 1.1 * Words:: Forth words available in Gforth
155 : anton 1.24 * Error messages:: How to interpret them
156 : anton 1.1 * Tools:: Programming tools
157 :     * ANS conformance:: Implementation-defined options etc.
158 : anton 1.65 * Standard vs Extensions:: Should I use extensions?
159 : anton 1.1 * Model:: The abstract machine of Gforth
160 :     * Integrating Gforth:: Forth as scripting language for applications
161 :     * Emacs and Gforth:: The Gforth Mode
162 :     * Image Files:: @code{.fi} files contain compiled code
163 :     * Engine:: The inner interpreter and the primitives
164 : anton 1.24 * Binding to System Library::
165 : pazsan 1.13 * Cross Compiler:: The Cross Compiler
166 : anton 1.1 * Bugs:: How to report them
167 :     * Origin:: Authors and ancestors of Gforth
168 : crook 1.21 * Forth-related information:: Books and places to look on the WWW
169 : anton 1.1 * Word Index:: An item for each Forth word
170 :     * Concept Index:: A menu covering many topics
171 : anton 1.12
172 : anton 1.91 @detailmenu
173 :     --- The Detailed Node Listing ---
174 : anton 1.12
175 : crook 1.29 Gforth Environment
176 :    
177 : anton 1.32 * Invoking Gforth:: Getting in
178 :     * Leaving Gforth:: Getting out
179 :     * Command-line editing::
180 : anton 1.48 * Environment variables:: that affect how Gforth starts up
181 : anton 1.32 * Gforth Files:: What gets installed and where
182 : anton 1.48 * Startup speed:: When 35ms is not fast enough ...
183 :    
184 :     Forth Tutorial
185 :    
186 :     * Starting Gforth Tutorial::
187 :     * Syntax Tutorial::
188 :     * Crash Course Tutorial::
189 :     * Stack Tutorial::
190 :     * Arithmetics Tutorial::
191 :     * Stack Manipulation Tutorial::
192 :     * Using files for Forth code Tutorial::
193 :     * Comments Tutorial::
194 :     * Colon Definitions Tutorial::
195 :     * Decompilation Tutorial::
196 :     * Stack-Effect Comments Tutorial::
197 :     * Types Tutorial::
198 :     * Factoring Tutorial::
199 :     * Designing the stack effect Tutorial::
200 :     * Local Variables Tutorial::
201 :     * Conditional execution Tutorial::
202 :     * Flags and Comparisons Tutorial::
203 :     * General Loops Tutorial::
204 :     * Counted loops Tutorial::
205 :     * Recursion Tutorial::
206 :     * Leaving definitions or loops Tutorial::
207 :     * Return Stack Tutorial::
208 :     * Memory Tutorial::
209 :     * Characters and Strings Tutorial::
210 :     * Alignment Tutorial::
211 : anton 1.87 * Files Tutorial::
212 : anton 1.48 * Interpretation and Compilation Semantics and Immediacy Tutorial::
213 :     * Execution Tokens Tutorial::
214 :     * Exceptions Tutorial::
215 :     * Defining Words Tutorial::
216 :     * Arrays and Records Tutorial::
217 :     * POSTPONE Tutorial::
218 :     * Literal Tutorial::
219 :     * Advanced macros Tutorial::
220 :     * Compilation Tokens Tutorial::
221 :     * Wordlists and Search Order Tutorial::
222 : crook 1.29
223 : anton 1.24 An Introduction to ANS Forth
224 :    
225 : anton 1.67 * Introducing the Text Interpreter::
226 :     * Stacks and Postfix notation::
227 :     * Your first definition::
228 :     * How does that work?::
229 :     * Forth is written in Forth::
230 :     * Review - elements of a Forth system::
231 :     * Where to go next::
232 :     * Exercises::
233 : anton 1.24
234 : anton 1.12 Forth Words
235 :    
236 :     * Notation::
237 : anton 1.65 * Case insensitivity::
238 :     * Comments::
239 :     * Boolean Flags::
240 : anton 1.12 * Arithmetic::
241 :     * Stack Manipulation::
242 :     * Memory::
243 :     * Control Structures::
244 :     * Defining Words::
245 : anton 1.65 * Interpretation and Compilation Semantics::
246 : crook 1.47 * Tokens for Words::
247 : anton 1.81 * Compiling words::
248 : anton 1.65 * The Text Interpreter::
249 :     * Word Lists::
250 :     * Environmental Queries::
251 : anton 1.12 * Files::
252 :     * Blocks::
253 :     * Other I/O::
254 : anton 1.78 * Locals::
255 :     * Structures::
256 :     * Object-oriented Forth::
257 : anton 1.12 * Programming Tools::
258 :     * Assembler and Code Words::
259 :     * Threading Words::
260 : anton 1.65 * Passing Commands to the OS::
261 :     * Keeping track of Time::
262 :     * Miscellaneous Words::
263 : anton 1.12
264 :     Arithmetic
265 :    
266 :     * Single precision::
267 : anton 1.67 * Double precision:: Double-cell integer arithmetic
268 : anton 1.12 * Bitwise operations::
269 : anton 1.67 * Numeric comparison::
270 : anton 1.32 * Mixed precision:: Operations with single and double-cell integers
271 : anton 1.12 * Floating Point::
272 :    
273 :     Stack Manipulation
274 :    
275 :     * Data stack::
276 :     * Floating point stack::
277 :     * Return stack::
278 :     * Locals stack::
279 :     * Stack pointer manipulation::
280 :    
281 :     Memory
282 :    
283 : anton 1.32 * Memory model::
284 :     * Dictionary allocation::
285 :     * Heap Allocation::
286 :     * Memory Access::
287 :     * Address arithmetic::
288 :     * Memory Blocks::
289 : anton 1.12
290 :     Control Structures
291 :    
292 : anton 1.41 * Selection:: IF ... ELSE ... ENDIF
293 :     * Simple Loops:: BEGIN ...
294 : anton 1.32 * Counted Loops:: DO
295 : anton 1.67 * Arbitrary control structures::
296 :     * Calls and returns::
297 : anton 1.12 * Exception Handling::
298 :    
299 :     Defining Words
300 :    
301 : anton 1.67 * CREATE::
302 : crook 1.44 * Variables:: Variables and user variables
303 : anton 1.67 * Constants::
304 : crook 1.44 * Values:: Initialised variables
305 : anton 1.67 * Colon Definitions::
306 : crook 1.44 * Anonymous Definitions:: Definitions without names
307 : anton 1.71 * Supplying names:: Passing definition names as strings
308 : anton 1.67 * User-defined Defining Words::
309 : crook 1.44 * Deferred words:: Allow forward references
310 : anton 1.67 * Aliases::
311 : crook 1.47
312 : anton 1.63 User-defined Defining Words
313 :    
314 :     * CREATE..DOES> applications::
315 :     * CREATE..DOES> details::
316 :     * Advanced does> usage example::
317 : anton 1.91 * @code{Const-does>}::
318 : anton 1.63
319 : crook 1.47 Interpretation and Compilation Semantics
320 :    
321 : anton 1.67 * Combined words::
322 : anton 1.12
323 : anton 1.71 Tokens for Words
324 :    
325 :     * Execution token:: represents execution/interpretation semantics
326 :     * Compilation token:: represents compilation semantics
327 :     * Name token:: represents named words
328 :    
329 : anton 1.82 Compiling words
330 :    
331 :     * Literals:: Compiling data values
332 :     * Macros:: Compiling words
333 :    
334 : crook 1.21 The Text Interpreter
335 :    
336 : anton 1.67 * Input Sources::
337 :     * Number Conversion::
338 :     * Interpret/Compile states::
339 :     * Interpreter Directives::
340 : crook 1.21
341 : crook 1.26 Word Lists
342 :    
343 : anton 1.75 * Vocabularies::
344 : anton 1.67 * Why use word lists?::
345 : anton 1.75 * Word list example::
346 : crook 1.26
347 :     Files
348 :    
349 : anton 1.48 * Forth source files::
350 :     * General files::
351 :     * Search Paths::
352 :    
353 :     Search Paths
354 :    
355 : anton 1.75 * Source Search Paths::
356 : crook 1.26 * General Search Paths::
357 :    
358 :     Other I/O
359 :    
360 : anton 1.32 * Simple numeric output:: Predefined formats
361 :     * Formatted numeric output:: Formatted (pictured) output
362 :     * String Formats:: How Forth stores strings in memory
363 : anton 1.67 * Displaying characters and strings:: Other stuff
364 : anton 1.32 * Input:: Input
365 : crook 1.26
366 :     Locals
367 :    
368 :     * Gforth locals::
369 :     * ANS Forth locals::
370 :    
371 :     Gforth locals
372 :    
373 :     * Where are locals visible by name?::
374 :     * How long do locals live?::
375 : anton 1.78 * Locals programming style::
376 :     * Locals implementation::
377 : crook 1.26
378 : anton 1.12 Structures
379 :    
380 :     * Why explicit structure support?::
381 :     * Structure Usage::
382 :     * Structure Naming Convention::
383 :     * Structure Implementation::
384 :     * Structure Glossary::
385 :    
386 :     Object-oriented Forth
387 :    
388 : anton 1.48 * Why object-oriented programming?::
389 :     * Object-Oriented Terminology::
390 :     * Objects::
391 :     * OOF::
392 :     * Mini-OOF::
393 : crook 1.23 * Comparison with other object models::
394 : anton 1.12
395 : anton 1.24 The @file{objects.fs} model
396 : anton 1.12
397 :     * Properties of the Objects model::
398 :     * Basic Objects Usage::
399 : anton 1.41 * The Objects base class::
400 : anton 1.12 * Creating objects::
401 :     * Object-Oriented Programming Style::
402 :     * Class Binding::
403 :     * Method conveniences::
404 :     * Classes and Scoping::
405 : anton 1.41 * Dividing classes::
406 : anton 1.12 * Object Interfaces::
407 :     * Objects Implementation::
408 :     * Objects Glossary::
409 :    
410 : anton 1.24 The @file{oof.fs} model
411 : anton 1.12
412 : anton 1.67 * Properties of the OOF model::
413 :     * Basic OOF Usage::
414 :     * The OOF base class::
415 :     * Class Declaration::
416 :     * Class Implementation::
417 : anton 1.12
418 : anton 1.24 The @file{mini-oof.fs} model
419 : crook 1.23
420 : anton 1.48 * Basic Mini-OOF Usage::
421 :     * Mini-OOF Example::
422 :     * Mini-OOF Implementation::
423 : crook 1.23
424 : anton 1.78 Programming Tools
425 :    
426 :     * Examining::
427 :     * Forgetting words::
428 :     * Debugging:: Simple and quick.
429 :     * Assertions:: Making your programs self-checking.
430 :     * Singlestep Debugger:: Executing your program word by word.
431 :    
432 :     Assembler and Code Words
433 :    
434 :     * Code and ;code::
435 :     * Common Assembler:: Assembler Syntax
436 :     * Common Disassembler::
437 :     * 386 Assembler:: Deviations and special cases
438 :     * Alpha Assembler:: Deviations and special cases
439 :     * MIPS assembler:: Deviations and special cases
440 :     * Other assemblers:: How to write them
441 :    
442 : anton 1.12 Tools
443 :    
444 :     * ANS Report:: Report the words used, sorted by wordset.
445 :    
446 :     ANS conformance
447 :    
448 :     * The Core Words::
449 :     * The optional Block word set::
450 :     * The optional Double Number word set::
451 :     * The optional Exception word set::
452 :     * The optional Facility word set::
453 :     * The optional File-Access word set::
454 :     * The optional Floating-Point word set::
455 :     * The optional Locals word set::
456 :     * The optional Memory-Allocation word set::
457 :     * The optional Programming-Tools word set::
458 :     * The optional Search-Order word set::
459 :    
460 :     The Core Words
461 :    
462 :     * core-idef:: Implementation Defined Options
463 :     * core-ambcond:: Ambiguous Conditions
464 :     * core-other:: Other System Documentation
465 :    
466 :     The optional Block word set
467 :    
468 :     * block-idef:: Implementation Defined Options
469 :     * block-ambcond:: Ambiguous Conditions
470 :     * block-other:: Other System Documentation
471 :    
472 :     The optional Double Number word set
473 :    
474 :     * double-ambcond:: Ambiguous Conditions
475 :    
476 :     The optional Exception word set
477 :    
478 :     * exception-idef:: Implementation Defined Options
479 :    
480 :     The optional Facility word set
481 :    
482 :     * facility-idef:: Implementation Defined Options
483 :     * facility-ambcond:: Ambiguous Conditions
484 :    
485 :     The optional File-Access word set
486 :    
487 :     * file-idef:: Implementation Defined Options
488 :     * file-ambcond:: Ambiguous Conditions
489 :    
490 :     The optional Floating-Point word set
491 :    
492 :     * floating-idef:: Implementation Defined Options
493 :     * floating-ambcond:: Ambiguous Conditions
494 :    
495 :     The optional Locals word set
496 :    
497 :     * locals-idef:: Implementation Defined Options
498 :     * locals-ambcond:: Ambiguous Conditions
499 :    
500 :     The optional Memory-Allocation word set
501 :    
502 :     * memory-idef:: Implementation Defined Options
503 :    
504 :     The optional Programming-Tools word set
505 :    
506 :     * programming-idef:: Implementation Defined Options
507 :     * programming-ambcond:: Ambiguous Conditions
508 :    
509 :     The optional Search-Order word set
510 :    
511 :     * search-idef:: Implementation Defined Options
512 :     * search-ambcond:: Ambiguous Conditions
513 :    
514 : anton 1.109 Emacs and Gforth
515 :    
516 :     * Installing gforth.el:: Making Emacs aware of Forth.
517 :     * Emacs Tags:: Viewing the source of a word in Emacs.
518 :     * Hilighting:: Making Forth code look prettier.
519 :     * Auto-Indentation:: Customizing auto-indentation.
520 :     * Blocks Files:: Reading and writing blocks files.
521 :    
522 : anton 1.12 Image Files
523 :    
524 : anton 1.24 * Image Licensing Issues:: Distribution terms for images.
525 :     * Image File Background:: Why have image files?
526 : anton 1.67 * Non-Relocatable Image Files:: don't always work.
527 : anton 1.24 * Data-Relocatable Image Files:: are better.
528 : anton 1.67 * Fully Relocatable Image Files:: better yet.
529 : anton 1.24 * Stack and Dictionary Sizes:: Setting the default sizes for an image.
530 : anton 1.32 * Running Image Files:: @code{gforth -i @i{file}} or @i{file}.
531 : anton 1.24 * Modifying the Startup Sequence:: and turnkey applications.
532 : anton 1.12
533 :     Fully Relocatable Image Files
534 :    
535 : crook 1.27 * gforthmi:: The normal way
536 : anton 1.12 * cross.fs:: The hard way
537 :    
538 :     Engine
539 :    
540 :     * Portability::
541 :     * Threading::
542 :     * Primitives::
543 :     * Performance::
544 :    
545 :     Threading
546 :    
547 :     * Scheduling::
548 :     * Direct or Indirect Threaded?::
549 : anton 1.109 * Dynamic Superinstructions::
550 : anton 1.12 * DOES>::
551 :    
552 :     Primitives
553 :    
554 :     * Automatic Generation::
555 :     * TOS Optimization::
556 :     * Produced code::
557 : pazsan 1.13
558 :     Cross Compiler
559 :    
560 : anton 1.67 * Using the Cross Compiler::
561 :     * How the Cross Compiler Works::
562 : pazsan 1.13
563 : anton 1.24 @end detailmenu
564 : anton 1.1 @end menu
565 :    
566 : crook 1.26 @node License, Goals, Top, Top
567 : anton 1.1 @unnumbered GNU GENERAL PUBLIC LICENSE
568 :     @center Version 2, June 1991
569 :    
570 :     @display
571 :     Copyright @copyright{} 1989, 1991 Free Software Foundation, Inc.
572 : anton 1.88 59 Temple Place, Suite 330, Boston, MA 02111, USA
573 : anton 1.1
574 :     Everyone is permitted to copy and distribute verbatim copies
575 :     of this license document, but changing it is not allowed.
576 :     @end display
577 :    
578 :     @unnumberedsec Preamble
579 :    
580 :     The licenses for most software are designed to take away your
581 :     freedom to share and change it. By contrast, the GNU General Public
582 :     License is intended to guarantee your freedom to share and change free
583 :     software---to make sure the software is free for all its users. This
584 :     General Public License applies to most of the Free Software
585 :     Foundation's software and to any other program whose authors commit to
586 :     using it. (Some other Free Software Foundation software is covered by
587 :     the GNU Library General Public License instead.) You can apply it to
588 :     your programs, too.
589 :    
590 :     When we speak of free software, we are referring to freedom, not
591 :     price. Our General Public Licenses are designed to make sure that you
592 :     have the freedom to distribute copies of free software (and charge for
593 :     this service if you wish), that you receive source code or can get it
594 :     if you want it, that you can change the software or use pieces of it
595 :     in new free programs; and that you know you can do these things.
596 :    
597 :     To protect your rights, we need to make restrictions that forbid
598 :     anyone to deny you these rights or to ask you to surrender the rights.
599 :     These restrictions translate to certain responsibilities for you if you
600 :     distribute copies of the software, or if you modify it.
601 :    
602 :     For example, if you distribute copies of such a program, whether
603 :     gratis or for a fee, you must give the recipients all the rights that
604 :     you have. You must make sure that they, too, receive or can get the
605 :     source code. And you must show them these terms so they know their
606 :     rights.
607 :    
608 :     We protect your rights with two steps: (1) copyright the software, and
609 :     (2) offer you this license which gives you legal permission to copy,
610 :     distribute and/or modify the software.
611 :    
612 :     Also, for each author's protection and ours, we want to make certain
613 :     that everyone understands that there is no warranty for this free
614 :     software. If the software is modified by someone else and passed on, we
615 :     want its recipients to know that what they have is not the original, so
616 :     that any problems introduced by others will not reflect on the original
617 :     authors' reputations.
618 :    
619 :     Finally, any free program is threatened constantly by software
620 :     patents. We wish to avoid the danger that redistributors of a free
621 :     program will individually obtain patent licenses, in effect making the
622 :     program proprietary. To prevent this, we have made it clear that any
623 :     patent must be licensed for everyone's free use or not licensed at all.
624 :    
625 :     The precise terms and conditions for copying, distribution and
626 :     modification follow.
627 :    
628 :     @iftex
629 :     @unnumberedsec TERMS AND CONDITIONS FOR COPYING, DISTRIBUTION AND MODIFICATION
630 :     @end iftex
631 : anton 1.49 @ifnottex
632 : anton 1.1 @center TERMS AND CONDITIONS FOR COPYING, DISTRIBUTION AND MODIFICATION
633 : anton 1.49 @end ifnottex
634 : anton 1.1
635 :     @enumerate 0
636 :     @item
637 :     This License applies to any program or other work which contains
638 :     a notice placed by the copyright holder saying it may be distributed
639 :     under the terms of this General Public License. The ``Program'', below,
640 :     refers to any such program or work, and a ``work based on the Program''
641 :     means either the Program or any derivative work under copyright law:
642 :     that is to say, a work containing the Program or a portion of it,
643 :     either verbatim or with modifications and/or translated into another
644 :     language. (Hereinafter, translation is included without limitation in
645 :     the term ``modification''.) Each licensee is addressed as ``you''.
646 :    
647 :     Activities other than copying, distribution and modification are not
648 :     covered by this License; they are outside its scope. The act of
649 :     running the Program is not restricted, and the output from the Program
650 :     is covered only if its contents constitute a work based on the
651 :     Program (independent of having been made by running the Program).
652 :     Whether that is true depends on what the Program does.
653 :    
654 :     @item
655 :     You may copy and distribute verbatim copies of the Program's
656 :     source code as you receive it, in any medium, provided that you
657 :     conspicuously and appropriately publish on each copy an appropriate
658 :     copyright notice and disclaimer of warranty; keep intact all the
659 :     notices that refer to this License and to the absence of any warranty;
660 :     and give any other recipients of the Program a copy of this License
661 :     along with the Program.
662 :    
663 :     You may charge a fee for the physical act of transferring a copy, and
664 :     you may at your option offer warranty protection in exchange for a fee.
665 :    
666 :     @item
667 :     You may modify your copy or copies of the Program or any portion
668 :     of it, thus forming a work based on the Program, and copy and
669 :     distribute such modifications or work under the terms of Section 1
670 :     above, provided that you also meet all of these conditions:
671 :    
672 :     @enumerate a
673 :     @item
674 :     You must cause the modified files to carry prominent notices
675 :     stating that you changed the files and the date of any change.
676 :    
677 :     @item
678 :     You must cause any work that you distribute or publish, that in
679 :     whole or in part contains or is derived from the Program or any
680 :     part thereof, to be licensed as a whole at no charge to all third
681 :     parties under the terms of this License.
682 :    
683 :     @item
684 :     If the modified program normally reads commands interactively
685 :     when run, you must cause it, when started running for such
686 :     interactive use in the most ordinary way, to print or display an
687 :     announcement including an appropriate copyright notice and a
688 :     notice that there is no warranty (or else, saying that you provide
689 :     a warranty) and that users may redistribute the program under
690 :     these conditions, and telling the user how to view a copy of this
691 :     License. (Exception: if the Program itself is interactive but
692 :     does not normally print such an announcement, your work based on
693 :     the Program is not required to print an announcement.)
694 :     @end enumerate
695 :    
696 :     These requirements apply to the modified work as a whole. If
697 :     identifiable sections of that work are not derived from the Program,
698 :     and can be reasonably considered independent and separate works in
699 :     themselves, then this License, and its terms, do not apply to those
700 :     sections when you distribute them as separate works. But when you
701 :     distribute the same sections as part of a whole which is a work based
702 :     on the Program, the distribution of the whole must be on the terms of
703 :     this License, whose permissions for other licensees extend to the
704 :     entire whole, and thus to each and every part regardless of who wrote it.
705 :    
706 :     Thus, it is not the intent of this section to claim rights or contest
707 :     your rights to work written entirely by you; rather, the intent is to
708 :     exercise the right to control the distribution of derivative or
709 :     collective works based on the Program.
710 :    
711 :     In addition, mere aggregation of another work not based on the Program
712 :     with the Program (or with a work based on the Program) on a volume of
713 :     a storage or distribution medium does not bring the other work under
714 :     the scope of this License.
715 :    
716 :     @item
717 :     You may copy and distribute the Program (or a work based on it,
718 :     under Section 2) in object code or executable form under the terms of
719 :     Sections 1 and 2 above provided that you also do one of the following:
720 :    
721 :     @enumerate a
722 :     @item
723 :     Accompany it with the complete corresponding machine-readable
724 :     source code, which must be distributed under the terms of Sections
725 :     1 and 2 above on a medium customarily used for software interchange; or,
726 :    
727 :     @item
728 :     Accompany it with a written offer, valid for at least three
729 :     years, to give any third party, for a charge no more than your
730 :     cost of physically performing source distribution, a complete
731 :     machine-readable copy of the corresponding source code, to be
732 :     distributed under the terms of Sections 1 and 2 above on a medium
733 :     customarily used for software interchange; or,
734 :    
735 :     @item
736 :     Accompany it with the information you received as to the offer
737 :     to distribute corresponding source code. (This alternative is
738 :     allowed only for noncommercial distribution and only if you
739 :     received the program in object code or executable form with such
740 :     an offer, in accord with Subsection b above.)
741 :     @end enumerate
742 :    
743 :     The source code for a work means the preferred form of the work for
744 :     making modifications to it. For an executable work, complete source
745 :     code means all the source code for all modules it contains, plus any
746 :     associated interface definition files, plus the scripts used to
747 :     control compilation and installation of the executable. However, as a
748 :     special exception, the source code distributed need not include
749 :     anything that is normally distributed (in either source or binary
750 :     form) with the major components (compiler, kernel, and so on) of the
751 :     operating system on which the executable runs, unless that component
752 :     itself accompanies the executable.
753 :    
754 :     If distribution of executable or object code is made by offering
755 :     access to copy from a designated place, then offering equivalent
756 :     access to copy the source code from the same place counts as
757 :     distribution of the source code, even though third parties are not
758 :     compelled to copy the source along with the object code.
759 :    
760 :     @item
761 :     You may not copy, modify, sublicense, or distribute the Program
762 :     except as expressly provided under this License. Any attempt
763 :     otherwise to copy, modify, sublicense or distribute the Program is
764 :     void, and will automatically terminate your rights under this License.
765 :     However, parties who have received copies, or rights, from you under
766 :     this License will not have their licenses terminated so long as such
767 :     parties remain in full compliance.
768 :    
769 :     @item
770 :     You are not required to accept this License, since you have not
771 :     signed it. However, nothing else grants you permission to modify or
772 :     distribute the Program or its derivative works. These actions are
773 :     prohibited by law if you do not accept this License. Therefore, by
774 :     modifying or distributing the Program (or any work based on the
775 :     Program), you indicate your acceptance of this License to do so, and
776 :     all its terms and conditions for copying, distributing or modifying
777 :     the Program or works based on it.
778 :    
779 :     @item
780 :     Each time you redistribute the Program (or any work based on the
781 :     Program), the recipient automatically receives a license from the
782 :     original licensor to copy, distribute or modify the Program subject to
783 :     these terms and conditions. You may not impose any further
784 :     restrictions on the recipients' exercise of the rights granted herein.
785 :     You are not responsible for enforcing compliance by third parties to
786 :     this License.
787 :    
788 :     @item
789 :     If, as a consequence of a court judgment or allegation of patent
790 :     infringement or for any other reason (not limited to patent issues),
791 :     conditions are imposed on you (whether by court order, agreement or
792 :     otherwise) that contradict the conditions of this License, they do not
793 :     excuse you from the conditions of this License. If you cannot
794 :     distribute so as to satisfy simultaneously your obligations under this
795 :     License and any other pertinent obligations, then as a consequence you
796 :     may not distribute the Program at all. For example, if a patent
797 :     license would not permit royalty-free redistribution of the Program by
798 :     all those who receive copies directly or indirectly through you, then
799 :     the only way you could satisfy both it and this License would be to
800 :     refrain entirely from distribution of the Program.
801 :    
802 :     If any portion of this section is held invalid or unenforceable under
803 :     any particular circumstance, the balance of the section is intended to
804 :     apply and the section as a whole is intended to apply in other
805 :     circumstances.
806 :    
807 :     It is not the purpose of this section to induce you to infringe any
808 :     patents or other property right claims or to contest validity of any
809 :     such claims; this section has the sole purpose of protecting the
810 :     integrity of the free software distribution system, which is
811 :     implemented by public license practices. Many people have made
812 :     generous contributions to the wide range of software distributed
813 :     through that system in reliance on consistent application of that
814 :     system; it is up to the author/donor to decide if he or she is willing
815 :     to distribute software through any other system and a licensee cannot
816 :     impose that choice.
817 :    
818 :     This section is intended to make thoroughly clear what is believed to
819 :     be a consequence of the rest of this License.
820 :    
821 :     @item
822 :     If the distribution and/or use of the Program is restricted in
823 :     certain countries either by patents or by copyrighted interfaces, the
824 :     original copyright holder who places the Program under this License
825 :     may add an explicit geographical distribution limitation excluding
826 :     those countries, so that distribution is permitted only in or among
827 :     countries not thus excluded. In such case, this License incorporates
828 :     the limitation as if written in the body of this License.
829 :    
830 :     @item
831 :     The Free Software Foundation may publish revised and/or new versions
832 :     of the General Public License from time to time. Such new versions will
833 :     be similar in spirit to the present version, but may differ in detail to
834 :     address new problems or concerns.
835 :    
836 :     Each version is given a distinguishing version number. If the Program
837 :     specifies a version number of this License which applies to it and ``any
838 :     later version'', you have the option of following the terms and conditions
839 :     either of that version or of any later version published by the Free
840 :     Software Foundation. If the Program does not specify a version number of
841 :     this License, you may choose any version ever published by the Free Software
842 :     Foundation.
843 :    
844 :     @item
845 :     If you wish to incorporate parts of the Program into other free
846 :     programs whose distribution conditions are different, write to the author
847 :     to ask for permission. For software which is copyrighted by the Free
848 :     Software Foundation, write to the Free Software Foundation; we sometimes
849 :     make exceptions for this. Our decision will be guided by the two goals
850 :     of preserving the free status of all derivatives of our free software and
851 :     of promoting the sharing and reuse of software generally.
852 :    
853 :     @iftex
854 :     @heading NO WARRANTY
855 :     @end iftex
856 : anton 1.49 @ifnottex
857 : anton 1.1 @center NO WARRANTY
858 : anton 1.49 @end ifnottex
859 : anton 1.1
860 :     @item
861 :     BECAUSE THE PROGRAM IS LICENSED FREE OF CHARGE, THERE IS NO WARRANTY
862 :     FOR THE PROGRAM, TO THE EXTENT PERMITTED BY APPLICABLE LAW. EXCEPT WHEN
863 :     OTHERWISE STATED IN WRITING THE COPYRIGHT HOLDERS AND/OR OTHER PARTIES
864 :     PROVIDE THE PROGRAM ``AS IS'' WITHOUT WARRANTY OF ANY KIND, EITHER EXPRESSED
865 :     OR IMPLIED, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES OF
866 :     MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE. THE ENTIRE RISK AS
867 :     TO THE QUALITY AND PERFORMANCE OF THE PROGRAM IS WITH YOU. SHOULD THE
868 :     PROGRAM PROVE DEFECTIVE, YOU ASSUME THE COST OF ALL NECESSARY SERVICING,
869 :     REPAIR OR CORRECTION.
870 :    
871 :     @item
872 :     IN NO EVENT UNLESS REQUIRED BY APPLICABLE LAW OR AGREED TO IN WRITING
873 :     WILL ANY COPYRIGHT HOLDER, OR ANY OTHER PARTY WHO MAY MODIFY AND/OR
874 :     REDISTRIBUTE THE PROGRAM AS PERMITTED ABOVE, BE LIABLE TO YOU FOR DAMAGES,
875 :     INCLUDING ANY GENERAL, SPECIAL, INCIDENTAL OR CONSEQUENTIAL DAMAGES ARISING
876 :     OUT OF THE USE OR INABILITY TO USE THE PROGRAM (INCLUDING BUT NOT LIMITED
877 :     TO LOSS OF DATA OR DATA BEING RENDERED INACCURATE OR LOSSES SUSTAINED BY
878 :     YOU OR THIRD PARTIES OR A FAILURE OF THE PROGRAM TO OPERATE WITH ANY OTHER
879 :     PROGRAMS), EVEN IF SUCH HOLDER OR OTHER PARTY HAS BEEN ADVISED OF THE
880 :     POSSIBILITY OF SUCH DAMAGES.
881 :     @end enumerate
882 :    
883 :     @iftex
884 :     @heading END OF TERMS AND CONDITIONS
885 :     @end iftex
886 : anton 1.49 @ifnottex
887 : anton 1.1 @center END OF TERMS AND CONDITIONS
888 : anton 1.49 @end ifnottex
889 : anton 1.1
890 :     @page
891 :     @unnumberedsec How to Apply These Terms to Your New Programs
892 :    
893 :     If you develop a new program, and you want it to be of the greatest
894 :     possible use to the public, the best way to achieve this is to make it
895 :     free software which everyone can redistribute and change under these terms.
896 :    
897 :     To do so, attach the following notices to the program. It is safest
898 :     to attach them to the start of each source file to most effectively
899 :     convey the exclusion of warranty; and each file should have at least
900 :     the ``copyright'' line and a pointer to where the full notice is found.
901 :    
902 :     @smallexample
903 :     @var{one line to give the program's name and a brief idea of what it does.}
904 :     Copyright (C) 19@var{yy} @var{name of author}
905 :    
906 :     This program is free software; you can redistribute it and/or modify
907 :     it under the terms of the GNU General Public License as published by
908 :     the Free Software Foundation; either version 2 of the License, or
909 :     (at your option) any later version.
910 :    
911 :     This program is distributed in the hope that it will be useful,
912 :     but WITHOUT ANY WARRANTY; without even the implied warranty of
913 :     MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
914 :     GNU General Public License for more details.
915 :    
916 :     You should have received a copy of the GNU General Public License
917 :     along with this program; if not, write to the Free Software
918 : anton 1.88 Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111, USA.
919 : anton 1.1 @end smallexample
920 :    
921 :     Also add information on how to contact you by electronic and paper mail.
922 :    
923 :     If the program is interactive, make it output a short notice like this
924 :     when it starts in an interactive mode:
925 :    
926 :     @smallexample
927 :     Gnomovision version 69, Copyright (C) 19@var{yy} @var{name of author}
928 :     Gnomovision comes with ABSOLUTELY NO WARRANTY; for details
929 :     type `show w'.
930 :     This is free software, and you are welcome to redistribute it
931 :     under certain conditions; type `show c' for details.
932 :     @end smallexample
933 :    
934 :     The hypothetical commands @samp{show w} and @samp{show c} should show
935 :     the appropriate parts of the General Public License. Of course, the
936 :     commands you use may be called something other than @samp{show w} and
937 :     @samp{show c}; they could even be mouse-clicks or menu items---whatever
938 :     suits your program.
939 :    
940 :     You should also get your employer (if you work as a programmer) or your
941 :     school, if any, to sign a ``copyright disclaimer'' for the program, if
942 :     necessary. Here is a sample; alter the names:
943 :    
944 :     @smallexample
945 :     Yoyodyne, Inc., hereby disclaims all copyright interest in the program
946 :     `Gnomovision' (which makes passes at compilers) written by James Hacker.
947 :    
948 :     @var{signature of Ty Coon}, 1 April 1989
949 :     Ty Coon, President of Vice
950 :     @end smallexample
951 :    
952 :     This General Public License does not permit incorporating your program into
953 :     proprietary programs. If your program is a subroutine library, you may
954 :     consider it more useful to permit linking proprietary applications with the
955 :     library. If this is what you want to do, use the GNU Library General
956 :     Public License instead of this License.
957 :    
958 :     @iftex
959 :     @unnumbered Preface
960 :     @cindex Preface
961 : crook 1.21 This manual documents Gforth. Some introductory material is provided for
962 :     readers who are unfamiliar with Forth or who are migrating to Gforth
963 :     from other Forth compilers. However, this manual is primarily a
964 :     reference manual.
965 : anton 1.1 @end iftex
966 :    
967 : crook 1.28 @comment TODO much more blurb here.
968 : crook 1.26
969 :     @c ******************************************************************
970 : crook 1.29 @node Goals, Gforth Environment, License, Top
971 : crook 1.26 @comment node-name, next, previous, up
972 :     @chapter Goals of Gforth
973 :     @cindex goals of the Gforth project
974 :     The goal of the Gforth Project is to develop a standard model for
975 :     ANS Forth. This can be split into several subgoals:
976 :    
977 :     @itemize @bullet
978 :     @item
979 :     Gforth should conform to the ANS Forth Standard.
980 :     @item
981 :     It should be a model, i.e. it should define all the
982 :     implementation-dependent things.
983 :     @item
984 :     It should become standard, i.e. widely accepted and used. This goal
985 :     is the most difficult one.
986 :     @end itemize
987 :    
988 :     To achieve these goals Gforth should be
989 :     @itemize @bullet
990 :     @item
991 :     Similar to previous models (fig-Forth, F83)
992 :     @item
993 :     Powerful. It should provide for all the things that are considered
994 :     necessary today and even some that are not yet considered necessary.
995 :     @item
996 :     Efficient. It should not get the reputation of being exceptionally
997 :     slow.
998 :     @item
999 :     Free.
1000 :     @item
1001 :     Available on many machines/easy to port.
1002 :     @end itemize
1003 :    
1004 :     Have we achieved these goals? Gforth conforms to the ANS Forth
1005 :     standard. It may be considered a model, but we have not yet documented
1006 :     which parts of the model are stable and which parts we are likely to
1007 :     change. It certainly has not yet become a de facto standard, but it
1008 :     appears to be quite popular. It has some similarities to and some
1009 :     differences from previous models. It has some powerful features, but not
1010 :     yet everything that we envisioned. We certainly have achieved our
1011 : anton 1.65 execution speed goals (@pxref{Performance})@footnote{However, in 1998
1012 :     the bar was raised when the major commercial Forth vendors switched to
1013 :     native code compilers.}. It is free and available on many machines.
1014 : crook 1.29
1015 : crook 1.26 @c ******************************************************************
1016 : anton 1.48 @node Gforth Environment, Tutorial, Goals, Top
1017 : crook 1.29 @chapter Gforth Environment
1018 :     @cindex Gforth environment
1019 : crook 1.21
1020 : crook 1.45 Note: ultimately, the Gforth man page will be auto-generated from the
1021 : crook 1.29 material in this chapter.
1022 : crook 1.21
1023 :     @menu
1024 : crook 1.29 * Invoking Gforth:: Getting in
1025 :     * Leaving Gforth:: Getting out
1026 :     * Command-line editing::
1027 : anton 1.48 * Environment variables:: that affect how Gforth starts up
1028 : crook 1.29 * Gforth Files:: What gets installed and where
1029 : anton 1.48 * Startup speed:: When 35ms is not fast enough ...
1030 : crook 1.21 @end menu
1031 :    
1032 : anton 1.49 For related information about the creation of images see @ref{Image Files}.
1033 : crook 1.29
1034 : crook 1.21 @comment ----------------------------------------------
1035 : anton 1.48 @node Invoking Gforth, Leaving Gforth, Gforth Environment, Gforth Environment
1036 : crook 1.29 @section Invoking Gforth
1037 :     @cindex invoking Gforth
1038 :     @cindex running Gforth
1039 :     @cindex command-line options
1040 :     @cindex options on the command line
1041 :     @cindex flags on the command line
1042 : crook 1.21
1043 : anton 1.30 Gforth is made up of two parts; an executable ``engine'' (named
1044 : anton 1.109 @command{gforth} or @command{gforth-fast}) and an image file. To start it, you
1045 : anton 1.30 will usually just say @code{gforth} -- this automatically loads the
1046 :     default image file @file{gforth.fi}. In many other cases the default
1047 :     Gforth image will be invoked like this:
1048 : crook 1.21 @example
1049 : anton 1.30 gforth [file | -e forth-code] ...
1050 : crook 1.21 @end example
1051 : crook 1.29 @noindent
1052 :     This interprets the contents of the files and the Forth code in the order they
1053 :     are given.
1054 : crook 1.21
1055 : anton 1.109 In addition to the @command{gforth} engine, there is also an engine
1056 :     called @command{gforth-fast}, which is faster, but gives less
1057 :     informative error messages (@pxref{Error messages}) and may catch some
1058 :     stack underflows later or not at all. You should use it for debugged,
1059 :     performance-critical programs.
1060 :    
1061 :     Moreover, there is an engine called @command{gforth-itc}, which is
1062 :     useful in some backwards-compatibility situations (@pxref{Direct or
1063 :     Indirect Threaded?}).
1064 : anton 1.30
1065 : crook 1.29 In general, the command line looks like this:
1066 : crook 1.21
1067 :     @example
1068 : anton 1.30 gforth[-fast] [engine options] [image options]
1069 : crook 1.21 @end example
1070 :    
1071 : anton 1.30 The engine options must come before the rest of the command
1072 : crook 1.29 line. They are:
1073 : crook 1.26
1074 : crook 1.29 @table @code
1075 :     @cindex -i, command-line option
1076 :     @cindex --image-file, command-line option
1077 :     @item --image-file @i{file}
1078 :     @itemx -i @i{file}
1079 :     Loads the Forth image @i{file} instead of the default
1080 :     @file{gforth.fi} (@pxref{Image Files}).
1081 : crook 1.21
1082 : anton 1.39 @cindex --appl-image, command-line option
1083 :     @item --appl-image @i{file}
1084 :     Loads the image @i{file} and leaves all further command-line arguments
1085 : anton 1.65 to the image (instead of processing them as engine options). This is
1086 :     useful for building executable application images on Unix, built with
1087 : anton 1.39 @code{gforthmi --application ...}.
1088 :    
1089 : crook 1.29 @cindex --path, command-line option
1090 :     @cindex -p, command-line option
1091 :     @item --path @i{path}
1092 :     @itemx -p @i{path}
1093 :     Uses @i{path} for searching the image file and Forth source code files
1094 :     instead of the default in the environment variable @code{GFORTHPATH} or
1095 :     the path specified at installation time (e.g.,
1096 :     @file{/usr/local/share/gforth/0.2.0:.}). A path is given as a list of
1097 :     directories, separated by @samp{:} (on Unix) or @samp{;} (on other OSs).
1098 : crook 1.21
1099 : crook 1.29 @cindex --dictionary-size, command-line option
1100 :     @cindex -m, command-line option
1101 :     @cindex @i{size} parameters for command-line options
1102 :     @cindex size of the dictionary and the stacks
1103 :     @item --dictionary-size @i{size}
1104 :     @itemx -m @i{size}
1105 :     Allocate @i{size} space for the Forth dictionary space instead of
1106 :     using the default specified in the image (typically 256K). The
1107 :     @i{size} specification for this and subsequent options consists of
1108 :     an integer and a unit (e.g.,
1109 :     @code{4M}). The unit can be one of @code{b} (bytes), @code{e} (element
1110 :     size, in this case Cells), @code{k} (kilobytes), @code{M} (Megabytes),
1111 :     @code{G} (Gigabytes), and @code{T} (Terabytes). If no unit is specified,
1112 :     @code{e} is used.
1113 : crook 1.21
1114 : crook 1.29 @cindex --data-stack-size, command-line option
1115 :     @cindex -d, command-line option
1116 :     @item --data-stack-size @i{size}
1117 :     @itemx -d @i{size}
1118 :     Allocate @i{size} space for the data stack instead of using the
1119 :     default specified in the image (typically 16K).
1120 : crook 1.21
1121 : crook 1.29 @cindex --return-stack-size, command-line option
1122 :     @cindex -r, command-line option
1123 :     @item --return-stack-size @i{size}
1124 :     @itemx -r @i{size}
1125 :     Allocate @i{size} space for the return stack instead of using the
1126 :     default specified in the image (typically 15K).
1127 : crook 1.21
1128 : crook 1.29 @cindex --fp-stack-size, command-line option
1129 :     @cindex -f, command-line option
1130 :     @item --fp-stack-size @i{size}
1131 :     @itemx -f @i{size}
1132 :     Allocate @i{size} space for the floating point stack instead of
1133 :     using the default specified in the image (typically 15.5K). In this case
1134 :     the unit specifier @code{e} refers to floating point numbers.
1135 : crook 1.21
1136 : anton 1.48 @cindex --locals-stack-size, command-line option
1137 :     @cindex -l, command-line option
1138 :     @item --locals-stack-size @i{size}
1139 :     @itemx -l @i{size}
1140 :     Allocate @i{size} space for the locals stack instead of using the
1141 :     default specified in the image (typically 14.5K).
1142 :    
1143 :     @cindex -h, command-line option
1144 :     @cindex --help, command-line option
1145 :     @item --help
1146 :     @itemx -h
1147 :     Print a message about the command-line options
1148 :    
1149 :     @cindex -v, command-line option
1150 :     @cindex --version, command-line option
1151 :     @item --version
1152 :     @itemx -v
1153 :     Print version and exit
1154 :    
1155 :     @cindex --debug, command-line option
1156 :     @item --debug
1157 :     Print some information useful for debugging on startup.
1158 :    
1159 :     @cindex --offset-image, command-line option
1160 :     @item --offset-image
1161 :     Start the dictionary at a slightly different position than would be used
1162 :     otherwise (useful for creating data-relocatable images,
1163 :     @pxref{Data-Relocatable Image Files}).
1164 :    
1165 :     @cindex --no-offset-im, command-line option
1166 :     @item --no-offset-im
1167 :     Start the dictionary at the normal position.
1168 :    
1169 :     @cindex --clear-dictionary, command-line option
1170 :     @item --clear-dictionary
1171 :     Initialize all bytes in the dictionary to 0 before loading the image
1172 :     (@pxref{Data-Relocatable Image Files}).
1173 :    
1174 :     @cindex --die-on-signal, command-line-option
1175 :     @item --die-on-signal
1176 :     Normally Gforth handles most signals (e.g., the user interrupt SIGINT,
1177 :     or the segmentation violation SIGSEGV) by translating it into a Forth
1178 :     @code{THROW}. With this option, Gforth exits if it receives such a
1179 :     signal. This option is useful when the engine and/or the image might be
1180 :     severely broken (such that it causes another signal before recovering
1181 :     from the first); this option avoids endless loops in such cases.
1182 : anton 1.109
1183 :     @item --no-dynamic
1184 :     @item --dynamic
1185 :     Disable or enable dynamic superinstructions with replication
1186 :     (@pxref{Dynamic Superinstructions}).
1187 :    
1188 :     @item --no-super
1189 : anton 1.110 Disable dynamic superinstructions, use just dynamic replication; this is
1190 :     useful if you want to patch threaded code (@pxref{Dynamic
1191 :     Superinstructions}).
1192 : anton 1.109
1193 : anton 1.48 @end table
1194 :    
1195 :     @cindex loading files at startup
1196 :     @cindex executing code on startup
1197 :     @cindex batch processing with Gforth
1198 :     As explained above, the image-specific command-line arguments for the
1199 :     default image @file{gforth.fi} consist of a sequence of filenames and
1200 :     @code{-e @var{forth-code}} options that are interpreted in the sequence
1201 :     in which they are given. The @code{-e @var{forth-code}} or
1202 :     @code{--evaluate @var{forth-code}} option evaluates the Forth
1203 :     code. This option takes only one argument; if you want to evaluate more
1204 :     Forth words, you have to quote them or use @code{-e} several times. To exit
1205 :     after processing the command line (instead of entering interactive mode)
1206 :     append @code{-e bye} to the command line.
1207 :    
1208 :     @cindex versions, invoking other versions of Gforth
1209 :     If you have several versions of Gforth installed, @code{gforth} will
1210 :     invoke the version that was installed last. @code{gforth-@i{version}}
1211 :     invokes a specific version. If your environment contains the variable
1212 :     @code{GFORTHPATH}, you may want to override it by using the
1213 :     @code{--path} option.
1214 :    
1215 :     Not yet implemented:
1216 :     On startup the system first executes the system initialization file
1217 :     (unless the option @code{--no-init-file} is given; note that the system
1218 :     resulting from using this option may not be ANS Forth conformant). Then
1219 :     the user initialization file @file{.gforth.fs} is executed, unless the
1220 : crook 1.62 option @code{--no-rc} is given; this file is searched for in @file{.},
1221 : anton 1.48 then in @file{~}, then in the normal path (see above).
1222 :    
1223 :    
1224 :    
1225 :     @comment ----------------------------------------------
1226 :     @node Leaving Gforth, Command-line editing, Invoking Gforth, Gforth Environment
1227 :     @section Leaving Gforth
1228 :     @cindex Gforth - leaving
1229 :     @cindex leaving Gforth
1230 :    
1231 :     You can leave Gforth by typing @code{bye} or @kbd{Ctrl-d} (at the start
1232 :     of a line) or (if you invoked Gforth with the @code{--die-on-signal}
1233 :     option) @kbd{Ctrl-c}. When you leave Gforth, all of your definitions and
1234 : anton 1.49 data are discarded. For ways of saving the state of the system before
1235 :     leaving Gforth see @ref{Image Files}.
1236 : anton 1.48
1237 :     doc-bye
1238 :    
1239 :    
1240 :     @comment ----------------------------------------------
1241 : anton 1.65 @node Command-line editing, Environment variables, Leaving Gforth, Gforth Environment
1242 : anton 1.48 @section Command-line editing
1243 :     @cindex command-line editing
1244 :    
1245 :     Gforth maintains a history file that records every line that you type to
1246 :     the text interpreter. This file is preserved between sessions, and is
1247 :     used to provide a command-line recall facility; if you type @kbd{Ctrl-P}
1248 :     repeatedly you can recall successively older commands from this (or
1249 :     previous) session(s). The full list of command-line editing facilities is:
1250 :    
1251 :     @itemize @bullet
1252 :     @item
1253 :     @kbd{Ctrl-p} (``previous'') (or up-arrow) to recall successively older
1254 :     commands from the history buffer.
1255 :     @item
1256 :     @kbd{Ctrl-n} (``next'') (or down-arrow) to recall successively newer commands
1257 :     from the history buffer.
1258 :     @item
1259 :     @kbd{Ctrl-f} (or right-arrow) to move the cursor right, non-destructively.
1260 :     @item
1261 :     @kbd{Ctrl-b} (or left-arrow) to move the cursor left, non-destructively.
1262 :     @item
1263 :     @kbd{Ctrl-h} (backspace) to delete the character to the left of the cursor,
1264 :     closing up the line.
1265 :     @item
1266 :     @kbd{Ctrl-k} to delete (``kill'') from the cursor to the end of the line.
1267 :     @item
1268 :     @kbd{Ctrl-a} to move the cursor to the start of the line.
1269 :     @item
1270 :     @kbd{Ctrl-e} to move the cursor to the end of the line.
1271 :     @item
1272 :     @key{RET} (@kbd{Ctrl-m}) or @key{LFD} (@kbd{Ctrl-j}) to submit the current
1273 :     line.
1274 :     @item
1275 :     @key{TAB} to step through all possible full-word completions of the word
1276 :     currently being typed.
1277 :     @item
1278 : anton 1.65 @kbd{Ctrl-d} on an empty line line to terminate Gforth (gracefully,
1279 :     using @code{bye}).
1280 :     @item
1281 :     @kbd{Ctrl-x} (or @code{Ctrl-d} on a non-empty line) to delete the
1282 :     character under the cursor.
1283 : anton 1.48 @end itemize
1284 :    
1285 :     When editing, displayable characters are inserted to the left of the
1286 :     cursor position; the line is always in ``insert'' (as opposed to
1287 :     ``overstrike'') mode.
1288 :    
1289 :     @cindex history file
1290 :     @cindex @file{.gforth-history}
1291 :     On Unix systems, the history file is @file{~/.gforth-history} by
1292 :     default@footnote{i.e. it is stored in the user's home directory.}. You
1293 :     can find out the name and location of your history file using:
1294 :    
1295 :     @example
1296 :     history-file type \ Unix-class systems
1297 :    
1298 :     history-file type \ Other systems
1299 :     history-dir type
1300 :     @end example
1301 :    
1302 :     If you enter long definitions by hand, you can use a text editor to
1303 :     paste them out of the history file into a Forth source file for reuse at
1304 :     a later time.
1305 :    
1306 :     Gforth never trims the size of the history file, so you should do this
1307 :     periodically, if necessary.
1308 :    
1309 :     @comment this is all defined in history.fs
1310 :     @comment NAC TODO the ctrl-D behaviour can either do a bye or a beep.. how is that option
1311 :     @comment chosen?
1312 :    
1313 :    
1314 :     @comment ----------------------------------------------
1315 : anton 1.65 @node Environment variables, Gforth Files, Command-line editing, Gforth Environment
1316 : anton 1.48 @section Environment variables
1317 :     @cindex environment variables
1318 :    
1319 :     Gforth uses these environment variables:
1320 :    
1321 :     @itemize @bullet
1322 :     @item
1323 :     @cindex @code{GFORTHHIST} -- environment variable
1324 :     @code{GFORTHHIST} -- (Unix systems only) specifies the directory in which to
1325 :     open/create the history file, @file{.gforth-history}. Default:
1326 :     @code{$HOME}.
1327 :    
1328 :     @item
1329 :     @cindex @code{GFORTHPATH} -- environment variable
1330 :     @code{GFORTHPATH} -- specifies the path used when searching for the gforth image file and
1331 :     for Forth source-code files.
1332 :    
1333 :     @item
1334 :     @cindex @code{GFORTH} -- environment variable
1335 : anton 1.49 @code{GFORTH} -- used by @file{gforthmi}, @xref{gforthmi}.
1336 : anton 1.48
1337 :     @item
1338 :     @cindex @code{GFORTHD} -- environment variable
1339 : crook 1.62 @code{GFORTHD} -- used by @file{gforthmi}, @xref{gforthmi}.
1340 : anton 1.48
1341 :     @item
1342 :     @cindex @code{TMP}, @code{TEMP} - environment variable
1343 :     @code{TMP}, @code{TEMP} - (non-Unix systems only) used as a potential
1344 :     location for the history file.
1345 :     @end itemize
1346 :    
1347 :     @comment also POSIXELY_CORRECT LINES COLUMNS HOME but no interest in
1348 :     @comment mentioning these.
1349 :    
1350 :     All the Gforth environment variables default to sensible values if they
1351 :     are not set.
1352 :    
1353 :    
1354 :     @comment ----------------------------------------------
1355 :     @node Gforth Files, Startup speed, Environment variables, Gforth Environment
1356 :     @section Gforth files
1357 :     @cindex Gforth files
1358 :    
1359 :     When you install Gforth on a Unix system, it installs files in these
1360 :     locations by default:
1361 :    
1362 :     @itemize @bullet
1363 :     @item
1364 :     @file{/usr/local/bin/gforth}
1365 :     @item
1366 :     @file{/usr/local/bin/gforthmi}
1367 :     @item
1368 :     @file{/usr/local/man/man1/gforth.1} - man page.
1369 :     @item
1370 :     @file{/usr/local/info} - the Info version of this manual.
1371 :     @item
1372 :     @file{/usr/local/lib/gforth/<version>/...} - Gforth @file{.fi} files.
1373 :     @item
1374 :     @file{/usr/local/share/gforth/<version>/TAGS} - Emacs TAGS file.
1375 :     @item
1376 :     @file{/usr/local/share/gforth/<version>/...} - Gforth source files.
1377 :     @item
1378 :     @file{.../emacs/site-lisp/gforth.el} - Emacs gforth mode.
1379 :     @end itemize
1380 :    
1381 :     You can select different places for installation by using
1382 :     @code{configure} options (listed with @code{configure --help}).
1383 :    
1384 :     @comment ----------------------------------------------
1385 :     @node Startup speed, , Gforth Files, Gforth Environment
1386 :     @section Startup speed
1387 :     @cindex Startup speed
1388 :     @cindex speed, startup
1389 :    
1390 :     If Gforth is used for CGI scripts or in shell scripts, its startup
1391 :     speed may become a problem. On a 300MHz 21064a under Linux-2.2.13 with
1392 :     glibc-2.0.7, @code{gforth -e bye} takes about 24.6ms user and 11.3ms
1393 :     system time.
1394 :    
1395 :     If startup speed is a problem, you may consider the following ways to
1396 :     improve it; or you may consider ways to reduce the number of startups
1397 : crook 1.62 (for example, by using Fast-CGI).
1398 : anton 1.48
1399 :     The first step to improve startup speed is to statically link Gforth, by
1400 :     building it with @code{XLDFLAGS=-static}. This requires more memory for
1401 :     the code and will therefore slow down the first invocation, but
1402 :     subsequent invocations avoid the dynamic linking overhead. Another
1403 :     disadvantage is that Gforth won't profit from library upgrades. As a
1404 :     result, @code{gforth-static -e bye} takes about 17.1ms user and
1405 :     8.2ms system time.
1406 :    
1407 :     The next step to improve startup speed is to use a non-relocatable image
1408 : anton 1.65 (@pxref{Non-Relocatable Image Files}). You can create this image with
1409 : anton 1.48 @code{gforth -e "savesystem gforthnr.fi bye"} and later use it with
1410 :     @code{gforth -i gforthnr.fi ...}. This avoids the relocation overhead
1411 :     and a part of the copy-on-write overhead. The disadvantage is that the
1412 : crook 1.62 non-relocatable image does not work if the OS gives Gforth a different
1413 : anton 1.48 address for the dictionary, for whatever reason; so you better provide a
1414 :     fallback on a relocatable image. @code{gforth-static -i gforthnr.fi -e
1415 :     bye} takes about 15.3ms user and 7.5ms system time.
1416 :    
1417 :     The final step is to disable dictionary hashing in Gforth. Gforth
1418 :     builds the hash table on startup, which takes much of the startup
1419 :     overhead. You can do this by commenting out the @code{include hash.fs}
1420 :     in @file{startup.fs} and everything that requires @file{hash.fs} (at the
1421 :     moment @file{table.fs} and @file{ekey.fs}) and then doing @code{make}.
1422 :     The disadvantages are that functionality like @code{table} and
1423 :     @code{ekey} is missing and that text interpretation (e.g., compiling)
1424 :     now takes much longer. So, you should only use this method if there is
1425 :     no significant text interpretation to perform (the script should be
1426 : crook 1.62 compiled into the image, amongst other things). @code{gforth-static -i
1427 : anton 1.48 gforthnrnh.fi -e bye} takes about 2.1ms user and 6.1ms system time.
1428 :    
1429 :     @c ******************************************************************
1430 :     @node Tutorial, Introduction, Gforth Environment, Top
1431 :     @chapter Forth Tutorial
1432 :     @cindex Tutorial
1433 :     @cindex Forth Tutorial
1434 :    
1435 : anton 1.67 @c Topics from nac's Introduction that could be mentioned:
1436 :     @c press <ret> after each line
1437 :     @c Prompt
1438 :     @c numbers vs. words in dictionary on text interpretation
1439 :     @c what happens on redefinition
1440 :     @c parsing words (in particular, defining words)
1441 :    
1442 : anton 1.83 The difference of this chapter from the Introduction
1443 :     (@pxref{Introduction}) is that this tutorial is more fast-paced, should
1444 :     be used while sitting in front of a computer, and covers much more
1445 :     material, but does not explain how the Forth system works.
1446 :    
1447 : crook 1.62 This tutorial can be used with any ANS-compliant Forth; any
1448 :     Gforth-specific features are marked as such and you can skip them if you
1449 :     work with another Forth. This tutorial does not explain all features of
1450 :     Forth, just enough to get you started and give you some ideas about the
1451 :     facilities available in Forth. Read the rest of the manual and the
1452 :     standard when you are through this.
1453 : anton 1.48
1454 :     The intended way to use this tutorial is that you work through it while
1455 :     sitting in front of the console, take a look at the examples and predict
1456 :     what they will do, then try them out; if the outcome is not as expected,
1457 :     find out why (e.g., by trying out variations of the example), so you
1458 :     understand what's going on. There are also some assignments that you
1459 :     should solve.
1460 :    
1461 :     This tutorial assumes that you have programmed before and know what,
1462 :     e.g., a loop is.
1463 :    
1464 :     @c !! explain compat library
1465 :    
1466 :     @menu
1467 :     * Starting Gforth Tutorial::
1468 :     * Syntax Tutorial::
1469 :     * Crash Course Tutorial::
1470 :     * Stack Tutorial::
1471 :     * Arithmetics Tutorial::
1472 :     * Stack Manipulation Tutorial::
1473 :     * Using files for Forth code Tutorial::
1474 :     * Comments Tutorial::
1475 :     * Colon Definitions Tutorial::
1476 :     * Decompilation Tutorial::
1477 :     * Stack-Effect Comments Tutorial::
1478 :     * Types Tutorial::
1479 :     * Factoring Tutorial::
1480 :     * Designing the stack effect Tutorial::
1481 :     * Local Variables Tutorial::
1482 :     * Conditional execution Tutorial::
1483 :     * Flags and Comparisons Tutorial::
1484 :     * General Loops Tutorial::
1485 :     * Counted loops Tutorial::
1486 :     * Recursion Tutorial::
1487 :     * Leaving definitions or loops Tutorial::
1488 :     * Return Stack Tutorial::
1489 :     * Memory Tutorial::
1490 :     * Characters and Strings Tutorial::
1491 :     * Alignment Tutorial::
1492 : anton 1.87 * Files Tutorial::
1493 : anton 1.48 * Interpretation and Compilation Semantics and Immediacy Tutorial::
1494 :     * Execution Tokens Tutorial::
1495 :     * Exceptions Tutorial::
1496 :     * Defining Words Tutorial::
1497 :     * Arrays and Records Tutorial::
1498 :     * POSTPONE Tutorial::
1499 :     * Literal Tutorial::
1500 :     * Advanced macros Tutorial::
1501 :     * Compilation Tokens Tutorial::
1502 :     * Wordlists and Search Order Tutorial::
1503 :     @end menu
1504 :    
1505 :     @node Starting Gforth Tutorial, Syntax Tutorial, Tutorial, Tutorial
1506 :     @section Starting Gforth
1507 : anton 1.66 @cindex starting Gforth tutorial
1508 : anton 1.48 You can start Gforth by typing its name:
1509 :    
1510 :     @example
1511 :     gforth
1512 :     @end example
1513 :    
1514 :     That puts you into interactive mode; you can leave Gforth by typing
1515 :     @code{bye}. While in Gforth, you can edit the command line and access
1516 :     the command line history with cursor keys, similar to bash.
1517 :    
1518 :    
1519 :     @node Syntax Tutorial, Crash Course Tutorial, Starting Gforth Tutorial, Tutorial
1520 :     @section Syntax
1521 : anton 1.66 @cindex syntax tutorial
1522 : anton 1.48
1523 :     A @dfn{word} is a sequence of arbitrary characters (expcept white
1524 :     space). Words are separated by white space. E.g., each of the
1525 :     following lines contains exactly one word:
1526 :    
1527 :     @example
1528 :     word
1529 :     !@@#$%^&*()
1530 :     1234567890
1531 :     5!a
1532 :     @end example
1533 :    
1534 :     A frequent beginner's error is to leave away necessary white space,
1535 :     resulting in an error like @samp{Undefined word}; so if you see such an
1536 :     error, check if you have put spaces wherever necessary.
1537 :    
1538 :     @example
1539 :     ." hello, world" \ correct
1540 :     ."hello, world" \ gives an "Undefined word" error
1541 :     @end example
1542 :    
1543 : anton 1.65 Gforth and most other Forth systems ignore differences in case (they are
1544 : anton 1.48 case-insensitive), i.e., @samp{word} is the same as @samp{Word}. If
1545 :     your system is case-sensitive, you may have to type all the examples
1546 :     given here in upper case.
1547 :    
1548 :    
1549 :     @node Crash Course Tutorial, Stack Tutorial, Syntax Tutorial, Tutorial
1550 :     @section Crash Course
1551 :    
1552 :     Type
1553 :    
1554 :     @example
1555 :     0 0 !
1556 :     here execute
1557 :     ' catch >body 20 erase abort
1558 :     ' (quit) >body 20 erase
1559 :     @end example
1560 :    
1561 :     The last two examples are guaranteed to destroy parts of Gforth (and
1562 :     most other systems), so you better leave Gforth afterwards (if it has
1563 :     not finished by itself). On some systems you may have to kill gforth
1564 :     from outside (e.g., in Unix with @code{kill}).
1565 :    
1566 :     Now that you know how to produce crashes (and that there's not much to
1567 :     them), let's learn how to produce meaningful programs.
1568 :    
1569 :    
1570 :     @node Stack Tutorial, Arithmetics Tutorial, Crash Course Tutorial, Tutorial
1571 :     @section Stack
1572 : anton 1.66 @cindex stack tutorial
1573 : anton 1.48
1574 :     The most obvious feature of Forth is the stack. When you type in a
1575 :     number, it is pushed on the stack. You can display the content of the
1576 :     stack with @code{.s}.
1577 :    
1578 :     @example
1579 :     1 2 .s
1580 :     3 .s
1581 :     @end example
1582 :    
1583 :     @code{.s} displays the top-of-stack to the right, i.e., the numbers
1584 :     appear in @code{.s} output as they appeared in the input.
1585 :    
1586 :     You can print the top of stack element with @code{.}.
1587 :    
1588 :     @example
1589 :     1 2 3 . . .
1590 :     @end example
1591 :    
1592 :     In general, words consume their stack arguments (@code{.s} is an
1593 :     exception).
1594 :    
1595 :     @assignment
1596 :     What does the stack contain after @code{5 6 7 .}?
1597 :     @endassignment
1598 :    
1599 :    
1600 :     @node Arithmetics Tutorial, Stack Manipulation Tutorial, Stack Tutorial, Tutorial
1601 :     @section Arithmetics
1602 : anton 1.66 @cindex arithmetics tutorial
1603 : anton 1.48
1604 :     The words @code{+}, @code{-}, @code{*}, @code{/}, and @code{mod} always
1605 :     operate on the top two stack items:
1606 :    
1607 :     @example
1608 : anton 1.67 2 2 .s
1609 :     + .s
1610 :     .
1611 : anton 1.48 2 1 - .
1612 :     7 3 mod .
1613 :     @end example
1614 :    
1615 :     The operands of @code{-}, @code{/}, and @code{mod} are in the same order
1616 :     as in the corresponding infix expression (this is generally the case in
1617 :     Forth).
1618 :    
1619 :     Parentheses are superfluous (and not available), because the order of
1620 :     the words unambiguously determines the order of evaluation and the
1621 :     operands:
1622 :    
1623 :     @example
1624 :     3 4 + 5 * .
1625 :     3 4 5 * + .
1626 :     @end example
1627 :    
1628 :     @assignment
1629 :     What are the infix expressions corresponding to the Forth code above?
1630 :     Write @code{6-7*8+9} in Forth notation@footnote{This notation is also
1631 :     known as Postfix or RPN (Reverse Polish Notation).}.
1632 :     @endassignment
1633 :    
1634 :     To change the sign, use @code{negate}:
1635 :    
1636 :     @example
1637 :     2 negate .
1638 :     @end example
1639 :    
1640 :     @assignment
1641 :     Convert -(-3)*4-5 to Forth.
1642 :     @endassignment
1643 :    
1644 :     @code{/mod} performs both @code{/} and @code{mod}.
1645 :    
1646 :     @example
1647 :     7 3 /mod . .
1648 :     @end example
1649 :    
1650 : anton 1.66 Reference: @ref{Arithmetic}.
1651 :    
1652 :    
1653 : anton 1.48 @node Stack Manipulation Tutorial, Using files for Forth code Tutorial, Arithmetics Tutorial, Tutorial
1654 :     @section Stack Manipulation
1655 : anton 1.66 @cindex stack manipulation tutorial
1656 : anton 1.48
1657 :     Stack manipulation words rearrange the data on the stack.
1658 :    
1659 :     @example
1660 :     1 .s drop .s
1661 :     1 .s dup .s drop drop .s
1662 :     1 2 .s over .s drop drop drop
1663 :     1 2 .s swap .s drop drop
1664 :     1 2 3 .s rot .s drop drop drop
1665 :     @end example
1666 :    
1667 :     These are the most important stack manipulation words. There are also
1668 :     variants that manipulate twice as many stack items:
1669 :    
1670 :     @example
1671 :     1 2 3 4 .s 2swap .s 2drop 2drop
1672 :     @end example
1673 :    
1674 :     Two more stack manipulation words are:
1675 :    
1676 :     @example
1677 :     1 2 .s nip .s drop
1678 :     1 2 .s tuck .s 2drop drop
1679 :     @end example
1680 :    
1681 :     @assignment
1682 :     Replace @code{nip} and @code{tuck} with combinations of other stack
1683 :     manipulation words.
1684 :    
1685 :     @example
1686 :     Given: How do you get:
1687 :     1 2 3 3 2 1
1688 :     1 2 3 1 2 3 2
1689 :     1 2 3 1 2 3 3
1690 :     1 2 3 1 3 3
1691 :     1 2 3 2 1 3
1692 :     1 2 3 4 4 3 2 1
1693 :     1 2 3 1 2 3 1 2 3
1694 :     1 2 3 4 1 2 3 4 1 2
1695 :     1 2 3
1696 :     1 2 3 1 2 3 4
1697 :     1 2 3 1 3
1698 :     @end example
1699 :     @endassignment
1700 :    
1701 :     @example
1702 :     5 dup * .
1703 :     @end example
1704 :    
1705 :     @assignment
1706 :     Write 17^3 and 17^4 in Forth, without writing @code{17} more than once.
1707 :     Write a piece of Forth code that expects two numbers on the stack
1708 :     (@var{a} and @var{b}, with @var{b} on top) and computes
1709 :     @code{(a-b)(a+1)}.
1710 :     @endassignment
1711 :    
1712 : anton 1.66 Reference: @ref{Stack Manipulation}.
1713 :    
1714 :    
1715 : anton 1.48 @node Using files for Forth code Tutorial, Comments Tutorial, Stack Manipulation Tutorial, Tutorial
1716 :     @section Using files for Forth code
1717 : anton 1.66 @cindex loading Forth code, tutorial
1718 :     @cindex files containing Forth code, tutorial
1719 : anton 1.48
1720 :     While working at the Forth command line is convenient for one-line
1721 :     examples and short one-off code, you probably want to store your source
1722 :     code in files for convenient editing and persistence. You can use your
1723 :     favourite editor (Gforth includes Emacs support, @pxref{Emacs and
1724 : anton 1.102 Gforth}) to create @var{file.fs} and use
1725 : anton 1.48
1726 :     @example
1727 : anton 1.102 s" @var{file.fs}" included
1728 : anton 1.48 @end example
1729 :    
1730 :     to load it into your Forth system. The file name extension I use for
1731 :     Forth files is @samp{.fs}.
1732 :    
1733 :     You can easily start Gforth with some files loaded like this:
1734 :    
1735 :     @example
1736 : anton 1.102 gforth @var{file1.fs} @var{file2.fs}
1737 : anton 1.48 @end example
1738 :    
1739 :     If an error occurs during loading these files, Gforth terminates,
1740 :     whereas an error during @code{INCLUDED} within Gforth usually gives you
1741 :     a Gforth command line. Starting the Forth system every time gives you a
1742 :     clean start every time, without interference from the results of earlier
1743 :     tries.
1744 :    
1745 :     I often put all the tests in a file, then load the code and run the
1746 :     tests with
1747 :    
1748 :     @example
1749 : anton 1.102 gforth @var{code.fs} @var{tests.fs} -e bye
1750 : anton 1.48 @end example
1751 :    
1752 :     (often by performing this command with @kbd{C-x C-e} in Emacs). The
1753 :     @code{-e bye} ensures that Gforth terminates afterwards so that I can
1754 :     restart this command without ado.
1755 :    
1756 :     The advantage of this approach is that the tests can be repeated easily
1757 :     every time the program ist changed, making it easy to catch bugs
1758 :     introduced by the change.
1759 :    
1760 : anton 1.66 Reference: @ref{Forth source files}.
1761 :    
1762 : anton 1.48
1763 :     @node Comments Tutorial, Colon Definitions Tutorial, Using files for Forth code Tutorial, Tutorial
1764 :     @section Comments
1765 : anton 1.66 @cindex comments tutorial
1766 : anton 1.48
1767 :     @example
1768 :     \ That's a comment; it ends at the end of the line
1769 :     ( Another comment; it ends here: ) .s
1770 :     @end example
1771 :    
1772 :     @code{\} and @code{(} are ordinary Forth words and therefore have to be
1773 :     separated with white space from the following text.
1774 :    
1775 :     @example
1776 :     \This gives an "Undefined word" error
1777 :     @end example
1778 :    
1779 :     The first @code{)} ends a comment started with @code{(}, so you cannot
1780 :     nest @code{(}-comments; and you cannot comment out text containing a
1781 :     @code{)} with @code{( ... )}@footnote{therefore it's a good idea to
1782 :     avoid @code{)} in word names.}.
1783 :    
1784 :     I use @code{\}-comments for descriptive text and for commenting out code
1785 :     of one or more line; I use @code{(}-comments for describing the stack
1786 :     effect, the stack contents, or for commenting out sub-line pieces of
1787 :     code.
1788 :    
1789 :     The Emacs mode @file{gforth.el} (@pxref{Emacs and Gforth}) supports
1790 :     these uses by commenting out a region with @kbd{C-x \}, uncommenting a
1791 :     region with @kbd{C-u C-x \}, and filling a @code{\}-commented region
1792 :     with @kbd{M-q}.
1793 :    
1794 : anton 1.66 Reference: @ref{Comments}.
1795 :    
1796 : anton 1.48
1797 :     @node Colon Definitions Tutorial, Decompilation Tutorial, Comments Tutorial, Tutorial
1798 :     @section Colon Definitions
1799 : anton 1.66 @cindex colon definitions, tutorial
1800 :     @cindex definitions, tutorial
1801 :     @cindex procedures, tutorial
1802 :     @cindex functions, tutorial
1803 : anton 1.48
1804 :     are similar to procedures and functions in other programming languages.
1805 :    
1806 :     @example
1807 :     : squared ( n -- n^2 )
1808 :     dup * ;
1809 :     5 squared .
1810 :     7 squared .
1811 :     @end example
1812 :    
1813 :     @code{:} starts the colon definition; its name is @code{squared}. The
1814 :     following comment describes its stack effect. The words @code{dup *}
1815 :     are not executed, but compiled into the definition. @code{;} ends the
1816 :     colon definition.
1817 :    
1818 :     The newly-defined word can be used like any other word, including using
1819 :     it in other definitions:
1820 :    
1821 :     @example
1822 :     : cubed ( n -- n^3 )
1823 :     dup squared * ;
1824 :     -5 cubed .
1825 :     : fourth-power ( n -- n^4 )
1826 :     squared squared ;
1827 :     3 fourth-power .
1828 :     @end example
1829 :    
1830 :     @assignment
1831 :     Write colon definitions for @code{nip}, @code{tuck}, @code{negate}, and
1832 :     @code{/mod} in terms of other Forth words, and check if they work (hint:
1833 :     test your tests on the originals first). Don't let the
1834 :     @samp{redefined}-Messages spook you, they are just warnings.
1835 :     @endassignment
1836 :    
1837 : anton 1.66 Reference: @ref{Colon Definitions}.
1838 :    
1839 : anton 1.48
1840 :     @node Decompilation Tutorial, Stack-Effect Comments Tutorial, Colon Definitions Tutorial, Tutorial
1841 :     @section Decompilation
1842 : anton 1.66 @cindex decompilation tutorial
1843 :     @cindex see tutorial
1844 : anton 1.48
1845 :     You can decompile colon definitions with @code{see}:
1846 :    
1847 :     @example
1848 :     see squared
1849 :     see cubed
1850 :     @end example
1851 :    
1852 :     In Gforth @code{see} shows you a reconstruction of the source code from
1853 :     the executable code. Informations that were present in the source, but
1854 :     not in the executable code, are lost (e.g., comments).
1855 :    
1856 : anton 1.65 You can also decompile the predefined words:
1857 :    
1858 :     @example
1859 :     see .
1860 :     see +
1861 :     @end example
1862 :    
1863 :    
1864 : anton 1.48 @node Stack-Effect Comments Tutorial, Types Tutorial, Decompilation Tutorial, Tutorial
1865 :     @section Stack-Effect Comments
1866 : anton 1.66 @cindex stack-effect comments, tutorial
1867 :     @cindex --, tutorial
1868 : anton 1.48 By convention the comment after the name of a definition describes the
1869 :     stack effect: The part in from of the @samp{--} describes the state of
1870 :     the stack before the execution of the definition, i.e., the parameters
1871 :     that are passed into the colon definition; the part behind the @samp{--}
1872 :     is the state of the stack after the execution of the definition, i.e.,
1873 :     the results of the definition. The stack comment only shows the top
1874 :     stack items that the definition accesses and/or changes.
1875 :    
1876 :     You should put a correct stack effect on every definition, even if it is
1877 :     just @code{( -- )}. You should also add some descriptive comment to
1878 :     more complicated words (I usually do this in the lines following
1879 :     @code{:}). If you don't do this, your code becomes unreadable (because
1880 :     you have to work through every definition before you can undertsand
1881 :     any).
1882 :    
1883 :     @assignment
1884 :     The stack effect of @code{swap} can be written like this: @code{x1 x2 --
1885 :     x2 x1}. Describe the stack effect of @code{-}, @code{drop}, @code{dup},
1886 :     @code{over}, @code{rot}, @code{nip}, and @code{tuck}. Hint: When you
1887 : anton 1.65 are done, you can compare your stack effects to those in this manual
1888 : anton 1.48 (@pxref{Word Index}).
1889 :     @endassignment
1890 :    
1891 :     Sometimes programmers put comments at various places in colon
1892 :     definitions that describe the contents of the stack at that place (stack
1893 :     comments); i.e., they are like the first part of a stack-effect
1894 :     comment. E.g.,
1895 :    
1896 :     @example
1897 :     : cubed ( n -- n^3 )
1898 :     dup squared ( n n^2 ) * ;
1899 :     @end example
1900 :    
1901 :     In this case the stack comment is pretty superfluous, because the word
1902 :     is simple enough. If you think it would be a good idea to add such a
1903 :     comment to increase readability, you should also consider factoring the
1904 :     word into several simpler words (@pxref{Factoring Tutorial,,
1905 : anton 1.60 Factoring}), which typically eliminates the need for the stack comment;
1906 : anton 1.48 however, if you decide not to refactor it, then having such a comment is
1907 :     better than not having it.
1908 :    
1909 :     The names of the stack items in stack-effect and stack comments in the
1910 :     standard, in this manual, and in many programs specify the type through
1911 :     a type prefix, similar to Fortran and Hungarian notation. The most
1912 :     frequent prefixes are:
1913 :    
1914 :     @table @code
1915 :     @item n
1916 :     signed integer
1917 :     @item u
1918 :     unsigned integer
1919 :     @item c
1920 :     character
1921 :     @item f
1922 :     Boolean flags, i.e. @code{false} or @code{true}.
1923 :     @item a-addr,a-
1924 :     Cell-aligned address
1925 :     @item c-addr,c-
1926 :     Char-aligned address (note that a Char may have two bytes in Windows NT)
1927 :     @item xt
1928 :     Execution token, same size as Cell
1929 :     @item w,x
1930 :     Cell, can contain an integer or an address. It usually takes 32, 64 or
1931 :     16 bits (depending on your platform and Forth system). A cell is more
1932 :     commonly known as machine word, but the term @emph{word} already means
1933 :     something different in Forth.
1934 :     @item d
1935 :     signed double-cell integer
1936 :     @item ud
1937 :     unsigned double-cell integer
1938 :     @item r
1939 :     Float (on the FP stack)
1940 :     @end table
1941 :    
1942 :     You can find a more complete list in @ref{Notation}.
1943 :    
1944 :     @assignment
1945 :     Write stack-effect comments for all definitions you have written up to
1946 :     now.
1947 :     @endassignment
1948 :    
1949 :    
1950 :     @node Types Tutorial, Factoring Tutorial, Stack-Effect Comments Tutorial, Tutorial
1951 :     @section Types
1952 : anton 1.66 @cindex types tutorial
1953 : anton 1.48
1954 :     In Forth the names of the operations are not overloaded; so similar
1955 :     operations on different types need different names; e.g., @code{+} adds
1956 :     integers, and you have to use @code{f+} to add floating-point numbers.
1957 :     The following prefixes are often used for related operations on
1958 :     different types:
1959 :    
1960 :     @table @code
1961 :     @item (none)
1962 :     signed integer
1963 :     @item u
1964 :     unsigned integer
1965 :     @item c
1966 :     character
1967 :     @item d
1968 :     signed double-cell integer
1969 :     @item ud, du
1970 :     unsigned double-cell integer
1971 :     @item 2
1972 :     two cells (not-necessarily double-cell numbers)
1973 :     @item m, um
1974 :     mixed single-cell and double-cell operations
1975 :     @item f
1976 :     floating-point (note that in stack comments @samp{f} represents flags,
1977 : anton 1.66 and @samp{r} represents FP numbers).
1978 : anton 1.48 @end table
1979 :    
1980 :     If there are no differences between the signed and the unsigned variant
1981 :     (e.g., for @code{+}), there is only the prefix-less variant.
1982 :    
1983 :     Forth does not perform type checking, neither at compile time, nor at
1984 :     run time. If you use the wrong oeration, the data are interpreted
1985 :     incorrectly:
1986 :    
1987 :     @example
1988 :     -1 u.
1989 :     @end example
1990 :    
1991 :     If you have only experience with type-checked languages until now, and
1992 :     have heard how important type-checking is, don't panic! In my
1993 :     experience (and that of other Forthers), type errors in Forth code are
1994 :     usually easy to find (once you get used to it), the increased vigilance
1995 :     of the programmer tends to catch some harder errors in addition to most
1996 :     type errors, and you never have to work around the type system, so in
1997 :     most situations the lack of type-checking seems to be a win (projects to
1998 :     add type checking to Forth have not caught on).
1999 :    
2000 :    
2001 :     @node Factoring Tutorial, Designing the stack effect Tutorial, Types Tutorial, Tutorial
2002 :     @section Factoring
2003 : anton 1.66 @cindex factoring tutorial
2004 : anton 1.48
2005 :     If you try to write longer definitions, you will soon find it hard to
2006 :     keep track of the stack contents. Therefore, good Forth programmers
2007 :     tend to write only short definitions (e.g., three lines). The art of
2008 :     finding meaningful short definitions is known as factoring (as in
2009 :     factoring polynomials).
2010 :    
2011 :     Well-factored programs offer additional advantages: smaller, more
2012 :     general words, are easier to test and debug and can be reused more and
2013 :     better than larger, specialized words.
2014 :    
2015 :     So, if you run into difficulties with stack management, when writing
2016 :     code, try to define meaningful factors for the word, and define the word
2017 :     in terms of those. Even if a factor contains only two words, it is
2018 :     often helpful.
2019 :    
2020 : anton 1.65 Good factoring is not easy, and it takes some practice to get the knack
2021 :     for it; but even experienced Forth programmers often don't find the
2022 :     right solution right away, but only when rewriting the program. So, if
2023 :     you don't come up with a good solution immediately, keep trying, don't
2024 :     despair.
2025 : anton 1.48
2026 :     @c example !!
2027 :    
2028 :    
2029 :     @node Designing the stack effect Tutorial, Local Variables Tutorial, Factoring Tutorial, Tutorial
2030 :     @section Designing the stack effect
2031 : anton 1.66 @cindex Stack effect design, tutorial
2032 :     @cindex design of stack effects, tutorial
2033 : anton 1.48
2034 :     In other languages you can use an arbitrary order of parameters for a
2035 : anton 1.65 function; and since there is only one result, you don't have to deal with
2036 : anton 1.48 the order of results, either.
2037 :    
2038 :     In Forth (and other stack-based languages, e.g., Postscript) the
2039 :     parameter and result order of a definition is important and should be
2040 :     designed well. The general guideline is to design the stack effect such
2041 :     that the word is simple to use in most cases, even if that complicates
2042 :     the implementation of the word. Some concrete rules are:
2043 :    
2044 :     @itemize @bullet
2045 :    
2046 :     @item
2047 :     Words consume all of their parameters (e.g., @code{.}).
2048 :    
2049 :     @item
2050 :     If there is a convention on the order of parameters (e.g., from
2051 :     mathematics or another programming language), stick with it (e.g.,
2052 :     @code{-}).
2053 :    
2054 :     @item
2055 :     If one parameter usually requires only a short computation (e.g., it is
2056 :     a constant), pass it on the top of the stack. Conversely, parameters
2057 :     that usually require a long sequence of code to compute should be passed
2058 :     as the bottom (i.e., first) parameter. This makes the code easier to
2059 :     read, because reader does not need to keep track of the bottom item
2060 :     through a long sequence of code (or, alternatively, through stack
2061 : anton 1.49 manipulations). E.g., @code{!} (store, @pxref{Memory}) expects the
2062 : anton 1.48 address on top of the stack because it is usually simpler to compute
2063 :     than the stored value (often the address is just a variable).
2064 :    
2065 :     @item
2066 :     Similarly, results that are usually consumed quickly should be returned
2067 :     on the top of stack, whereas a result that is often used in long
2068 :     computations should be passed as bottom result. E.g., the file words
2069 :     like @code{open-file} return the error code on the top of stack, because
2070 :     it is usually consumed quickly by @code{throw}; moreover, the error code
2071 :     has to be checked before doing anything with the other results.
2072 :    
2073 :     @end itemize
2074 :    
2075 :     These rules are just general guidelines, don't lose sight of the overall
2076 :     goal to make the words easy to use. E.g., if the convention rule
2077 :     conflicts with the computation-length rule, you might decide in favour
2078 :     of the convention if the word will be used rarely, and in favour of the
2079 :     computation-length rule if the word will be used frequently (because
2080 :     with frequent use the cost of breaking the computation-length rule would
2081 :     be quite high, and frequent use makes it easier to remember an
2082 :     unconventional order).
2083 :    
2084 :     @c example !! structure package
2085 :    
2086 : anton 1.65
2087 : anton 1.48 @node Local Variables Tutorial, Conditional execution Tutorial, Designing the stack effect Tutorial, Tutorial
2088 :     @section Local Variables
2089 : anton 1.66 @cindex local variables, tutorial
2090 : anton 1.48
2091 :     You can define local variables (@emph{locals}) in a colon definition:
2092 :    
2093 :     @example
2094 :     : swap @{ a b -- b a @}
2095 :     b a ;
2096 :     1 2 swap .s 2drop
2097 :     @end example
2098 :    
2099 :     (If your Forth system does not support this syntax, include
2100 :     @file{compat/anslocals.fs} first).
2101 :    
2102 :     In this example @code{@{ a b -- b a @}} is the locals definition; it
2103 :     takes two cells from the stack, puts the top of stack in @code{b} and
2104 :     the next stack element in @code{a}. @code{--} starts a comment ending
2105 :     with @code{@}}. After the locals definition, using the name of the
2106 :     local will push its value on the stack. You can leave the comment
2107 :     part (@code{-- b a}) away:
2108 :    
2109 :     @example
2110 :     : swap ( x1 x2 -- x2 x1 )
2111 :     @{ a b @} b a ;
2112 :     @end example
2113 :    
2114 :     In Gforth you can have several locals definitions, anywhere in a colon
2115 :     definition; in contrast, in a standard program you can have only one
2116 :     locals definition per colon definition, and that locals definition must
2117 :     be outside any controll structure.
2118 :    
2119 :     With locals you can write slightly longer definitions without running
2120 :     into stack trouble. However, I recommend trying to write colon
2121 :     definitions without locals for exercise purposes to help you gain the
2122 :     essential factoring skills.
2123 :    
2124 :     @assignment
2125 :     Rewrite your definitions until now with locals
2126 :     @endassignment
2127 :    
2128 : anton 1.66 Reference: @ref{Locals}.
2129 :    
2130 : anton 1.48
2131 :     @node Conditional execution Tutorial, Flags and Comparisons Tutorial, Local Variables Tutorial, Tutorial
2132 :     @section Conditional execution
2133 : anton 1.66 @cindex conditionals, tutorial
2134 :     @cindex if, tutorial
2135 : anton 1.48
2136 :     In Forth you can use control structures only inside colon definitions.
2137 :     An @code{if}-structure looks like this:
2138 :    
2139 :     @example
2140 :     : abs ( n1 -- +n2 )
2141 :     dup 0 < if
2142 :     negate
2143 :     endif ;
2144 :     5 abs .
2145 :     -5 abs .
2146 :     @end example
2147 :    
2148 :     @code{if} takes a flag from the stack. If the flag is non-zero (true),
2149 :     the following code is performed, otherwise execution continues after the
2150 : pazsan 1.51 @code{endif} (or @code{else}). @code{<} compares the top two stack
2151 : anton 1.48 elements and prioduces a flag:
2152 :    
2153 :     @example
2154 :     1 2 < .
2155 :     2 1 < .
2156 :     1 1 < .
2157 :     @end example
2158 :    
2159 :     Actually the standard name for @code{endif} is @code{then}. This
2160 :     tutorial presents the examples using @code{endif}, because this is often
2161 :     less confusing for people familiar with other programming languages
2162 :     where @code{then} has a different meaning. If your system does not have
2163 :     @code{endif}, define it with
2164 :    
2165 :     @example
2166 :     : endif postpone then ; immediate
2167 :     @end example
2168 :    
2169 :     You can optionally use an @code{else}-part:
2170 :    
2171 :     @example
2172 :     : min ( n1 n2 -- n )
2173 :     2dup < if
2174 :     drop
2175 :     else
2176 :     nip
2177 :     endif ;
2178 :     2 3 min .
2179 :     3 2 min .
2180 :     @end example
2181 :    
2182 :     @assignment
2183 :     Write @code{min} without @code{else}-part (hint: what's the definition
2184 :     of @code{nip}?).
2185 :     @endassignment
2186 :    
2187 : anton 1.66 Reference: @ref{Selection}.
2188 :    
2189 : anton 1.48
2190 :     @node Flags and Comparisons Tutorial, General Loops Tutorial, Conditional execution Tutorial, Tutorial
2191 :     @section Flags and Comparisons
2192 : anton 1.66 @cindex flags tutorial
2193 :     @cindex comparison tutorial
2194 : anton 1.48
2195 :     In a false-flag all bits are clear (0 when interpreted as integer). In
2196 :     a canonical true-flag all bits are set (-1 as a twos-complement signed
2197 :     integer); in many contexts (e.g., @code{if}) any non-zero value is
2198 :     treated as true flag.
2199 :    
2200 :     @example
2201 :     false .
2202 :     true .
2203 :     true hex u. decimal
2204 :     @end example
2205 :    
2206 :     Comparison words produce canonical flags:
2207 :    
2208 :     @example
2209 :     1 1 = .
2210 :     1 0= .
2211 :     0 1 < .
2212 :     0 0 < .
2213 :     -1 1 u< . \ type error, u< interprets -1 as large unsigned number
2214 :     -1 1 < .
2215 :     @end example
2216 :    
2217 : anton 1.66 Gforth supports all combinations of the prefixes @code{0 u d d0 du f f0}
2218 :     (or none) and the comparisons @code{= <> < > <= >=}. Only a part of
2219 :     these combinations are standard (for details see the standard,
2220 :     @ref{Numeric comparison}, @ref{Floating Point} or @ref{Word Index}).
2221 : anton 1.48
2222 :     You can use @code{and or xor invert} can be used as operations on
2223 :     canonical flags. Actually they are bitwise operations:
2224 :    
2225 :     @example
2226 :     1 2 and .
2227 :     1 2 or .
2228 :     1 3 xor .
2229 :     1 invert .
2230 :     @end example
2231 :    
2232 :     You can convert a zero/non-zero flag into a canonical flag with
2233 :     @code{0<>} (and complement it on the way with @code{0=}).
2234 :    
2235 :     @example
2236 :     1 0= .
2237 :     1 0<> .
2238 :     @end example
2239 :    
2240 : anton 1.65 You can use the all-bits-set feature of canonical flags and the bitwise
2241 : anton 1.48 operation of the Boolean operations to avoid @code{if}s:
2242 :    
2243 :     @example
2244 :     : foo ( n1 -- n2 )
2245 :     0= if
2246 :     14
2247 :     else
2248 :     0
2249 :     endif ;
2250 :     0 foo .
2251 :     1 foo .
2252 :    
2253 :     : foo ( n1 -- n2 )
2254 :     0= 14 and ;
2255 :     0 foo .
2256 :     1 foo .
2257 :     @end example
2258 :    
2259 :     @assignment
2260 :     Write @code{min} without @code{if}.
2261 :     @endassignment
2262 :    
2263 : anton 1.66 For reference, see @ref{Boolean Flags}, @ref{Numeric comparison}, and
2264 :     @ref{Bitwise operations}.
2265 :    
2266 : anton 1.48
2267 :     @node General Loops Tutorial, Counted loops Tutorial, Flags and Comparisons Tutorial, Tutorial
2268 :     @section General Loops
2269 : anton 1.66 @cindex loops, indefinite, tutorial
2270 : anton 1.48
2271 :     The endless loop is the most simple one:
2272 :    
2273 :     @example
2274 :     : endless ( -- )
2275 :     0 begin
2276 :     dup . 1+
2277 :     again ;
2278 :     endless
2279 :     @end example
2280 :    
2281 :     Terminate this loop by pressing @kbd{Ctrl-C} (in Gforth). @code{begin}
2282 :     does nothing at run-time, @code{again} jumps back to @code{begin}.
2283 :    
2284 :     A loop with one exit at any place looks like this:
2285 :    
2286 :     @example
2287 :     : log2 ( +n1 -- n2 )
2288 :     \ logarithmus dualis of n1>0, rounded down to the next integer
2289 :     assert( dup 0> )
2290 :     2/ 0 begin
2291 :     over 0> while
2292 :     1+ swap 2/ swap
2293 :     repeat
2294 :     nip ;
2295 :     7 log2 .
2296 :     8 log2 .
2297 :     @end example
2298 :    
2299 :     At run-time @code{while} consumes a flag; if it is 0, execution
2300 : pazsan 1.51 continues behind the @code{repeat}; if the flag is non-zero, execution
2301 : anton 1.48 continues behind the @code{while}. @code{Repeat} jumps back to
2302 :     @code{begin}, just like @code{again}.
2303 :    
2304 :     In Forth there are many combinations/abbreviations, like @code{1+}.
2305 : anton 1.90 However, @code{2/} is not one of them; it shifts its argument right by
2306 : anton 1.48 one bit (arithmetic shift right):
2307 :    
2308 :     @example
2309 :     -5 2 / .
2310 :     -5 2/ .
2311 :     @end example
2312 :    
2313 :     @code{assert(} is no standard word, but you can get it on systems other
2314 :     then Gforth by including @file{compat/assert.fs}. You can see what it
2315 :     does by trying
2316 :    
2317 :     @example
2318 :     0 log2 .
2319 :     @end example
2320 :    
2321 :     Here's a loop with an exit at the end:
2322 :    
2323 :     @example
2324 :     : log2 ( +n1 -- n2 )
2325 :     \ logarithmus dualis of n1>0, rounded down to the next integer
2326 :     assert( dup 0 > )
2327 :     -1 begin
2328 :     1+ swap 2/ swap
2329 :     over 0 <=
2330 :     until
2331 :     nip ;
2332 :     @end example
2333 :    
2334 :     @code{Until} consumes a flag; if it is non-zero, execution continues at
2335 :     the @code{begin}, otherwise after the @code{until}.
2336 :    
2337 :     @assignment
2338 :     Write a definition for computing the greatest common divisor.
2339 :     @endassignment
2340 :    
2341 : anton 1.66 Reference: @ref{Simple Loops}.
2342 :    
2343 : anton 1.48
2344 :     @node Counted loops Tutorial, Recursion Tutorial, General Loops Tutorial, Tutorial
2345 :     @section Counted loops
2346 : anton 1.66 @cindex loops, counted, tutorial
2347 : anton 1.48
2348 :     @example
2349 :     : ^ ( n1 u -- n )
2350 :     \ n = the uth power of u1
2351 :     1 swap 0 u+do
2352 :     over *
2353 :     loop
2354 :     nip ;
2355 :     3 2 ^ .
2356 :     4 3 ^ .
2357 :     @end example
2358 :    
2359 :     @code{U+do} (from @file{compat/loops.fs}, if your Forth system doesn't
2360 :     have it) takes two numbers of the stack @code{( u3 u4 -- )}, and then
2361 :     performs the code between @code{u+do} and @code{loop} for @code{u3-u4}
2362 :     times (or not at all, if @code{u3-u4<0}).
2363 :    
2364 :     You can see the stack effect design rules at work in the stack effect of
2365 :     the loop start words: Since the start value of the loop is more
2366 :     frequently constant than the end value, the start value is passed on
2367 :     the top-of-stack.
2368 :    
2369 :     You can access the counter of a counted loop with @code{i}:
2370 :    
2371 :     @example
2372 :     : fac ( u -- u! )
2373 :     1 swap 1+ 1 u+do
2374 :     i *
2375 :     loop ;
2376 :     5 fac .
2377 :     7 fac .
2378 :     @end example
2379 :    
2380 :     There is also @code{+do}, which expects signed numbers (important for
2381 :     deciding whether to enter the loop).
2382 :    
2383 :     @assignment
2384 :     Write a definition for computing the nth Fibonacci number.
2385 :     @endassignment
2386 :    
2387 : anton 1.65 You can also use increments other than 1:
2388 :    
2389 :     @example
2390 :     : up2 ( n1 n2 -- )
2391 :     +do
2392 :     i .
2393 :     2 +loop ;
2394 :     10 0 up2
2395 :    
2396 :     : down2 ( n1 n2 -- )
2397 :     -do
2398 :     i .
2399 :     2 -loop ;
2400 :     0 10 down2
2401 :     @end example
2402 : anton 1.48
2403 : anton 1.66 Reference: @ref{Counted Loops}.
2404 :    
2405 : anton 1.48
2406 :     @node Recursion Tutorial, Leaving definitions or loops Tutorial, Counted loops Tutorial, Tutorial
2407 :     @section Recursion
2408 : anton 1.66 @cindex recursion tutorial
2409 : anton 1.48
2410 :     Usually the name of a definition is not visible in the definition; but
2411 :     earlier definitions are usually visible:
2412 :    
2413 :     @example
2414 :     1 0 / . \ "Floating-point unidentified fault" in Gforth on most platforms
2415 :     : / ( n1 n2 -- n )
2416 :     dup 0= if
2417 :     -10 throw \ report division by zero
2418 :     endif
2419 :     / \ old version
2420 :     ;
2421 :     1 0 /
2422 :     @end example
2423 :    
2424 :     For recursive definitions you can use @code{recursive} (non-standard) or
2425 :     @code{recurse}:
2426 :    
2427 :     @example
2428 :     : fac1 ( n -- n! ) recursive
2429 :     dup 0> if
2430 :     dup 1- fac1 *
2431 :     else
2432 :     drop 1
2433 :     endif ;
2434 :     7 fac1 .
2435 :    
2436 :     : fac2 ( n -- n! )
2437 :     dup 0> if
2438 :     dup 1- recurse *
2439 :     else
2440 :     drop 1
2441 :     endif ;
2442 :     8 fac2 .
2443 :     @end example
2444 :    
2445 :     @assignment
2446 :     Write a recursive definition for computing the nth Fibonacci number.
2447 :     @endassignment
2448 :    
2449 : anton 1.66 Reference (including indirect recursion): @xref{Calls and returns}.
2450 :    
2451 : anton 1.48
2452 :     @node Leaving definitions or loops Tutorial, Return Stack Tutorial, Recursion Tutorial, Tutorial
2453 :     @section Leaving definitions or loops
2454 : anton 1.66 @cindex leaving definitions, tutorial
2455 :     @cindex leaving loops, tutorial
2456 : anton 1.48
2457 :     @code{EXIT} exits the current definition right away. For every counted
2458 :     loop that is left in this way, an @code{UNLOOP} has to be performed
2459 :     before the @code{EXIT}:
2460 :    
2461 :     @c !! real examples
2462 :     @example
2463 :     : ...
2464 :     ... u+do
2465 :     ... if
2466 :     ... unloop exit
2467 :     endif
2468 :     ...
2469 :     loop
2470 :     ... ;
2471 :     @end example
2472 :    
2473 :     @code{LEAVE} leaves the innermost counted loop right away:
2474 :    
2475 :     @example
2476 :     : ...
2477 :     ... u+do
2478 :     ... if
2479 :     ... leave
2480 :     endif
2481 :     ...
2482 :     loop
2483 :     ... ;
2484 :     @end example
2485 :    
2486 : anton 1.65 @c !! example
2487 : anton 1.48
2488 : anton 1.66 Reference: @ref{Calls and returns}, @ref{Counted Loops}.
2489 :    
2490 :    
2491 : anton 1.48 @node Return Stack Tutorial, Memory Tutorial, Leaving definitions or loops Tutorial, Tutorial
2492 :     @section Return Stack
2493 : anton 1.66 @cindex return stack tutorial
2494 : anton 1.48
2495 :     In addition to the data stack Forth also has a second stack, the return
2496 :     stack; most Forth systems store the return addresses of procedure calls
2497 :     there (thus its name). Programmers can also use this stack:
2498 :    
2499 :     @example
2500 :     : foo ( n1 n2 -- )
2501 :     .s
2502 :     >r .s
2503 : anton 1.50 r@@ .
2504 : anton 1.48 >r .s
2505 : anton 1.50 r@@ .
2506 : anton 1.48 r> .
2507 : anton 1.50 r@@ .
2508 : anton 1.48 r> . ;
2509 :     1 2 foo
2510 :     @end example
2511 :    
2512 :     @code{>r} takes an element from the data stack and pushes it onto the
2513 :     return stack; conversely, @code{r>} moves an elementm from the return to
2514 :     the data stack; @code{r@@} pushes a copy of the top of the return stack
2515 :     on the return stack.
2516 :    
2517 :     Forth programmers usually use the return stack for storing data
2518 :     temporarily, if using the data stack alone would be too complex, and
2519 :     factoring and locals are not an option:
2520 :    
2521 :     @example
2522 :     : 2swap ( x1 x2 x3 x4 -- x3 x4 x1 x2 )
2523 :     rot >r rot r> ;
2524 :     @end example
2525 :    
2526 :     The return address of the definition and the loop control parameters of
2527 :     counted loops usually reside on the return stack, so you have to take
2528 :     all items, that you have pushed on the return stack in a colon
2529 :     definition or counted loop, from the return stack before the definition
2530 :     or loop ends. You cannot access items that you pushed on the return
2531 :     stack outside some definition or loop within the definition of loop.
2532 :    
2533 :     If you miscount the return stack items, this usually ends in a crash:
2534 :    
2535 :     @example
2536 :     : crash ( n -- )
2537 :     >r ;
2538 :     5 crash
2539 :     @end example
2540 :    
2541 :     You cannot mix using locals and using the return stack (according to the
2542 :     standard; Gforth has no problem). However, they solve the same
2543 :     problems, so this shouldn't be an issue.
2544 :    
2545 :     @assignment
2546 :     Can you rewrite any of the definitions you wrote until now in a better
2547 :     way using the return stack?
2548 :     @endassignment
2549 :    
2550 : anton 1.66 Reference: @ref{Return stack}.
2551 :    
2552 : anton 1.48
2553 :     @node Memory Tutorial, Characters and Strings Tutorial, Return Stack Tutorial, Tutorial
2554 :     @section Memory
2555 : anton 1.66 @cindex memory access/allocation tutorial
2556 : anton 1.48
2557 :     You can create a global variable @code{v} with
2558 :    
2559 :     @example
2560 :     variable v ( -- addr )
2561 :     @end example
2562 :    
2563 :     @code{v} pushes the address of a cell in memory on the stack. This cell
2564 :     was reserved by @code{variable}. You can use @code{!} (store) to store
2565 :     values into this cell and @code{@@} (fetch) to load the value from the
2566 :     stack into memory:
2567 :    
2568 :     @example
2569 :     v .
2570 :     5 v ! .s
2571 : anton 1.50 v @@ .
2572 : anton 1.48 @end example
2573 :    
2574 : anton 1.65 You can see a raw dump of memory with @code{dump}:
2575 :    
2576 :     @example
2577 :     v 1 cells .s dump
2578 :     @end example
2579 :    
2580 :     @code{Cells ( n1 -- n2 )} gives you the number of bytes (or, more
2581 :     generally, address units (aus)) that @code{n1 cells} occupy. You can
2582 :     also reserve more memory:
2583 : anton 1.48
2584 :     @example
2585 :     create v2 20 cells allot
2586 : anton 1.65 v2 20 cells dump
2587 : anton 1.48 @end example
2588 :    
2589 : anton 1.65 creates a word @code{v2} and reserves 20 uninitialized cells; the
2590 :     address pushed by @code{v2} points to the start of these 20 cells. You
2591 :     can use address arithmetic to access these cells:
2592 : anton 1.48
2593 :     @example
2594 :     3 v2 5 cells + !
2595 : anton 1.65 v2 20 cells dump
2596 : anton 1.48 @end example
2597 :    
2598 :     You can reserve and initialize memory with @code{,}:
2599 :    
2600 :     @example
2601 :     create v3
2602 :     5 , 4 , 3 , 2 , 1 ,
2603 : anton 1.50 v3 @@ .
2604 :     v3 cell+ @@ .
2605 :     v3 2 cells + @@ .
2606 : anton 1.65 v3 5 cells dump
2607 : anton 1.48 @end example
2608 :    
2609 :     @assignment
2610 :     Write a definition @code{vsum ( addr u -- n )} that computes the sum of
2611 :     @code{u} cells, with the first of these cells at @code{addr}, the next
2612 :     one at @code{addr cell+} etc.
2613 :     @endassignment
2614 :    
2615 :     You can also reserve memory without creating a new word:
2616 :    
2617 :     @example
2618 : anton 1.60 here 10 cells allot .
2619 :     here .
2620 : anton 1.48 @end example
2621 :    
2622 :     @code{Here} pushes the start address of the memory area. You should
2623 :     store it somewhere, or you will have a hard time finding the memory area
2624 :     again.
2625 :    
2626 :     @code{Allot} manages dictionary memory. The dictionary memory contains
2627 :     the system's data structures for words etc. on Gforth and most other
2628 :     Forth systems. It is managed like a stack: You can free the memory that
2629 :     you have just @code{allot}ed with
2630 :    
2631 :     @example
2632 :     -10 cells allot
2633 : anton 1.60 here .
2634 : anton 1.48 @end example
2635 :    
2636 :     Note that you cannot do this if you have created a new word in the
2637 :     meantime (because then your @code{allot}ed memory is no longer on the
2638 :     top of the dictionary ``stack'').
2639 :    
2640 :     Alternatively, you can use @code{allocate} and @code{free} which allow
2641 :     freeing memory in any order:
2642 :    
2643 :     @example
2644 :     10 cells allocate throw .s
2645 :     20 cells allocate throw .s
2646 :     swap
2647 :     free throw
2648 :     free throw
2649 :     @end example
2650 :    
2651 :     The @code{throw}s deal with errors (e.g., out of memory).
2652 :    
2653 : anton 1.65 And there is also a
2654 :     @uref{http://www.complang.tuwien.ac.at/forth/garbage-collection.zip,
2655 :     garbage collector}, which eliminates the need to @code{free} memory
2656 :     explicitly.
2657 : anton 1.48
2658 : anton 1.66 Reference: @ref{Memory}.
2659 :    
2660 : anton 1.48
2661 :     @node Characters and Strings Tutorial, Alignment Tutorial, Memory Tutorial, Tutorial
2662 :     @section Characters and Strings
2663 : anton 1.66 @cindex strings tutorial
2664 :     @cindex characters tutorial
2665 : anton 1.48
2666 :     On the stack characters take up a cell, like numbers. In memory they
2667 :     have their own size (one 8-bit byte on most systems), and therefore
2668 :     require their own words for memory access:
2669 :    
2670 :     @example
2671 :     create v4
2672 :     104 c, 97 c, 108 c, 108 c, 111 c,
2673 : anton 1.50 v4 4 chars + c@@ .
2674 : anton 1.65 v4 5 chars dump
2675 : anton 1.48 @end example
2676 :    
2677 :     The preferred representation of strings on the stack is @code{addr
2678 :     u-count}, where @code{addr} is the address of the first character and
2679 :     @code{u-count} is the number of characters in the string.
2680 :    
2681 :     @example
2682 :     v4 5 type
2683 :     @end example
2684 :    
2685 :     You get a string constant with
2686 :    
2687 :     @example
2688 :     s" hello, world" .s
2689 :     type
2690 :     @end example
2691 :    
2692 :     Make sure you have a space between @code{s"} and the string; @code{s"}
2693 :     is a normal Forth word and must be delimited with white space (try what
2694 :     happens when you remove the space).
2695 :    
2696 :     However, this interpretive use of @code{s"} is quite restricted: the
2697 :     string exists only until the next call of @code{s"} (some Forth systems
2698 :     keep more than one of these strings, but usually they still have a
2699 : crook 1.62 limited lifetime).
2700 : anton 1.48
2701 :     @example
2702 :     s" hello," s" world" .s
2703 :     type
2704 :     type
2705 :     @end example
2706 :    
2707 : crook 1.62 You can also use @code{s"} in a definition, and the resulting
2708 :     strings then live forever (well, for as long as the definition):
2709 : anton 1.48
2710 :     @example
2711 :     : foo s" hello," s" world" ;
2712 :     foo .s
2713 :     type
2714 :     type
2715 :     @end example
2716 :    
2717 :     @assignment
2718 :     @code{Emit ( c -- )} types @code{c} as character (not a number).
2719 :     Implement @code{type ( addr u -- )}.
2720 :     @endassignment
2721 :    
2722 : anton 1.66 Reference: @ref{Memory Blocks}.
2723 :    
2724 :    
2725 : pazsan 1.84 @node Alignment Tutorial, Files Tutorial, Characters and Strings Tutorial, Tutorial
2726 : anton 1.48 @section Alignment
2727 : anton 1.66 @cindex alignment tutorial
2728 :     @cindex memory alignment tutorial
2729 : anton 1.48
2730 :     On many processors cells have to be aligned in memory, if you want to
2731 :     access them with @code{@@} and @code{!} (and even if the processor does
2732 : crook 1.62 not require alignment, access to aligned cells is faster).
2733 : anton 1.48
2734 :     @code{Create} aligns @code{here} (i.e., the place where the next
2735 :     allocation will occur, and that the @code{create}d word points to).
2736 :     Likewise, the memory produced by @code{allocate} starts at an aligned
2737 :     address. Adding a number of @code{cells} to an aligned address produces
2738 :     another aligned address.
2739 :    
2740 :     However, address arithmetic involving @code{char+} and @code{chars} can
2741 :     create an address that is not cell-aligned. @code{Aligned ( addr --
2742 :     a-addr )} produces the next aligned address:
2743 :    
2744 :     @example
2745 : anton 1.50 v3 char+ aligned .s @@ .
2746 :     v3 char+ .s @@ .
2747 : anton 1.48 @end example
2748 :    
2749 :     Similarly, @code{align} advances @code{here} to the next aligned
2750 :     address:
2751 :    
2752 :     @example
2753 :     create v5 97 c,
2754 :     here .
2755 :     align here .
2756 :     1000 ,
2757 :     @end example
2758 :    
2759 :     Note that you should use aligned addresses even if your processor does
2760 :     not require them, if you want your program to be portable.
2761 :    
2762 : anton 1.66 Reference: @ref{Address arithmetic}.
2763 :    
2764 : anton 1.48
2765 : pazsan 1.84 @node Files Tutorial, Interpretation and Compilation Semantics and Immediacy Tutorial, Alignment Tutorial, Tutorial
2766 :     @section Files
2767 :     @cindex files tutorial
2768 :    
2769 :     This section gives a short introduction into how to use files inside
2770 :     Forth. It's broken up into five easy steps:
2771 :    
2772 :     @enumerate 1
2773 :     @item Opened an ASCII text file for input
2774 :     @item Opened a file for output
2775 :     @item Read input file until string matched (or some other condition matched)
2776 :     @item Wrote some lines from input ( modified or not) to output
2777 :     @item Closed the files.
2778 :     @end enumerate
2779 :    
2780 :     @subsection Open file for input
2781 :    
2782 :     @example
2783 :     s" foo.in" r/o open-file throw Value fd-in
2784 :     @end example
2785 :    
2786 :     @subsection Create file for output
2787 :    
2788 :     @example
2789 :     s" foo.out" w/o create-file throw Value fd-out
2790 :     @end example
2791 :    
2792 :     The available file modes are r/o for read-only access, r/w for
2793 :     read-write access, and w/o for write-only access. You could open both
2794 :     files with r/w, too, if you like. All file words return error codes; for
2795 :     most applications, it's best to pass there error codes with @code{throw}
2796 :     to the outer error handler.
2797 :    
2798 :     If you want words for opening and assigning, define them as follows:
2799 :    
2800 :     @example
2801 :     0 Value fd-in
2802 :     0 Value fd-out
2803 :     : open-input ( addr u -- ) r/o open-file throw to fd-in ;
2804 :     : open-output ( addr u -- ) w/o create-file throw to fd-out ;
2805 :     @end example
2806 :    
2807 :     Usage example:
2808 :    
2809 :     @example
2810 :     s" foo.in" open-input
2811 :     s" foo.out" open-output
2812 :     @end example
2813 :    
2814 :     @subsection Scan file for a particular line
2815 :    
2816 :     @example
2817 :     256 Constant max-line
2818 :     Create line-buffer max-line 2 + allot
2819 :    
2820 :     : scan-file ( addr u -- )
2821 :     begin
2822 :     line-buffer max-line fd-in read-line throw
2823 :     while
2824 :     >r 2dup line-buffer r> compare 0=
2825 :     until
2826 :     else
2827 :     drop
2828 :     then
2829 :     2drop ;
2830 :     @end example
2831 :    
2832 :     @code{read-line ( addr u1 fd -- u2 flag ior )} reads up to u1 bytes into
2833 : anton 1.94 the buffer at addr, and returns the number of bytes read, a flag that is
2834 :     false when the end of file is reached, and an error code.
2835 : pazsan 1.84
2836 :     @code{compare ( addr1 u1 addr2 u2 -- n )} compares two strings and
2837 :     returns zero if both strings are equal. It returns a positive number if
2838 :     the first string is lexically greater, a negative if the second string
2839 :     is lexically greater.
2840 :    
2841 :     We haven't seen this loop here; it has two exits. Since the @code{while}
2842 :     exits with the number of bytes read on the stack, we have to clean up
2843 :     that separately; that's after the @code{else}.
2844 :    
2845 :     Usage example:
2846 :    
2847 :     @example
2848 :     s" The text I search is here" scan-file
2849 :     @end example
2850 :    
2851 :     @subsection Copy input to output
2852 :    
2853 :     @example
2854 :     : copy-file ( -- )
2855 :     begin
2856 :     line-buffer max-line fd-in read-line throw
2857 :     while
2858 :     line-buffer swap fd-out write-file throw
2859 :     repeat ;
2860 :     @end example
2861 :    
2862 :     @subsection Close files
2863 :    
2864 :     @example
2865 :     fd-in close-file throw
2866 :     fd-out close-file throw
2867 :     @end example
2868 :    
2869 :     Likewise, you can put that into definitions, too:
2870 :    
2871 :     @example
2872 :     : close-input ( -- ) fd-in close-file throw ;
2873 :     : close-output ( -- ) fd-out close-file throw ;
2874 :     @end example
2875 :    
2876 :     @assignment
2877 :     How could you modify @code{copy-file} so that it copies until a second line is
2878 :     matched? Can you write a program that extracts a section of a text file,
2879 :     given the line that starts and the line that terminates that section?
2880 :     @endassignment
2881 :    
2882 :     @node Interpretation and Compilation Semantics and Immediacy Tutorial, Execution Tokens Tutorial, Files Tutorial, Tutorial
2883 : anton 1.48 @section Interpretation and Compilation Semantics and Immediacy
2884 : anton 1.66 @cindex semantics tutorial
2885 :     @cindex interpretation semantics tutorial
2886 :     @cindex compilation semantics tutorial
2887 :     @cindex immediate, tutorial
2888 : anton 1.48
2889 :     When a word is compiled, it behaves differently from being interpreted.
2890 :     E.g., consider @code{+}:
2891 :    
2892 :     @example
2893 :     1 2 + .
2894 :     : foo + ;
2895 :     @end example
2896 :    
2897 :     These two behaviours are known as compilation and interpretation
2898 :     semantics. For normal words (e.g., @code{+}), the compilation semantics
2899 :     is to append the interpretation semantics to the currently defined word
2900 :     (@code{foo} in the example above). I.e., when @code{foo} is executed
2901 :     later, the interpretation semantics of @code{+} (i.e., adding two
2902 :     numbers) will be performed.
2903 :    
2904 :     However, there are words with non-default compilation semantics, e.g.,
2905 :     the control-flow words like @code{if}. You can use @code{immediate} to
2906 :     change the compilation semantics of the last defined word to be equal to
2907 :     the interpretation semantics:
2908 :    
2909 :     @example
2910 :     : [FOO] ( -- )
2911 :     5 . ; immediate
2912 :    
2913 :     [FOO]
2914 :     : bar ( -- )
2915 :     [FOO] ;
2916 :     bar
2917 :     see bar
2918 :     @end example
2919 :    
2920 :     Two conventions to mark words with non-default compilation semnatics are
2921 :     names with brackets (more frequently used) and to write them all in
2922 :     upper case (less frequently used).
2923 :    
2924 :     In Gforth (and many other systems) you can also remove the
2925 :     interpretation semantics with @code{compile-only} (the compilation
2926 :     semantics is derived from the original interpretation semantics):
2927 :    
2928 :     @example
2929 :     : flip ( -- )
2930 :     6 . ; compile-only \ but not immediate
2931 :     flip
2932 :    
2933 :     : flop ( -- )
2934 :     flip ;
2935 :     flop
2936 :     @end example
2937 :    
2938 :     In this example the interpretation semantics of @code{flop} is equal to
2939 :     the original interpretation semantics of @code{flip}.
2940 :    
2941 :     The text interpreter has two states: in interpret state, it performs the
2942 :     interpretation semantics of words it encounters; in compile state, it
2943 :     performs the compilation semantics of these words.
2944 :    
2945 :     Among other things, @code{:} switches into compile state, and @code{;}
2946 :     switches back to interpret state. They contain the factors @code{]}
2947 :     (switch to compile state) and @code{[} (switch to interpret state), that
2948 :     do nothing but switch the state.
2949 :    
2950 :     @example
2951 :     : xxx ( -- )
2952 :     [ 5 . ]
2953 :     ;
2954 :    
2955 :     xxx
2956 :     see xxx
2957 :     @end example
2958 :    
2959 :     These brackets are also the source of the naming convention mentioned
2960 :     above.
2961 :    
2962 : anton 1.66 Reference: @ref{Interpretation and Compilation Semantics}.
2963 :    
2964 : anton 1.48
2965 :     @node Execution Tokens Tutorial, Exceptions Tutorial, Interpretation and Compilation Semantics and Immediacy Tutorial, Tutorial
2966 :     @section Execution Tokens
2967 : anton 1.66 @cindex execution tokens tutorial
2968 :     @cindex XT tutorial
2969 : anton 1.48
2970 :     @code{' word} gives you the execution token (XT) of a word. The XT is a
2971 :     cell representing the interpretation semantics of a word. You can
2972 :     execute this semantics with @code{execute}:
2973 :    
2974 :     @example
2975 :     ' + .s
2976 :     1 2 rot execute .
2977 :     @end example
2978 :    
2979 :     The XT is similar to a function pointer in C. However, parameter
2980 :     passing through the stack makes it a little more flexible:
2981 :    
2982 :     @example
2983 :     : map-array ( ... addr u xt -- ... )
2984 : anton 1.50 \ executes xt ( ... x -- ... ) for every element of the array starting
2985 :     \ at addr and containing u elements
2986 : anton 1.48 @{ xt @}
2987 :     cells over + swap ?do
2988 : anton 1.50 i @@ xt execute
2989 : anton 1.48 1 cells +loop ;
2990 :    
2991 :     create a 3 , 4 , 2 , -1 , 4 ,
2992 :     a 5 ' . map-array .s
2993 :     0 a 5 ' + map-array .
2994 :     s" max-n" environment? drop .s
2995 :     a 5 ' min map-array .
2996 :     @end example
2997 :    
2998 :     You can use map-array with the XTs of words that consume one element
2999 :     more than they produce. In theory you can also use it with other XTs,
3000 :     but the stack effect then depends on the size of the array, which is
3001 :     hard to understand.
3002 :    
3003 : pazsan 1.51 Since XTs are cell-sized, you can store them in memory and manipulate
3004 :     them on the stack like other cells. You can also compile the XT into a
3005 : anton 1.48 word with @code{compile,}:
3006 :    
3007 :     @example
3008 :     : foo1 ( n1 n2 -- n )
3009 :     [ ' + compile, ] ;
3010 :     see foo
3011 :     @end example
3012 :    
3013 :     This is non-standard, because @code{compile,} has no compilation
3014 :     semantics in the standard, but it works in good Forth systems. For the
3015 :     broken ones, use
3016 :    
3017 :     @example
3018 :     : [compile,] compile, ; immediate
3019 :    
3020 :     : foo1 ( n1 n2 -- n )
3021 :     [ ' + ] [compile,] ;
3022 :     see foo
3023 :     @end example
3024 :    
3025 :     @code{'} is a word with default compilation semantics; it parses the
3026 :     next word when its interpretation semantics are executed, not during
3027 :     compilation:
3028 :    
3029 :     @example
3030 :     : foo ( -- xt )
3031 :     ' ;
3032 :     see foo
3033 :     : bar ( ... "word" -- ... )
3034 :     ' execute ;
3035 :     see bar
3036 : anton 1.60 1 2 bar + .
3037 : anton 1.48 @end example
3038 :    
3039 :     You often want to parse a word during compilation and compile its XT so
3040 :     it will be pushed on the stack at run-time. @code{[']} does this:
3041 :    
3042 :     @example
3043 :     : xt-+ ( -- xt )
3044 :     ['] + ;
3045 :     see xt-+
3046 :     1 2 xt-+ execute .
3047 :     @end example
3048 :    
3049 :     Many programmers tend to see @code{'} and the word it parses as one
3050 :     unit, and expect it to behave like @code{[']} when compiled, and are
3051 :     confused by the actual behaviour. If you are, just remember that the
3052 :     Forth system just takes @code{'} as one unit and has no idea that it is
3053 :     a parsing word (attempts to convenience programmers in this issue have
3054 :     usually resulted in even worse pitfalls, see
3055 : anton 1.66 @uref{http://www.complang.tuwien.ac.at/papers/ertl98.ps.gz,
3056 :     @code{State}-smartness---Why it is evil and How to Exorcise it}).
3057 : anton 1.48
3058 :     Note that the state of the interpreter does not come into play when
3059 : pazsan 1.51 creating and executing XTs. I.e., even when you execute @code{'} in
3060 : anton 1.48 compile state, it still gives you the interpretation semantics. And
3061 :     whatever that state is, @code{execute} performs the semantics
3062 : anton 1.66 represented by the XT (i.e., for XTs produced with @code{'} the
3063 :     interpretation semantics).
3064 :    
3065 :     Reference: @ref{Tokens for Words}.
3066 : anton 1.48
3067 :    
3068 :     @node Exceptions Tutorial, Defining Words Tutorial, Execution Tokens Tutorial, Tutorial
3069 :     @section Exceptions
3070 : anton 1.66 @cindex exceptions tutorial
3071 : anton 1.48
3072 :     @code{throw ( n -- )} causes an exception unless n is zero.
3073 :    
3074 :     @example
3075 :     100 throw .s
3076 :     0 throw .s
3077 :     @end example
3078 :    
3079 :     @code{catch ( ... xt -- ... n )} behaves similar to @code{execute}, but
3080 :     it catches exceptions and pushes the number of the exception on the
3081 :     stack (or 0, if the xt executed without exception). If there was an
3082 :     exception, the stacks have the same depth as when entering @code{catch}:
3083 :    
3084 :     @example
3085 :     .s
3086 :     3 0 ' / catch .s
3087 :     3 2 ' / catch .s
3088 :     @end example
3089 :    
3090 :     @assignment
3091 :     Try the same with @code{execute} instead of @code{catch}.
3092 :     @endassignment
3093 :    
3094 :     @code{Throw} always jumps to the dynamically next enclosing
3095 :     @code{catch}, even if it has to leave several call levels to achieve
3096 :     this:
3097 :    
3098 :     @example
3099 :     : foo 100 throw ;
3100 :     : foo1 foo ." after foo" ;
3101 : pazsan 1.51 : bar ['] foo1 catch ;
3102 : anton 1.60 bar .
3103 : anton 1.48 @end example
3104 :    
3105 :     It is often important to restore a value upon leaving a definition, even
3106 :     if the definition is left through an exception. You can ensure this
3107 :     like this:
3108 :    
3109 :     @example
3110 :     : ...
3111 :     save-x
3112 : pazsan 1.51 ['] word-changing-x catch ( ... n )
3113 : anton 1.48 restore-x
3114 :     ( ... n ) throw ;
3115 :     @end example
3116 :    
3117 : anton 1.55 Gforth provides an alternative syntax in addition to @code{catch}:
3118 : anton 1.48 @code{try ... recover ... endtry}. If the code between @code{try} and
3119 :     @code{recover} has an exception, the stack depths are restored, the
3120 :     exception number is pushed on the stack, and the code between
3121 :     @code{recover} and @code{endtry} is performed. E.g., the definition for
3122 :     @code{catch} is
3123 :    
3124 :     @example
3125 :     : catch ( x1 .. xn xt -- y1 .. ym 0 / z1 .. zn error ) \ exception
3126 :     try
3127 :     execute 0
3128 :     recover
3129 :     nip
3130 :     endtry ;
3131 :     @end example
3132 :    
3133 :     The equivalent to the restoration code above is
3134 :    
3135 :     @example
3136 :     : ...
3137 :     save-x
3138 :     try
3139 : anton 1.92 word-changing-x 0
3140 :     recover endtry
3141 : anton 1.48 restore-x
3142 :     throw ;
3143 :     @end example
3144 :    
3145 : anton 1.92 This works if @code{word-changing-x} does not change the stack depth,
3146 :     otherwise you should add some code between @code{recover} and
3147 :     @code{endtry} to balance the stack.
3148 : anton 1.48
3149 : anton 1.66 Reference: @ref{Exception Handling}.
3150 :    
3151 : anton 1.48
3152 :     @node Defining Words Tutorial, Arrays and Records Tutorial, Exceptions Tutorial, Tutorial
3153 :     @section Defining Words
3154 : anton 1.66 @cindex defining words tutorial
3155 :     @cindex does> tutorial
3156 :     @cindex create...does> tutorial
3157 :    
3158 :     @c before semantics?
3159 : anton 1.48
3160 :     @code{:}, @code{create}, and @code{variable} are definition words: They
3161 :     define other words. @code{Constant} is another definition word:
3162 :    
3163 :     @example
3164 :     5 constant foo
3165 :     foo .
3166 :     @end example
3167 :    
3168 :     You can also use the prefixes @code{2} (double-cell) and @code{f}
3169 :     (floating point) with @code{variable} and @code{constant}.
3170 :    
3171 :     You can also define your own defining words. E.g.:
3172 :    
3173 :     @example
3174 :     : variable ( "name" -- )
3175 :     create 0 , ;
3176 :     @end example
3177 :    
3178 :     You can also define defining words that create words that do something
3179 :     other than just producing their address:
3180 :    
3181 :     @example
3182 :     : constant ( n "name" -- )
3183 :     create ,
3184 :     does> ( -- n )
3185 : anton 1.50 ( addr ) @@ ;
3186 : anton 1.48
3187 :     5 constant foo
3188 :     foo .
3189 :     @end example
3190 :    
3191 :     The definition of @code{constant} above ends at the @code{does>}; i.e.,
3192 :     @code{does>} replaces @code{;}, but it also does something else: It
3193 :     changes the last defined word such that it pushes the address of the
3194 :     body of the word and then performs the code after the @code{does>}
3195 :     whenever it is called.
3196 :    
3197 :     In the example above, @code{constant} uses @code{,} to store 5 into the
3198 :     body of @code{foo}. When @code{foo} executes, it pushes the address of
3199 :     the body onto the stack, then (in the code after the @code{does>})
3200 :     fetches the 5 from there.
3201 :    
3202 :     The stack comment near the @code{does>} reflects the stack effect of the
3203 :     defined word, not the stack effect of the code after the @code{does>}
3204 :     (the difference is that the code expects the address of the body that
3205 :     the stack comment does not show).
3206 :    
3207 :     You can use these definition words to do factoring in cases that involve
3208 :     (other) definition words. E.g., a field offset is always added to an
3209 :     address. Instead of defining
3210 :    
3211 :     @example
3212 :     2 cells constant offset-field1
3213 :     @end example
3214 :    
3215 :     and using this like
3216 :    
3217 :     @example
3218 :     ( addr ) offset-field1 +
3219 :     @end example
3220 :    
3221 :     you can define a definition word
3222 :    
3223 :     @example
3224 :     : simple-field ( n "name" -- )
3225 :     create ,
3226 :     does> ( n1 -- n1+n )
3227 : anton 1.50 ( addr ) @@ + ;
3228 : anton 1.48 @end example
3229 : crook 1.21
3230 : anton 1.48 Definition and use of field offsets now look like this:
3231 : crook 1.21
3232 : anton 1.48 @example
3233 :     2 cells simple-field field1
3234 : anton 1.60 create mystruct 4 cells allot
3235 :     mystruct .s field1 .s drop
3236 : anton 1.48 @end example
3237 : crook 1.21
3238 : anton 1.48 If you want to do something with the word without performing the code
3239 :     after the @code{does>}, you can access the body of a @code{create}d word
3240 :     with @code{>body ( xt -- addr )}:
3241 : crook 1.21
3242 : anton 1.48 @example
3243 :     : value ( n "name" -- )
3244 :     create ,
3245 :     does> ( -- n1 )
3246 : anton 1.50 @@ ;
3247 : anton 1.48 : to ( n "name" -- )
3248 :     ' >body ! ;
3249 : crook 1.21
3250 : anton 1.48 5 value foo
3251 :     foo .
3252 :     7 to foo
3253 :     foo .
3254 :     @end example
3255 : crook 1.21
3256 : anton 1.48 @assignment
3257 :     Define @code{defer ( "name" -- )}, which creates a word that stores an
3258 :     XT (at the start the XT of @code{abort}), and upon execution
3259 :     @code{execute}s the XT. Define @code{is ( xt "name" -- )} that stores
3260 :     @code{xt} into @code{name}, a word defined with @code{defer}. Indirect
3261 :     recursion is one application of @code{defer}.
3262 :     @endassignment
3263 : crook 1.29
3264 : anton 1.66 Reference: @ref{User-defined Defining Words}.
3265 :    
3266 :    
3267 : anton 1.48 @node Arrays and Records Tutorial, POSTPONE Tutorial, Defining Words Tutorial, Tutorial
3268 :     @section Arrays and Records
3269 : anton 1.66 @cindex arrays tutorial
3270 :     @cindex records tutorial
3271 :     @cindex structs tutorial
3272 : crook 1.29
3273 : anton 1.48 Forth has no standard words for defining data structures such as arrays
3274 :     and records (structs in C terminology), but you can build them yourself
3275 :     based on address arithmetic. You can also define words for defining
3276 :     arrays and records (@pxref{Defining Words Tutorial,, Defining Words}).
3277 : crook 1.29
3278 : anton 1.48 One of the first projects a Forth newcomer sets out upon when learning
3279 :     about defining words is an array defining word (possibly for
3280 :     n-dimensional arrays). Go ahead and do it, I did it, too; you will
3281 :     learn something from it. However, don't be disappointed when you later
3282 :     learn that you have little use for these words (inappropriate use would
3283 :     be even worse). I have not yet found a set of useful array words yet;
3284 :     the needs are just too diverse, and named, global arrays (the result of
3285 :     naive use of defining words) are often not flexible enough (e.g.,
3286 : anton 1.66 consider how to pass them as parameters). Another such project is a set
3287 :     of words to help dealing with strings.
3288 : crook 1.29
3289 : anton 1.48 On the other hand, there is a useful set of record words, and it has
3290 :     been defined in @file{compat/struct.fs}; these words are predefined in
3291 :     Gforth. They are explained in depth elsewhere in this manual (see
3292 :     @pxref{Structures}). The @code{simple-field} example above is
3293 :     simplified variant of fields in this package.
3294 : crook 1.21
3295 :    
3296 : anton 1.48 @node POSTPONE Tutorial, Literal Tutorial, Arrays and Records Tutorial, Tutorial
3297 :     @section @code{POSTPONE}
3298 : anton 1.66 @cindex postpone tutorial
3299 : crook 1.21
3300 : anton 1.48 You can compile the compilation semantics (instead of compiling the
3301 :     interpretation semantics) of a word with @code{POSTPONE}:
3302 : crook 1.21
3303 : anton 1.48 @example
3304 :     : MY-+ ( Compilation: -- ; Run-time of compiled code: n1 n2 -- n )
3305 : pazsan 1.51 POSTPONE + ; immediate
3306 : anton 1.48 : foo ( n1 n2 -- n )
3307 :     MY-+ ;
3308 :     1 2 foo .
3309 :     see foo
3310 :     @end example
3311 : crook 1.21
3312 : anton 1.48 During the definition of @code{foo} the text interpreter performs the
3313 :     compilation semantics of @code{MY-+}, which performs the compilation
3314 :     semantics of @code{+}, i.e., it compiles @code{+} into @code{foo}.
3315 :    
3316 :     This example also displays separate stack comments for the compilation
3317 :     semantics and for the stack effect of the compiled code. For words with
3318 :     default compilation semantics these stack effects are usually not
3319 :     displayed; the stack effect of the compilation semantics is always
3320 :     @code{( -- )} for these words, the stack effect for the compiled code is
3321 :     the stack effect of the interpretation semantics.
3322 :    
3323 :     Note that the state of the interpreter does not come into play when
3324 :     performing the compilation semantics in this way. You can also perform
3325 :     it interpretively, e.g.:
3326 :    
3327 :     @example
3328 :     : foo2 ( n1 n2 -- n )
3329 :     [ MY-+ ] ;
3330 :     1 2 foo .
3331 :     see foo
3332 :     @end example
3333 : crook 1.21
3334 : anton 1.48 However, there are some broken Forth systems where this does not always
3335 : crook 1.62 work, and therefore this practice was been declared non-standard in
3336 : anton 1.48 1999.
3337 :     @c !! repair.fs
3338 :    
3339 :     Here is another example for using @code{POSTPONE}:
3340 : crook 1.44
3341 : anton 1.48 @example
3342 :     : MY-- ( Compilation: -- ; Run-time of compiled code: n1 n2 -- n )
3343 :     POSTPONE negate POSTPONE + ; immediate compile-only
3344 :     : bar ( n1 n2 -- n )
3345 :     MY-- ;
3346 :     2 1 bar .
3347 :     see bar
3348 :     @end example
3349 : crook 1.21
3350 : anton 1.48 You can define @code{ENDIF} in this way:
3351 : crook 1.21
3352 : anton 1.48 @example
3353 :     : ENDIF ( Compilation: orig -- )
3354 :     POSTPONE then ; immediate
3355 :     @end example
3356 : crook 1.21
3357 : anton 1.48 @assignment
3358 :     Write @code{MY-2DUP} that has compilation semantics equivalent to
3359 :     @code{2dup}, but compiles @code{over over}.
3360 :     @endassignment
3361 : crook 1.29
3362 : anton 1.66 @c !! @xref{Macros} for reference
3363 :    
3364 :    
3365 : anton 1.48 @node Literal Tutorial, Advanced macros Tutorial, POSTPONE Tutorial, Tutorial
3366 :     @section @code{Literal}
3367 : anton 1.66 @cindex literal tutorial
3368 : crook 1.29
3369 : anton 1.48 You cannot @code{POSTPONE} numbers:
3370 : crook 1.21
3371 : anton 1.48 @example
3372 :     : [FOO] POSTPONE 500 ; immediate
3373 : crook 1.21 @end example
3374 :    
3375 : anton 1.48 Instead, you can use @code{LITERAL (compilation: n --; run-time: -- n )}:
3376 : crook 1.29
3377 : anton 1.48 @example
3378 :     : [FOO] ( compilation: --; run-time: -- n )
3379 :     500 POSTPONE literal ; immediate
3380 : crook 1.29
3381 : anton 1.60 : flip [FOO] ;
3382 : anton 1.48 flip .
3383 :     see flip
3384 :     @end example
3385 : crook 1.29
3386 : anton 1.48 @code{LITERAL} consumes a number at compile-time (when it's compilation
3387 :     semantics are executed) and pushes it at run-time (when the code it
3388 :     compiled is executed). A frequent use of @code{LITERAL} is to compile a
3389 :     number computed at compile time into the current word:
3390 : crook 1.29
3391 : anton 1.48 @example
3392 :     : bar ( -- n )
3393 :     [ 2 2 + ] literal ;
3394 :     see bar
3395 :     @end example
3396 : crook 1.29
3397 : anton 1.48 @assignment
3398 :     Write @code{]L} which allows writing the example above as @code{: bar (
3399 :     -- n ) [ 2 2 + ]L ;}
3400 :     @endassignment
3401 :    
3402 : anton 1.66 @c !! @xref{Macros} for reference
3403 :    
3404 : anton 1.48
3405 :     @node Advanced macros Tutorial, Compilation Tokens Tutorial, Literal Tutorial, Tutorial
3406 :     @section Advanced macros
3407 : anton 1.66 @cindex macros, advanced tutorial
3408 :     @cindex run-time code generation, tutorial
3409 : anton 1.48
3410 : anton 1.66 Reconsider @code{map-array} from @ref{Execution Tokens Tutorial,,
3411 :     Execution Tokens}. It frequently performs @code{execute}, a relatively
3412 :     expensive operation in some Forth implementations. You can use
3413 : anton 1.48 @code{compile,} and @code{POSTPONE} to eliminate these @code{execute}s
3414 :     and produce a word that contains the word to be performed directly:
3415 :    
3416 :     @c use ]] ... [[
3417 :     @example
3418 :     : compile-map-array ( compilation: xt -- ; run-time: ... addr u -- ... )
3419 :     \ at run-time, execute xt ( ... x -- ... ) for each element of the
3420 :     \ array beginning at addr and containing u elements
3421 :     @{ xt @}
3422 :     POSTPONE cells POSTPONE over POSTPONE + POSTPONE swap POSTPONE ?do
3423 : anton 1.50 POSTPONE i POSTPONE @@ xt compile,
3424 : anton 1.48 1 cells POSTPONE literal POSTPONE +loop ;
3425 :    
3426 :     : sum-array ( addr u -- n )
3427 :     0 rot rot [ ' + compile-map-array ] ;
3428 :     see sum-array
3429 :     a 5 sum-array .
3430 :     @end example
3431 :    
3432 :     You can use the full power of Forth for generating the code; here's an
3433 :     example where the code is generated in a loop:
3434 :    
3435 :     @example
3436 :     : compile-vmul-step ( compilation: n --; run-time: n1 addr1 -- n2 addr2 )
3437 :     \ n2=n1+(addr1)*n, addr2=addr1+cell
3438 : anton 1.50 POSTPONE tuck POSTPONE @@
3439 : anton 1.48 POSTPONE literal POSTPONE * POSTPONE +
3440 :     POSTPONE swap POSTPONE cell+ ;
3441 :    
3442 :     : compile-vmul ( compilation: addr1 u -- ; run-time: addr2 -- n )
3443 : pazsan 1.51 \ n=v1*v2 (inner product), where the v_i are represented as addr_i u
3444 : anton 1.48 0 postpone literal postpone swap
3445 :     [ ' compile-vmul-step compile-map-array ]
3446 :     postpone drop ;
3447 :     see compile-vmul
3448 :    
3449 :     : a-vmul ( addr -- n )
3450 : pazsan 1.51 \ n=a*v, where v is a vector that's as long as a and starts at addr
3451 : anton 1.48 [ a 5 compile-vmul ] ;
3452 :     see a-vmul
3453 :     a a-vmul .
3454 :     @end example
3455 :    
3456 :     This example uses @code{compile-map-array} to show off, but you could
3457 : anton 1.66 also use @code{map-array} instead (try it now!).
3458 : anton 1.48
3459 :     You can use this technique for efficient multiplication of large
3460 :     matrices. In matrix multiplication, you multiply every line of one
3461 :     matrix with every column of the other matrix. You can generate the code
3462 :     for one line once, and use it for every column. The only downside of
3463 :     this technique is that it is cumbersome to recover the memory consumed
3464 :     by the generated code when you are done (and in more complicated cases
3465 :     it is not possible portably).
3466 :    
3467 : anton 1.66 @c !! @xref{Macros} for reference
3468 :    
3469 :    
3470 : anton 1.48 @node Compilation Tokens Tutorial, Wordlists and Search Order Tutorial, Advanced macros Tutorial, Tutorial
3471 :     @section Compilation Tokens
3472 : anton 1.66 @cindex compilation tokens, tutorial
3473 :     @cindex CT, tutorial
3474 : anton 1.48
3475 :     This section is Gforth-specific. You can skip it.
3476 :    
3477 :     @code{' word compile,} compiles the interpretation semantics. For words
3478 :     with default compilation semantics this is the same as performing the
3479 :     compilation semantics. To represent the compilation semantics of other
3480 :     words (e.g., words like @code{if} that have no interpretation
3481 :     semantics), Gforth has the concept of a compilation token (CT,
3482 :     consisting of two cells), and words @code{comp'} and @code{[comp']}.
3483 :     You can perform the compilation semantics represented by a CT with
3484 :     @code{execute}:
3485 : crook 1.29
3486 : anton 1.48 @example
3487 :     : foo2 ( n1 n2 -- n )
3488 :     [ comp' + execute ] ;
3489 :     see foo
3490 :     @end example
3491 : crook 1.29
3492 : anton 1.48 You can compile the compilation semantics represented by a CT with
3493 :     @code{postpone,}:
3494 : anton 1.30
3495 : anton 1.48 @example
3496 :     : foo3 ( -- )
3497 :     [ comp' + postpone, ] ;
3498 :     see foo3
3499 :     @end example
3500 : anton 1.30
3501 : pazsan 1.51 @code{[ comp' word postpone, ]} is equivalent to @code{POSTPONE word}.
3502 : anton 1.48 @code{comp'} is particularly useful for words that have no
3503 :     interpretation semantics:
3504 : crook 1.29
3505 : anton 1.30 @example
3506 : anton 1.48 ' if
3507 : anton 1.60 comp' if .s 2drop
3508 : anton 1.30 @end example
3509 :    
3510 : anton 1.66 Reference: @ref{Tokens for Words}.
3511 :    
3512 : crook 1.29
3513 : anton 1.48 @node Wordlists and Search Order Tutorial, , Compilation Tokens Tutorial, Tutorial
3514 :     @section Wordlists and Search Order
3515 : anton 1.66 @cindex wordlists tutorial
3516 :     @cindex search order, tutorial
3517 : anton 1.48
3518 :     The dictionary is not just a memory area that allows you to allocate
3519 :     memory with @code{allot}, it also contains the Forth words, arranged in
3520 :     several wordlists. When searching for a word in a wordlist,
3521 :     conceptually you start searching at the youngest and proceed towards
3522 :     older words (in reality most systems nowadays use hash-tables); i.e., if
3523 :     you define a word with the same name as an older word, the new word
3524 :     shadows the older word.
3525 :    
3526 :     Which wordlists are searched in which order is determined by the search
3527 :     order. You can display the search order with @code{order}. It displays
3528 :     first the search order, starting with the wordlist searched first, then
3529 :     it displays the wordlist that will contain newly defined words.
3530 : crook 1.21
3531 : anton 1.48 You can create a new, empty wordlist with @code{wordlist ( -- wid )}:
3532 : crook 1.21
3533 : anton 1.48 @example
3534 :     wordlist constant mywords
3535 :     @end example
3536 : crook 1.21
3537 : anton 1.48 @code{Set-current ( wid -- )} sets the wordlist that will contain newly
3538 :     defined words (the @emph{current} wordlist):
3539 : crook 1.21
3540 : anton 1.48 @example
3541 :     mywords set-current
3542 :     order
3543 :     @end example
3544 : crook 1.26
3545 : anton 1.48 Gforth does not display a name for the wordlist in @code{mywords}
3546 :     because this wordlist was created anonymously with @code{wordlist}.
3547 : crook 1.21
3548 : anton 1.48 You can get the current wordlist with @code{get-current ( -- wid)}. If
3549 :     you want to put something into a specific wordlist without overall
3550 :     effect on the current wordlist, this typically looks like this:
3551 : crook 1.21
3552 : anton 1.48 @example
3553 :     get-current mywords set-current ( wid )
3554 :     create someword
3555 :     ( wid ) set-current
3556 :     @end example
3557 : crook 1.21
3558 : anton 1.48 You can write the search order with @code{set-order ( wid1 .. widn n --
3559 :     )} and read it with @code{get-order ( -- wid1 .. widn n )}. The first
3560 :     searched wordlist is topmost.
3561 : crook 1.21
3562 : anton 1.48 @example
3563 :     get-order mywords swap 1+ set-order
3564 :     order
3565 :     @end example
3566 : crook 1.21
3567 : anton 1.48 Yes, the order of wordlists in the output of @code{order} is reversed
3568 :     from stack comments and the output of @code{.s} and thus unintuitive.
3569 : crook 1.21
3570 : anton 1.48 @assignment
3571 :     Define @code{>order ( wid -- )} with adds @code{wid} as first searched
3572 :     wordlist to the search order. Define @code{previous ( -- )}, which
3573 :     removes the first searched wordlist from the search order. Experiment
3574 :     with boundary conditions (you will see some crashes or situations that
3575 :     are hard or impossible to leave).
3576 :     @endassignment
3577 : crook 1.21
3578 : anton 1.48 The search order is a powerful foundation for providing features similar
3579 :     to Modula-2 modules and C++ namespaces. However, trying to modularize
3580 :     programs in this way has disadvantages for debugging and reuse/factoring
3581 :     that overcome the advantages in my experience (I don't do huge projects,
3582 : anton 1.55 though). These disadvantages are not so clear in other
3583 : anton 1.82 languages/programming environments, because these languages are not so
3584 : anton 1.48 strong in debugging and reuse.
3585 : crook 1.21
3586 : anton 1.66 @c !! example
3587 :    
3588 :     Reference: @ref{Word Lists}.
3589 : crook 1.21
3590 : crook 1.29 @c ******************************************************************
3591 : anton 1.48 @node Introduction, Words, Tutorial, Top
3592 : crook 1.29 @comment node-name, next, previous, up
3593 :     @chapter An Introduction to ANS Forth
3594 :     @cindex Forth - an introduction
3595 : crook 1.21
3596 : anton 1.83 The difference of this chapter from the Tutorial (@pxref{Tutorial}) is
3597 :     that it is slower-paced in its examples, but uses them to dive deep into
3598 :     explaining Forth internals (not covered by the Tutorial). Apart from
3599 :     that, this chapter covers far less material. It is suitable for reading
3600 :     without using a computer.
3601 :    
3602 : crook 1.29 The primary purpose of this manual is to document Gforth. However, since
3603 :     Forth is not a widely-known language and there is a lack of up-to-date
3604 :     teaching material, it seems worthwhile to provide some introductory
3605 : anton 1.49 material. For other sources of Forth-related
3606 :     information, see @ref{Forth-related information}.
3607 : crook 1.21
3608 : crook 1.29 The examples in this section should work on any ANS Forth; the
3609 :     output shown was produced using Gforth. Each example attempts to
3610 :     reproduce the exact output that Gforth produces. If you try out the
3611 :     examples (and you should), what you should type is shown @kbd{like this}
3612 :     and Gforth's response is shown @code{like this}. The single exception is
3613 : anton 1.30 that, where the example shows @key{RET} it means that you should
3614 : crook 1.29 press the ``carriage return'' key. Unfortunately, some output formats for
3615 :     this manual cannot show the difference between @kbd{this} and
3616 :     @code{this} which will make trying out the examples harder (but not
3617 :     impossible).
3618 : crook 1.21
3619 : crook 1.29 Forth is an unusual language. It provides an interactive development
3620 :     environment which includes both an interpreter and compiler. Forth
3621 :     programming style encourages you to break a problem down into many
3622 :     @cindex factoring
3623 :     small fragments (@dfn{factoring}), and then to develop and test each
3624 :     fragment interactively. Forth advocates assert that breaking the
3625 :     edit-compile-test cycle used by conventional programming languages can
3626 :     lead to great productivity improvements.
3627 : crook 1.21
3628 : crook 1.29 @menu
3629 : anton 1.67 * Introducing the Text Interpreter::
3630 :     * Stacks and Postfix notation::
3631 :     * Your first definition::
3632 :     * How does that work?::
3633 :     * Forth is written in Forth::
3634 :     * Review - elements of a Forth system::
3635 :     * Where to go next::
3636 :     * Exercises::
3637 : crook 1.29 @end menu
3638 : crook 1.21
3639 : crook 1.29 @comment ----------------------------------------------
3640 :     @node Introducing the Text Interpreter, Stacks and Postfix notation, Introduction, Introduction
3641 :     @section Introducing the Text Interpreter
3642 :     @cindex text interpreter
3643 :     @cindex outer interpreter
3644 : crook 1.21
3645 : anton 1.30 @c IMO this is too detailed and the pace is too slow for
3646 :     @c an introduction. If you know German, take a look at
3647 :     @c http://www.complang.tuwien.ac.at/anton/lvas/skriptum-stack.html
3648 :     @c to see how I do it - anton
3649 :    
3650 : crook 1.44 @c nac-> Where I have accepted your comments 100% and modified the text
3651 :     @c accordingly, I have deleted your comments. Elsewhere I have added a
3652 :     @c response like this to attempt to rationalise what I have done. Of
3653 :     @c course, this is a very clumsy mechanism for something that would be
3654 :     @c done far more efficiently over a beer. Please delete any dialogue
3655 :     @c you consider closed.
3656 :    
3657 : crook 1.29 When you invoke the Forth image, you will see a startup banner printed
3658 :     and nothing else (if you have Gforth installed on your system, try
3659 : anton 1.30 invoking it now, by typing @kbd{gforth@key{RET}}). Forth is now running
3660 : crook 1.29 its command line interpreter, which is called the @dfn{Text Interpreter}
3661 :     (also known as the @dfn{Outer Interpreter}). (You will learn a lot
3662 : anton 1.49 about the text interpreter as you read through this chapter, for more
3663 :     detail @pxref{The Text Interpreter}).
3664 : crook 1.21
3665 : crook 1.29 Although it's not obvious, Forth is actually waiting for your
3666 : anton 1.30 input. Type a number and press the @key{RET} key:
3667 : crook 1.21
3668 : crook 1.26 @example
3669 : anton 1.30 @kbd{45@key{RET}} ok
3670 : crook 1.26 @end example
3671 : crook 1.21
3672 : crook 1.29 Rather than give you a prompt to invite you to input something, the text
3673 :     interpreter prints a status message @i{after} it has processed a line
3674 :     of input. The status message in this case (``@code{ ok}'' followed by
3675 :     carriage-return) indicates that the text interpreter was able to process
3676 :     all of your input successfully. Now type something illegal:
3677 :    
3678 :     @example
3679 : anton 1.30 @kbd{qwer341@key{RET}}
3680 : crook 1.29 :1: Undefined word
3681 :     qwer341
3682 :     ^^^^^^^
3683 :     $400D2BA8 Bounce
3684 :     $400DBDA8 no.extensions
3685 :     @end example
3686 : crook 1.23
3687 : crook 1.29 The exact text, other than the ``Undefined word'' may differ slightly on
3688 :     your system, but the effect is the same; when the text interpreter
3689 :     detects an error, it discards any remaining text on a line, resets
3690 : anton 1.49 certain internal state and prints an error message. For a detailed description of error messages see @ref{Error
3691 :     messages}.
3692 : crook 1.23
3693 : crook 1.29 The text interpreter waits for you to press carriage-return, and then
3694 :     processes your input line. Starting at the beginning of the line, it
3695 :     breaks the line into groups of characters separated by spaces. For each
3696 :     group of characters in turn, it makes two attempts to do something:
3697 : crook 1.23
3698 : crook 1.29 @itemize @bullet
3699 :     @item
3700 : crook 1.44 @cindex name dictionary
3701 : crook 1.29 It tries to treat it as a command. It does this by searching a @dfn{name
3702 :     dictionary}. If the group of characters matches an entry in the name
3703 :     dictionary, the name dictionary provides the text interpreter with
3704 :     information that allows the text interpreter perform some actions. In
3705 :     Forth jargon, we say that the group
3706 :     @cindex word
3707 :     @cindex definition
3708 :     @cindex execution token
3709 :     @cindex xt
3710 :     of characters names a @dfn{word}, that the dictionary search returns an
3711 :     @dfn{execution token (xt)} corresponding to the @dfn{definition} of the
3712 :     word, and that the text interpreter executes the xt. Often, the terms
3713 :     @dfn{word} and @dfn{definition} are used interchangeably.
3714 :     @item
3715 :     If the text interpreter fails to find a match in the name dictionary, it
3716 :     tries to treat the group of characters as a number in the current number
3717 :     base (when you start up Forth, the current number base is base 10). If
3718 :     the group of characters legitimately represents a number, the text
3719 :     interpreter pushes the number onto a stack (we'll learn more about that
3720 :     in the next section).
3721 :     @end itemize
3722 : crook 1.23
3723 : crook 1.29 If the text interpreter is unable to do either of these things with any
3724 :     group of characters, it discards the group of characters and the rest of
3725 :     the line, then prints an error message. If the text interpreter reaches
3726 :     the end of the line without error, it prints the status message ``@code{ ok}''
3727 :     followed by carriage-return.
3728 : crook 1.21
3729 : crook 1.29 This is the simplest command we can give to the text interpreter:
3730 : crook 1.23
3731 :     @example
3732 : anton 1.30 @key{RET} ok
3733 : crook 1.23 @end example
3734 : crook 1.21
3735 : crook 1.29 The text interpreter did everything we asked it to do (nothing) without
3736 :     an error, so it said that everything is ``@code{ ok}''. Try a slightly longer
3737 :     command:
3738 : crook 1.21
3739 : crook 1.23 @example
3740 : anton 1.30 @kbd{12 dup fred dup@key{RET}}
3741 : crook 1.29 :1: Undefined word
3742 :     12 dup fred dup
3743 :     ^^^^
3744 :     $400D2BA8 Bounce
3745 :     $400DBDA8 no.extensions
3746 : crook 1.23 @end example
3747 : crook 1.21
3748 : crook 1.29 When you press the carriage-return key, the text interpreter starts to
3749 :     work its way along the line:
3750 : crook 1.21
3751 : crook 1.29 @itemize @bullet
3752 :     @item
3753 :     When it gets to the space after the @code{2}, it takes the group of
3754 :     characters @code{12} and looks them up in the name
3755 :     dictionary@footnote{We can't tell if it found them or not, but assume
3756 :     for now that it did not}. There is no match for this group of characters
3757 :     in the name dictionary, so it tries to treat them as a number. It is
3758 :     able to do this successfully, so it puts the number, 12, ``on the stack''
3759 :     (whatever that means).
3760 :     @item
3761 :     The text interpreter resumes scanning the line and gets the next group
3762 :     of characters, @code{dup}. It looks it up in the name dictionary and
3763 :     (you'll have to take my word for this) finds it, and executes the word
3764 :     @code{dup} (whatever that means).
3765 :     @item
3766 :     Once again, the text interpreter resumes scanning the line and gets the
3767 :     group of characters @code{fred}. It looks them up in the name
3768 :     dictionary, but can't find them. It tries to treat them as a number, but
3769 :     they don't represent any legal number.
3770 :     @end itemize
3771 : crook 1.21
3772 : crook 1.29 At this point, the text interpreter gives up and prints an error
3773 :     message. The error message shows exactly how far the text interpreter
3774 :     got in processing the line. In particular, it shows that the text
3775 :     interpreter made no attempt to do anything with the final character
3776 :     group, @code{dup}, even though we have good reason to believe that the
3777 :     text interpreter would have no problem looking that word up and
3778 :     executing it a second time.
3779 : crook 1.21
3780 :    
3781 : crook 1.29 @comment ----------------------------------------------
3782 :     @node Stacks and Postfix notation, Your first definition, Introducing the Text Interpreter, Introduction
3783 :     @section Stacks, postfix notation and parameter passing
3784 :     @cindex text interpreter
3785 :     @cindex outer interpreter
3786 : crook 1.21
3787 : crook 1.29 In procedural programming languages (like C and Pascal), the
3788 :     building-block of programs is the @dfn{function} or @dfn{procedure}. These
3789 :     functions or procedures are called with @dfn{explicit parameters}. For
3790 :     example, in C we might write:
3791 : crook 1.21
3792 : crook 1.23 @example
3793 : crook 1.29 total = total + new_volume(length,height,depth);
3794 : crook 1.23 @end example
3795 : crook 1.21
3796 : crook 1.23 @noindent
3797 : crook 1.29 where new_volume is a function-call to another piece of code, and total,
3798 :     length, height and depth are all variables. length, height and depth are
3799 :     parameters to the function-call.
3800 : crook 1.21
3801 : crook 1.29 In Forth, the equivalent of the function or procedure is the
3802 :     @dfn{definition} and parameters are implicitly passed between
3803 :     definitions using a shared stack that is visible to the
3804 :     programmer. Although Forth does support variables, the existence of the
3805 :     stack means that they are used far less often than in most other
3806 :     programming languages. When the text interpreter encounters a number, it
3807 :     will place (@dfn{push}) it on the stack. There are several stacks (the
3808 : anton 1.30 actual number is implementation-dependent ...) and the particular stack
3809 : crook 1.29 used for any operation is implied unambiguously by the operation being
3810 :     performed. The stack used for all integer operations is called the @dfn{data
3811 :     stack} and, since this is the stack used most commonly, references to
3812 :     ``the data stack'' are often abbreviated to ``the stack''.
3813 : crook 1.21
3814 : crook 1.29 The stacks have a last-in, first-out (LIFO) organisation. If you type:
3815 : crook 1.21
3816 : crook 1.23 @example
3817 : anton 1.30 @kbd{1 2 3@key{RET}} ok
3818 : crook 1.23 @end example
3819 : crook 1.21
3820 : crook 1.29 Then this instructs the text interpreter to placed three numbers on the
3821 :     (data) stack. An analogy for the behaviour of the stack is to take a
3822 :     pack of playing cards and deal out the ace (1), 2 and 3 into a pile on
3823 :     the table. The 3 was the last card onto the pile (``last-in'') and if
3824 :     you take a card off the pile then, unless you're prepared to fiddle a
3825 :     bit, the card that you take off will be the 3 (``first-out''). The
3826 :     number that will be first-out of the stack is called the @dfn{top of
3827 :     stack}, which
3828 :     @cindex TOS definition
3829 :     is often abbreviated to @dfn{TOS}.
3830 : crook 1.21
3831 : crook 1.29 To understand how parameters are passed in Forth, consider the
3832 :     behaviour of the definition @code{+} (pronounced ``plus''). You will not
3833 :     be surprised to learn that this definition performs addition. More
3834 :     precisely, it adds two number together and produces a result. Where does
3835 :     it get the two numbers from? It takes the top two numbers off the
3836 :     stack. Where does it place the result? On the stack. You can act-out the
3837 :     behaviour of @code{+} with your playing cards like this:
3838 : crook 1.21
3839 :     @itemize @bullet
3840 :     @item
3841 : crook 1.29 Pick up two cards from the stack on the table
3842 : crook 1.21 @item
3843 : crook 1.29 Stare at them intently and ask yourself ``what @i{is} the sum of these two
3844 :     numbers''
3845 : crook 1.21 @item
3846 : crook 1.29 Decide that the answer is 5
3847 : crook 1.21 @item
3848 : crook 1.29 Shuffle the two cards back into the pack and find a 5
3849 : crook 1.21 @item
3850 : crook 1.29 Put a 5 on the remaining ace that's on the table.
3851 : crook 1.21 @end itemize
3852 :    
3853 : crook 1.29 If you don't have a pack of cards handy but you do have Forth running,
3854 :     you can use the definition @code{.s} to show the current state of the stack,
3855 :     without affecting the stack. Type:
3856 : crook 1.21
3857 :     @example
3858 : anton 1.30 @kbd{clearstack 1 2 3@key{RET}} ok
3859 :     @kbd{.s@key{RET}} <3> 1 2 3 ok
3860 : crook 1.23 @end example
3861 :    
3862 : crook 1.29 The text interpreter looks up the word @code{clearstack} and executes
3863 :     it; it tidies up the stack and removes any entries that may have been
3864 :     left on it by earlier examples. The text interpreter pushes each of the
3865 :     three numbers in turn onto the stack. Finally, the text interpreter
3866 :     looks up the word @code{.s} and executes it. The effect of executing
3867 :     @code{.s} is to print the ``<3>'' (the total number of items on the stack)
3868 :     followed by a list of all the items on the stack; the item on the far
3869 :     right-hand side is the TOS.
3870 : crook 1.21
3871 : crook 1.29 You can now type:
3872 : crook 1.21
3873 : crook 1.29 @example
3874 : anton 1.30 @kbd{+ .s@key{RET}} <2> 1 5 ok
3875 : crook 1.29 @end example
3876 : crook 1.21
3877 : crook 1.29 @noindent
3878 :     which is correct; there are now 2 items on the stack and the result of
3879 :     the addition is 5.
3880 : crook 1.23
3881 : crook 1.29 If you're playing with cards, try doing a second addition: pick up the
3882 :     two cards, work out that their sum is 6, shuffle them into the pack,
3883 :     look for a 6 and place that on the table. You now have just one item on
3884 :     the stack. What happens if you try to do a third addition? Pick up the
3885 :     first card, pick up the second card -- ah! There is no second card. This
3886 :     is called a @dfn{stack underflow} and consitutes an error. If you try to
3887 : anton 1.95 do the same thing with Forth it often reports an error (probably a Stack
3888 : crook 1.29 Underflow or an Invalid Memory Address error).
3889 : crook 1.23
3890 : crook 1.29 The opposite situation to a stack underflow is a @dfn{stack overflow},
3891 :     which simply accepts that there is a finite amount of storage space
3892 :     reserved for the stack. To stretch the playing card analogy, if you had
3893 :     enough packs of cards and you piled the cards up on the table, you would
3894 :     eventually be unable to add another card; you'd hit the ceiling. Gforth
3895 :     allows you to set the maximum size of the stacks. In general, the only
3896 :     time that you will get a stack overflow is because a definition has a
3897 :     bug in it and is generating data on the stack uncontrollably.
3898 : crook 1.23
3899 : crook 1.29 There's one final use for the playing card analogy. If you model your
3900 :     stack using a pack of playing cards, the maximum number of items on
3901 :     your stack will be 52 (I assume you didn't use the Joker). The maximum
3902 :     @i{value} of any item on the stack is 13 (the King). In fact, the only
3903 :     possible numbers are positive integer numbers 1 through 13; you can't
3904 :     have (for example) 0 or 27 or 3.52 or -2. If you change the way you
3905 :     think about some of the cards, you can accommodate different
3906 :     numbers. For example, you could think of the Jack as representing 0,
3907 :     the Queen as representing -1 and the King as representing -2. Your
3908 : crook 1.45 @i{range} remains unchanged (you can still only represent a total of 13
3909 : crook 1.29 numbers) but the numbers that you can represent are -2 through 10.
3910 : crook 1.28
3911 : crook 1.29 In that analogy, the limit was the amount of information that a single
3912 :     stack entry could hold, and Forth has a similar limit. In Forth, the
3913 :     size of a stack entry is called a @dfn{cell}. The actual size of a cell is
3914 :     implementation dependent and affects the maximum value that a stack
3915 :     entry can hold. A Standard Forth provides a cell size of at least
3916 :     16-bits, and most desktop systems use a cell size of 32-bits.
3917 : crook 1.21
3918 : crook 1.29 Forth does not do any type checking for you, so you are free to
3919 :     manipulate and combine stack items in any way you wish. A convenient way
3920 :     of treating stack items is as 2's complement signed integers, and that
3921 :     is what Standard words like @code{+} do. Therefore you can type:
3922 : crook 1.21
3923 : crook 1.29 @example
3924 : anton 1.30 @kbd{-5 12 + .s@key{RET}} <1> 7 ok
3925 : crook 1.29 @end example
3926 : crook 1.21
3927 : crook 1.29 If you use numbers and definitions like @code{+} in order to turn Forth
3928 :     into a great big pocket calculator, you will realise that it's rather
3929 :     different from a normal calculator. Rather than typing 2 + 3 = you had
3930 :     to type 2 3 + (ignore the fact that you had to use @code{.s} to see the
3931 :     result). The terminology used to describe this difference is to say that
3932 :     your calculator uses @dfn{Infix Notation} (parameters and operators are
3933 :     mixed) whilst Forth uses @dfn{Postfix Notation} (parameters and
3934 :     operators are separate), also called @dfn{Reverse Polish Notation}.
3935 : crook 1.21
3936 : crook 1.29 Whilst postfix notation might look confusing to begin with, it has
3937 :     several important advantages:
3938 : crook 1.21
3939 : crook 1.23 @itemize @bullet
3940 :     @item
3941 : crook 1.29 it is unambiguous
3942 : crook 1.23 @item
3943 : crook 1.29 it is more concise
3944 : crook 1.23 @item
3945 : crook 1.29 it fits naturally with a stack-based system
3946 : crook 1.23 @end itemize
3947 : crook 1.21
3948 : crook 1.29 To examine these claims in more detail, consider these sums:
3949 : crook 1.21
3950 : crook 1.29 @example
3951 :     6 + 5 * 4 =
3952 :     4 * 5 + 6 =
3953 :     @end example
3954 : crook 1.21
3955 : crook 1.29 If you're just learning maths or your maths is very rusty, you will
3956 :     probably come up with the answer 44 for the first and 26 for the
3957 :     second. If you are a bit of a whizz at maths you will remember the
3958 :     @i{convention} that multiplication takes precendence over addition, and
3959 :     you'd come up with the answer 26 both times. To explain the answer 26
3960 :     to someone who got the answer 44, you'd probably rewrite the first sum
3961 :     like this:
3962 : crook 1.21
3963 : crook 1.29 @example
3964 :     6 + (5 * 4) =
3965 :     @end example
3966 : crook 1.21
3967 : crook 1.29 If what you really wanted was to perform the addition before the
3968 :     multiplication, you would have to use parentheses to force it.
3969 : crook 1.21
3970 : crook 1.29 If you did the first two sums on a pocket calculator you would probably
3971 :     get the right answers, unless you were very cautious and entered them using
3972 :     these keystroke sequences:
3973 : crook 1.21
3974 : crook 1.29 6 + 5 = * 4 =
3975 :     4 * 5 = + 6 =
3976 : crook 1.21
3977 : crook 1.29 Postfix notation is unambiguous because the order that the operators
3978 :     are applied is always explicit; that also means that parentheses are
3979 :     never required. The operators are @i{active} (the act of quoting the
3980 :     operator makes the operation occur) which removes the need for ``=''.
3981 : crook 1.28
3982 : crook 1.29 The sum 6 + 5 * 4 can be written (in postfix notation) in two
3983 :     equivalent ways:
3984 : crook 1.26
3985 :     @example
3986 : crook 1.29 6 5 4 * + or:
3987 :     5 4 * 6 +
3988 : crook 1.26 @end example
3989 : crook 1.23
3990 : crook 1.29 An important thing that you should notice about this notation is that
3991 :     the @i{order} of the numbers does not change; if you want to subtract
3992 :     2 from 10 you type @code{10 2 -}.
3993 : anton 1.1
3994 : crook 1.29 The reason that Forth uses postfix notation is very simple to explain: it
3995 :     makes the implementation extremely simple, and it follows naturally from
3996 :     using the stack as a mechanism for passing parameters. Another way of
3997 :     thinking about this is to realise that all Forth definitions are
3998 :     @i{active}; they execute as they are encountered by the text
3999 :     interpreter. The result of this is that the syntax of Forth is trivially
4000 :     simple.
4001 : anton 1.1
4002 :    
4003 :    
4004 : crook 1.29 @comment ----------------------------------------------
4005 :     @node Your first definition, How does that work?, Stacks and Postfix notation, Introduction
4006 :     @section Your first Forth definition
4007 :     @cindex first definition
4008 : anton 1.1
4009 : crook 1.29 Until now, the examples we've seen have been trivial; we've just been
4010 :     using Forth as a bigger-than-pocket calculator. Also, each calculation
4011 :     we've shown has been a ``one-off'' -- to repeat it we'd need to type it in
4012 :     again@footnote{That's not quite true. If you press the up-arrow key on
4013 :     your keyboard you should be able to scroll back to any earlier command,
4014 :     edit it and re-enter it.} In this section we'll see how to add new
4015 :     words to Forth's vocabulary.
4016 : anton 1.1
4017 : crook 1.29 The easiest way to create a new word is to use a @dfn{colon
4018 :     definition}. We'll define a few and try them out before worrying too
4019 :     much about how they work. Try typing in these examples; be careful to
4020 :     copy the spaces accurately:
4021 : anton 1.1
4022 : crook 1.29 @example
4023 :     : add-two 2 + . ;
4024 :     : greet ." Hello and welcome" ;
4025 :     : demo 5 add-two ;
4026 :     @end example
4027 : anton 1.1
4028 : crook 1.29 @noindent
4029 :     Now try them out:
4030 : anton 1.1
4031 : crook 1.29 @example
4032 : anton 1.30 @kbd{greet@key{RET}} Hello and welcome ok
4033 :     @kbd{greet greet@key{RET}} Hello and welcomeHello and welcome ok
4034 :     @kbd{4 add-two@key{RET}} 6 ok
4035 :     @kbd{demo@key{RET}} 7 ok
4036 :     @kbd{9 greet demo add-two@key{RET}} Hello and welcome7 11 ok
4037 : crook 1.29 @end example
4038 : anton 1.1
4039 : crook 1.29 The first new thing that we've introduced here is the pair of words
4040 :     @code{:} and @code{;}. These are used to start and terminate a new
4041 :     definition, respectively. The first word after the @code{:} is the name
4042 :     for the new definition.
4043 : anton 1.1
4044 : crook 1.29 As you can see from the examples, a definition is built up of words that
4045 :     have already been defined; Forth makes no distinction between
4046 :     definitions that existed when you started the system up, and those that
4047 :     you define yourself.
4048 : anton 1.1
4049 : crook 1.29 The examples also introduce the words @code{.} (dot), @code{."}
4050 :     (dot-quote) and @code{dup} (dewp). Dot takes the value from the top of
4051 :     the stack and displays it. It's like @code{.s} except that it only
4052 :     displays the top item of the stack and it is destructive; after it has
4053 :     executed, the number is no longer on the stack. There is always one
4054 :     space printed after the number, and no spaces before it. Dot-quote
4055 :     defines a string (a sequence of characters) that will be printed when
4056 :     the word is executed. The string can contain any printable characters
4057 :     except @code{"}. A @code{"} has a special function; it is not a Forth
4058 :     word but it acts as a delimiter (the way that delimiters work is
4059 :     described in the next section). Finally, @code{dup} duplicates the value
4060 :     at the top of the stack. Try typing @code{5 dup .s} to see what it does.
4061 : anton 1.1
4062 : crook 1.29 We already know that the text interpreter searches through the
4063 :     dictionary to locate names. If you've followed the examples earlier, you
4064 :     will already have a definition called @code{add-two}. Lets try modifying
4065 :     it by typing in a new definition:
4066 : anton 1.1
4067 : crook 1.29 @example
4068 : anton 1.30 @kbd{: add-two dup . ." + 2 =" 2 + . ;@key{RET}} redefined add-two ok
4069 : crook 1.29 @end example
4070 : anton 1.5
4071 : crook 1.29 Forth recognised that we were defining a word that already exists, and
4072 :     printed a message to warn us of that fact. Let's try out the new
4073 :     definition:
4074 : anton 1.5
4075 : crook 1.29 @example
4076 : anton 1.30 @kbd{9 add-two@key{RET}} 9 + 2 =11 ok
4077 : crook 1.29 @end example
4078 : anton 1.1
4079 : crook 1.29 @noindent
4080 :     All that we've actually done here, though, is to create a new
4081 :     definition, with a particular name. The fact that there was already a
4082 :     definition with the same name did not make any difference to the way
4083 :     that the new definition was created (except that Forth printed a warning
4084 :     message). The old definition of add-two still exists (try @code{demo}
4085 :     again to see that this is true). Any new definition will use the new
4086 :     definition of @code{add-two}, but old definitions continue to use the
4087 :     version that already existed at the time that they were @code{compiled}.
4088 : anton 1.1
4089 : crook 1.29 Before you go on to the next section, try defining and redefining some
4090 :     words of your own.
4091 : anton 1.1
4092 : crook 1.29 @comment ----------------------------------------------
4093 :     @node How does that work?, Forth is written in Forth, Your first definition, Introduction
4094 :     @section How does that work?
4095 :     @cindex parsing words
4096 : anton 1.1
4097 : anton 1.30 @c That's pretty deep (IMO way too deep) for an introduction. - anton
4098 :    
4099 :     @c Is it a good idea to talk about the interpretation semantics of a
4100 :     @c number? We don't have an xt to go along with it. - anton
4101 :    
4102 :     @c Now that I have eliminated execution semantics, I wonder if it would not
4103 :     @c be better to keep them (or add run-time semantics), to make it easier to
4104 :     @c explain what compilation semantics usually does. - anton
4105 :    
4106 : crook 1.44 @c nac-> I removed the term ``default compilation sematics'' from the
4107 :     @c introductory chapter. Removing ``execution semantics'' was making
4108 :     @c everything simpler to explain, then I think the use of this term made
4109 :     @c everything more complex again. I replaced it with ``default
4110 :     @c semantics'' (which is used elsewhere in the manual) by which I mean
4111 :     @c ``a definition that has neither the immediate nor the compile-only
4112 : anton 1.83 @c flag set''.
4113 :    
4114 :     @c anton: I have eliminated default semantics (except in one place where it
4115 :     @c means "default interpretation and compilation semantics"), because it
4116 :     @c makes no sense in the presence of combined words. I reverted to
4117 :     @c "execution semantics" where necessary.
4118 :    
4119 :     @c nac-> I reworded big chunks of the ``how does that work''
4120 : crook 1.44 @c section (and, unusually for me, I think I even made it shorter!). See
4121 :     @c what you think -- I know I have not addressed your primary concern
4122 :     @c that it is too heavy-going for an introduction. From what I understood
4123 :     @c of your course notes it looks as though they might be a good framework.
4124 :     @c Things that I've tried to capture here are some things that came as a
4125 :     @c great revelation here when I first understood them. Also, I like the
4126 :     @c fact that a very simple code example shows up almost all of the issues
4127 :     @c that you need to understand to see how Forth works. That's unique and
4128 :     @c worthwhile to emphasise.
4129 :    
4130 : anton 1.83 @c anton: I think it's a good idea to present the details, especially those
4131 :     @c that you found to be a revelation, and probably the tutorial tries to be
4132 :     @c too superficial and does not get some of the things across that make
4133 :     @c Forth special. I do believe that most of the time these things should
4134 :     @c be discussed at the end of a section or in separate sections instead of
4135 :     @c in the middle of a section (e.g., the stuff you added in "User-defined
4136 :     @c defining words" leads in a completely different direction from the rest
4137 :     @c of the section).
4138 :    
4139 : crook 1.29 Now we're going to take another look at the definition of @code{add-two}
4140 :     from the previous section. From our knowledge of the way that the text
4141 :     interpreter works, we would have expected this result when we tried to
4142 :     define @code{add-two}:
4143 : crook 1.21
4144 : crook 1.29 @example
4145 : crook 1.44 @kbd{: add-two 2 + . ;@key{RET}}
4146 : crook 1.29 ^^^^^^^
4147 :     Error: Undefined word
4148 :     @end example
4149 : crook 1.28
4150 : crook 1.29 The reason that this didn't happen is bound up in the way that @code{:}
4151 :     works. The word @code{:} does two special things. The first special
4152 :     thing that it does prevents the text interpreter from ever seeing the
4153 :     characters @code{add-two}. The text interpreter uses a variable called
4154 :     @cindex modifying >IN
4155 : crook 1.44 @code{>IN} (pronounced ``to-in'') to keep track of where it is in the
4156 : crook 1.29 input line. When it encounters the word @code{:} it behaves in exactly
4157 :     the same way as it does for any other word; it looks it up in the name
4158 :     dictionary, finds its xt and executes it. When @code{:} executes, it
4159 :     looks at the input buffer, finds the word @code{add-two} and advances the
4160 :     value of @code{>IN} to point past it. It then does some other stuff
4161 :     associated with creating the new definition (including creating an entry
4162 :     for @code{add-two} in the name dictionary). When the execution of @code{:}
4163 :     completes, control returns to the text interpreter, which is oblivious
4164 :     to the fact that it has been tricked into ignoring part of the input
4165 :     line.
4166 : crook 1.21
4167 : crook 1.29 @cindex parsing words
4168 :     Words like @code{:} -- words that advance the value of @code{>IN} and so
4169 :     prevent the text interpreter from acting on the whole of the input line
4170 :     -- are called @dfn{parsing words}.
4171 : crook 1.21
4172 : crook 1.29 @cindex @code{state} - effect on the text interpreter
4173 :     @cindex text interpreter - effect of state
4174 :     The second special thing that @code{:} does is change the value of a
4175 :     variable called @code{state}, which affects the way that the text
4176 :     interpreter behaves. When Gforth starts up, @code{state} has the value
4177 :     0, and the text interpreter is said to be @dfn{interpreting}. During a
4178 :     colon definition (started with @code{:}), @code{state} is set to -1 and
4179 : crook 1.44 the text interpreter is said to be @dfn{compiling}.
4180 :    
4181 :     In this example, the text interpreter is compiling when it processes the
4182 :     string ``@code{2 + . ;}''. It still breaks the string down into
4183 :     character sequences in the same way. However, instead of pushing the
4184 :     number @code{2} onto the stack, it lays down (@dfn{compiles}) some magic
4185 :     into the definition of @code{add-two} that will make the number @code{2} get
4186 :     pushed onto the stack when @code{add-two} is @dfn{executed}. Similarly,
4187 :     the behaviours of @code{+} and @code{.} are also compiled into the
4188 :     definition.
4189 :    
4190 :     One category of words don't get compiled. These so-called @dfn{immediate
4191 :     words} get executed (performed @i{now}) regardless of whether the text
4192 :     interpreter is interpreting or compiling. The word @code{;} is an
4193 :     immediate word. Rather than being compiled into the definition, it
4194 :     executes. Its effect is to terminate the current definition, which
4195 :     includes changing the value of @code{state} back to 0.
4196 :    
4197 :     When you execute @code{add-two}, it has a @dfn{run-time effect} that is
4198 :     exactly the same as if you had typed @code{2 + . @key{RET}} outside of a
4199 :     definition.
4200 : crook 1.28
4201 : anton 1.30 In Forth, every word or number can be described in terms of two
4202 : crook 1.29 properties:
4203 : crook 1.28
4204 :     @itemize @bullet
4205 :     @item
4206 : crook 1.29 @cindex interpretation semantics
4207 : crook 1.44 Its @dfn{interpretation semantics} describe how it will behave when the
4208 :     text interpreter encounters it in @dfn{interpret} state. The
4209 :     interpretation semantics of a word are represented by an @dfn{execution
4210 :     token}.
4211 : crook 1.28 @item
4212 : crook 1.29 @cindex compilation semantics
4213 : crook 1.44 Its @dfn{compilation semantics} describe how it will behave when the
4214 :     text interpreter encounters it in @dfn{compile} state. The compilation
4215 :     semantics of a word are represented in an implementation-dependent way;
4216 :     Gforth uses a @dfn{compilation token}.
4217 : crook 1.29 @end itemize
4218 :    
4219 :     @noindent
4220 :     Numbers are always treated in a fixed way:
4221 :    
4222 :     @itemize @bullet
4223 : crook 1.28 @item
4224 : crook 1.44 When the number is @dfn{interpreted}, its behaviour is to push the
4225 :     number onto the stack.
4226 : crook 1.28 @item
4227 : anton 1.30 When the number is @dfn{compiled}, a piece of code is appended to the
4228 :     current definition that pushes the number when it runs. (In other words,
4229 :     the compilation semantics of a number are to postpone its interpretation
4230 :     semantics until the run-time of the definition that it is being compiled
4231 :     into.)
4232 : crook 1.29 @end itemize
4233 :    
4234 : crook 1.44 Words don't behave in such a regular way, but most have @i{default
4235 :     semantics} which means that they behave like this:
4236 : crook 1.29
4237 :     @itemize @bullet
4238 : crook 1.28 @item
4239 : anton 1.30 The @dfn{interpretation semantics} of the word are to do something useful.
4240 :     @item
4241 : crook 1.29 The @dfn{compilation semantics} of the word are to append its
4242 : anton 1.30 @dfn{interpretation semantics} to the current definition (so that its
4243 :     run-time behaviour is to do something useful).
4244 : crook 1.28 @end itemize
4245 :    
4246 : anton 1.30 @cindex immediate words
4247 : crook 1.44 The actual behaviour of any particular word can be controlled by using
4248 :     the words @code{immediate} and @code{compile-only} when the word is
4249 :     defined. These words set flags in the name dictionary entry of the most
4250 :     recently defined word, and these flags are retrieved by the text
4251 :     interpreter when it finds the word in the name dictionary.
4252 :    
4253 :     A word that is marked as @dfn{immediate} has compilation semantics that
4254 :     are identical to its interpretation semantics. In other words, it
4255 :     behaves like this:
4256 : crook 1.29
4257 :     @itemize @bullet
4258 :     @item
4259 : anton 1.30 The @dfn{interpretation semantics} of the word are to do something useful.
4260 : crook 1.29 @item
4261 : anton 1.30 The @dfn{compilation semantics} of the word are to do something useful
4262 :     (and actually the same thing); i.e., it is executed during compilation.
4263 : crook 1.29 @end itemize
4264 : crook 1.28
4265 : crook 1.44 Marking a word as @dfn{compile-only} prohibits the text interpreter from
4266 :     performing the interpretation semantics of the word directly; an attempt
4267 :     to do so will generate an error. It is never necessary to use
4268 :     @code{compile-only} (and it is not even part of ANS Forth, though it is
4269 :     provided by many implementations) but it is good etiquette to apply it
4270 :     to a word that will not behave correctly (and might have unexpected
4271 :     side-effects) in interpret state. For example, it is only legal to use
4272 :     the conditional word @code{IF} within a definition. If you forget this
4273 :     and try to use it elsewhere, the fact that (in Gforth) it is marked as
4274 :     @code{compile-only} allows the text interpreter to generate a helpful
4275 :     error message rather than subjecting you to the consequences of your
4276 :     folly.
4277 :    
4278 : crook 1.29 This example shows the difference between an immediate and a
4279 :     non-immediate word:
4280 : crook 1.28
4281 : crook 1.29 @example
4282 :     : show-state state @@ . ;
4283 :     : show-state-now show-state ; immediate
4284 :     : word1 show-state ;
4285 :     : word2 show-state-now ;
4286 : crook 1.28 @end example
4287 : crook 1.23
4288 : crook 1.29 The word @code{immediate} after the definition of @code{show-state-now}
4289 :     makes that word an immediate word. These definitions introduce a new
4290 :     word: @code{@@} (pronounced ``fetch''). This word fetches the value of a
4291 :     variable, and leaves it on the stack. Therefore, the behaviour of
4292 :     @code{show-state} is to print a number that represents the current value
4293 :     of @code{state}.
4294 : crook 1.28
4295 : crook 1.29 When you execute @code{word1}, it prints the number 0, indicating that
4296 :     the system is interpreting. When the text interpreter compiled the
4297 :     definition of @code{word1}, it encountered @code{show-state} whose
4298 : anton 1.30 compilation semantics are to append its interpretation semantics to the
4299 : crook 1.29 current definition. When you execute @code{word1}, it performs the
4300 : anton 1.30 interpretation semantics of @code{show-state}. At the time that @code{word1}
4301 : crook 1.29 (and therefore @code{show-state}) are executed, the system is
4302 :     interpreting.
4303 : crook 1.28
4304 : anton 1.30 When you pressed @key{RET} after entering the definition of @code{word2},
4305 : crook 1.29 you should have seen the number -1 printed, followed by ``@code{
4306 :     ok}''. When the text interpreter compiled the definition of
4307 :     @code{word2}, it encountered @code{show-state-now}, an immediate word,
4308 : anton 1.30 whose compilation semantics are therefore to perform its interpretation
4309 : crook 1.29 semantics. It is executed straight away (even before the text
4310 :     interpreter has moved on to process another group of characters; the
4311 :     @code{;} in this example). The effect of executing it are to display the
4312 :     value of @code{state} @i{at the time that the definition of}
4313 :     @code{word2} @i{is being defined}. Printing -1 demonstrates that the
4314 :     system is compiling at this time. If you execute @code{word2} it does
4315 :     nothing at all.
4316 : crook 1.28
4317 : crook 1.29 @cindex @code{."}, how it works
4318 :     Before leaving the subject of immediate words, consider the behaviour of
4319 :     @code{."} in the definition of @code{greet}, in the previous
4320 :     section. This word is both a parsing word and an immediate word. Notice
4321 :     that there is a space between @code{."} and the start of the text
4322 :     @code{Hello and welcome}, but that there is no space between the last
4323 :     letter of @code{welcome} and the @code{"} character. The reason for this
4324 :     is that @code{."} is a Forth word; it must have a space after it so that
4325 :     the text interpreter can identify it. The @code{"} is not a Forth word;
4326 :     it is a @dfn{delimiter}. The examples earlier show that, when the string
4327 :     is displayed, there is neither a space before the @code{H} nor after the
4328 :     @code{e}. Since @code{."} is an immediate word, it executes at the time
4329 :     that @code{greet} is defined. When it executes, its behaviour is to
4330 :     search forward in the input line looking for the delimiter. When it
4331 :     finds the delimiter, it updates @code{>IN} to point past the
4332 :     delimiter. It also compiles some magic code into the definition of
4333 :     @code{greet}; the xt of a run-time routine that prints a text string. It
4334 :     compiles the string @code{Hello and welcome} into memory so that it is
4335 :     available to be printed later. When the text interpreter gains control,
4336 :     the next word it finds in the input stream is @code{;} and so it
4337 :     terminates the definition of @code{greet}.
4338 : crook 1.28
4339 :    
4340 :     @comment ----------------------------------------------
4341 : crook 1.29 @node Forth is written in Forth, Review - elements of a Forth system, How does that work?, Introduction
4342 :     @section Forth is written in Forth
4343 :     @cindex structure of Forth programs
4344 :    
4345 :     When you start up a Forth compiler, a large number of definitions
4346 :     already exist. In Forth, you develop a new application using bottom-up
4347 :     programming techniques to create new definitions that are defined in
4348 :     terms of existing definitions. As you create each definition you can
4349 :     test and debug it interactively.
4350 :    
4351 :     If you have tried out the examples in this section, you will probably
4352 :     have typed them in by hand; when you leave Gforth, your definitions will
4353 :     be lost. You can avoid this by using a text editor to enter Forth source
4354 :     code into a file, and then loading code from the file using
4355 : anton 1.49 @code{include} (@pxref{Forth source files}). A Forth source file is
4356 : crook 1.29 processed by the text interpreter, just as though you had typed it in by
4357 :     hand@footnote{Actually, there are some subtle differences -- see
4358 :     @ref{The Text Interpreter}.}.
4359 :    
4360 :     Gforth also supports the traditional Forth alternative to using text
4361 : anton 1.49 files for program entry (@pxref{Blocks}).
4362 : crook 1.28
4363 : crook 1.29 In common with many, if not most, Forth compilers, most of Gforth is
4364 :     actually written in Forth. All of the @file{.fs} files in the
4365 :     installation directory@footnote{For example,
4366 : anton 1.30 @file{/usr/local/share/gforth...}} are Forth source files, which you can
4367 : crook 1.29 study to see examples of Forth programming.
4368 : crook 1.28
4369 : crook 1.29 Gforth maintains a history file that records every line that you type to
4370 :     the text interpreter. This file is preserved between sessions, and is
4371 :     used to provide a command-line recall facility. If you enter long
4372 :     definitions by hand, you can use a text editor to paste them out of the
4373 :     history file into a Forth source file for reuse at a later time
4374 : anton 1.49 (for more information @pxref{Command-line editing}).
4375 : crook 1.28
4376 :    
4377 :     @comment ----------------------------------------------
4378 : crook 1.29 @node Review - elements of a Forth system, Where to go next, Forth is written in Forth, Introduction
4379 :     @section Review - elements of a Forth system
4380 :     @cindex elements of a Forth system
4381 : crook 1.28
4382 : crook 1.29 To summarise this chapter:
4383 : crook 1.28
4384 :     @itemize @bullet
4385 :     @item
4386 : crook 1.29 Forth programs use @dfn{factoring} to break a problem down into small
4387 :     fragments called @dfn{words} or @dfn{definitions}.
4388 :     @item
4389 :     Forth program development is an interactive process.
4390 :     @item
4391 :     The main command loop that accepts input, and controls both
4392 :     interpretation and compilation, is called the @dfn{text interpreter}
4393 :     (also known as the @dfn{outer interpreter}).
4394 :     @item
4395 :     Forth has a very simple syntax, consisting of words and numbers
4396 :     separated by spaces or carriage-return characters. Any additional syntax
4397 :     is imposed by @dfn{parsing words}.
4398 :     @ite