Before the Pike code is sent to the compiler it is fed through the
preprocessor. The preprocessor converts the source code from its
character encoding into the Pike internal representation, performs
some simple normalizations and consistency checks and executes the
"preprocessor directives" that the programmer may have put into the
file. The preprocessor directives are like a very simple programming
language that allows for simple code generation and manipulation.
The code preprocessor can be called from within Pike with the
cpp
call.
Pike code is Unicode enabled, so the first thing the preprocessor has to do is to try to determine the character encoding of the file. It will first look at the two first bytes of the file and interpret them according to this chart.
Byte 0 | Byte 1 | Interpretation |
0 | 0 | 32bit wide string. |
0 | >0 | 16bit Unicode string. |
>0 | 0 | 16bit Unicode string in reverse byte order. |
0xfe | 0xff | 16bit Unicode string. |
0xff | 0xfe | 16bit Unicode string in reverse byte order. |
0x7b | 0x83 | EBCDIC-US ("#c"). |
0x7b | 0x40 | EBCDIC-US ("# "). |
0x7b | 0x09 | EBCDIC-US ("#\t"). |
The preprocessor collapses all consecutive white space characters outside of strings, except for newlines, to single space characters. All // and /**/ comments are removed, as are #! lines. Pike considers ANSI/DEC escape sequences as white space. Supported formats are <ESC>[\040-\077]+[\100-\177] and <CSI>[\040-\077]*[\100-\177]. Note that this means that it is possible to do color markup in the actual source file.
The preprocessor will treat seven consecutive < characters outside of a string as an version control conflict error and will return "Merge conflict detected."
Defining macros or constants is one of the most used preprocessor features. It enables you to make abstractions on a code generation level as well as altering constants cross-application. The simplest use of the #define directive however is to declare a "define" as present.
#define DO_OVERSAMPLING
The existence of this definition can now be used by e.g. #ifdef and #ifndef to activate or deactivate blocks of program code.
#ifdef DO_OVERSAMPLING // This code is not always run. img->render(size*4)->shrink(4); #endif
Note that defines can be given to pike at execution time. In order to set DO_OVERSAMPLING from a command line, the option -DDO_OVERSAMPLING is added before the name of the pike program. E.g. pike -DDO_OVERSAMPLING my_program.pike.
A define can also be given a specific value, which will be inserted everywhere the define is placed in the source code.
#define CYCLES 20 void do_stuff() { for(int i; i<CYCLES; i++) do_other_stuff(); }
Defines can be given specific values on the command line too, just be sure to quote them as required by your shell.
~% pike '-DTEXT="Hello world!"' -e 'write("%s\n", TEXT);' Hello world!
Finally #define can also be used to define macros. Macros are just text expansion with arguments, but it is often very useful to make a cleaner looking code and to write less.
#define VAR(X) id->misc->variable[X] #define ROL(X,Y) (((X)<<(Y))&7+((X)>>(8-(Y)))) #define PLACEHOLDER(X) void X(mixed ... args) { \ error("Method " #X " is not implemented yet.\n"); } #define ERROR(X,Y ...) werror("MyClass" X "\n", Y) #define NEW_CONSTANTS(X) do{ int i=sizeof(all_constants()); \ X \ werror("Constant diff is %d\n", sizeof(all_constants())-i); \ }while(0) #define MY_FUNC(X,Y) void my##X##Y()
All of the preprocessor directives except the string-related (#string and #"") should have the hash character (#) as the first character of the line. Even though it is currently allowed to be indented, it is likely that this will generate warnings or errors in the future. Note that it is however allowed to put white-space between the hash character and the rest of the directive to create indentation in code.
These functions are used in #if
et al
expressions to test for presence of symbols.