blob: 1a04e447495f8a6ec2fe94551b0614bdaa3cd65e [file] [log] [blame]
Mark Whitley40bfc762000-07-24 22:36:06 +00001Busybox Style Guide
2===================
3
4This document describes the coding style conventions used in Busybox. If you
5add a new file to Busybox or are editing an existing file, please format your
6code according to this style. If you are the maintainer of a file that does
7not follow these guidelines, please -- at your own convenience -- modify the
8file(s) you maintain to bring them into conformance with this style guide.
9Please note that this is a low priority task.
10
11To help you format the whitespace of your programs, an ".indent.pro" file is
12included in the main Busybox source directory that contains option flags to
13format code as per this style guide. This way you can run GNU indent on your
14files by typing 'indent myfile.c myfile.h' and it will magically apply all the
15right formatting rules to your file. Please _do_not_ run this on all the files
16in the directory, just your own.
17
Mark Whitley2368a382000-08-22 00:20:21 +000018
Mark Whitleyd58ff872000-11-22 19:25:39 +000019
Mark Whitley40bfc762000-07-24 22:36:06 +000020Declaration Order
21-----------------
22
23Here is the order in which code should be laid out in a file:
24
Mark Whitley9028e2c2000-11-17 21:28:39 +000025 - commented program name and one-line description
Mark Whitley40bfc762000-07-24 22:36:06 +000026 - commented author name and email address(es)
27 - commented GPL boilerplate
Mark Whitley9028e2c2000-11-17 21:28:39 +000028 - commented longer description / notes for the program (if needed)
Mark Whitley40bfc762000-07-24 22:36:06 +000029 - #includes and #defines
Mark Whitley9028e2c2000-11-17 21:28:39 +000030 - const and global variables
Mark Whitley40bfc762000-07-24 22:36:06 +000031 - function declarations (if necessary)
32 - function implementations
33
Mark Whitley2368a382000-08-22 00:20:21 +000034
Mark Whitleyd58ff872000-11-22 19:25:39 +000035
36Whitespace and Formatting
37-------------------------
Mark Whitley40bfc762000-07-24 22:36:06 +000038
Mark Whitley2368a382000-08-22 00:20:21 +000039This is everybody's favorite flame topic so let's get it out of the way right
40up front.
41
42
Mark Whitley9028e2c2000-11-17 21:28:39 +000043Tabs vs. Spaces in Line Indentation
Mark Whitleyd58ff872000-11-22 19:25:39 +000044~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Mark Whitley2368a382000-08-22 00:20:21 +000045
46The preference in Busybox is to indent lines with tabs. Do not indent lines
47with spaces and do not indents lines using a mixture of tabs and spaces. (The
48indentation style in the Apache and Postfix source does this sort of thing:
49\s\s\s\sif (expr) {\n\tstmt; --ick.) The only exception to this rule is
50multi-line comments that use an asterisk at the beginning of each line, i.e.:
Mark Whitley40bfc762000-07-24 22:36:06 +000051
52 /t/*
53 /t * This is a block comment.
54 /t * Note that it has multiple lines
55 /t * and that the beginning of each line has a tab plus a space
56 /t * except for the opening '/*' line where the slash
57 /t * is used instead of a space.
58 /t */
59
60Furthermore, The preference is that tabs be set to display at four spaces
61wide, but the beauty of using only tabs (and not spaces) at the beginning of
Mark Whitley9028e2c2000-11-17 21:28:39 +000062lines is that you can set your editor to display tabs at *whatever* number of
Mark Whitley40bfc762000-07-24 22:36:06 +000063spaces is desired and the code will still look fine.
64
65
Mark Whitley2368a382000-08-22 00:20:21 +000066Operator Spacing
67~~~~~~~~~~~~~~~~
68
69Put spaces between terms and operators. Example:
Mark Whitley40bfc762000-07-24 22:36:06 +000070
71 Don't do this:
72
73 for(i=0;i<num_items;i++){
74
75 Do this instead:
76
77 for (i = 0; i < num_items; i++) {
78
79 While it extends the line a bit longer, the spaced version is more
80 readable. An allowable exception to this rule is the situation where
81 excluding the spacing makes it more obvious that we are dealing with a
Mark Whitley9028e2c2000-11-17 21:28:39 +000082 single term (even if it is a compound term) such as:
Mark Whitley40bfc762000-07-24 22:36:06 +000083
84 if (str[idx] == '/' && str[idx-1] != '\\')
85
86 or
87
88 if ((argc-1) - (optind+1) > 0)
89
90
Mark Whitley2368a382000-08-22 00:20:21 +000091Bracket Spacing
92~~~~~~~~~~~~~~~
93
94If an opening bracket starts a function, it should be on the
Mark Whitley9028e2c2000-11-17 21:28:39 +000095next line with no spacing before it. However, if a bracket follows an opening
Mark Whitley40bfc762000-07-24 22:36:06 +000096control block, it should be on the same line with a single space (not a tab)
Mark Whitley9028e2c2000-11-17 21:28:39 +000097between it and the opening control block statement. Examples:
Mark Whitley40bfc762000-07-24 22:36:06 +000098
99 Don't do this:
100
Mark Whitley9028e2c2000-11-17 21:28:39 +0000101 while (!done)
102 {
103
104 do
105 {
106
107 Don't do this either:
108
Mark Whitley40bfc762000-07-24 22:36:06 +0000109 while (!done){
110 do{
111
Mark Whitley3680c582000-12-20 22:35:12 +0000112 And for heaven's sake, don't do this:
113
114 while (!done)
115 {
116 do
117 {
118
Mark Whitley40bfc762000-07-24 22:36:06 +0000119 Do this instead:
120
121 while (!done) {
122 do {
123
Mark Whitley2368a382000-08-22 00:20:21 +0000124
125Paren Spacing
126~~~~~~~~~~~~~
127
128Put a space between C keywords and left parens, but not between
129function names and the left paren that starts it's parameter list (whether it
130is being declared or called). Examples:
131
132 Don't do this:
133
134 while(foo) {
135 for(i = 0; i < n; i++) {
136
137 Do this instead:
138
139 while (foo) {
140 for (i = 0; i < n; i++) {
141
Mark Whitley9028e2c2000-11-17 21:28:39 +0000142 But do functions like this:
Mark Whitley2368a382000-08-22 00:20:21 +0000143
144 static int my_func(int foo, char bar)
145 ...
146 baz = my_func(1, 2);
147
148
149Cuddled Elses
150~~~~~~~~~~~~~
151
Mark Whitley9028e2c2000-11-17 21:28:39 +0000152Also, please "cuddle" your else statements by putting the else keyword on the
153same line after the right bracket that closes an 'if' statement.
Mark Whitley40bfc762000-07-24 22:36:06 +0000154
155 Don't do this:
156
157 if (foo) {
158 stmt;
159 }
160 else {
161 stmt;
162 }
163
164 Do this instead:
165
166 if (foo) {
167 stmt;
168 } else {
169 stmt;
170 }
171
Mark Whitley9028e2c2000-11-17 21:28:39 +0000172The exception to this rule is if you want to include a comment before the else
173block. Example:
174
175 if (foo) {
176 stmts...
177 }
178 /* otherwise, we're just kidding ourselves, so re-frob the input */
179 else {
180 other_stmts...
181 }
182
Mark Whitley40bfc762000-07-24 22:36:06 +0000183
Mark Whitleyd58ff872000-11-22 19:25:39 +0000184
Mark Whitley3680c582000-12-20 22:35:12 +0000185
Mark Whitley40bfc762000-07-24 22:36:06 +0000186Variable and Function Names
187---------------------------
188
189Use the K&R style with names in all lower-case and underscores occasionally
Mark Whitley9028e2c2000-11-17 21:28:39 +0000190used to separate words (e.g., "variable_name" and "numchars" are both
Mark Whitley40bfc762000-07-24 22:36:06 +0000191acceptable). Using underscores makes variable and function names more readable
192because it looks like whitespace; using lower-case is easy on the eyes.
193
Mark Whitley3680c582000-12-20 22:35:12 +0000194 Frowned upon:
195
196 hitList
197 TotalChars
Mark Whitleya5b55ca2001-01-24 00:18:13 +0000198 szFileName
199 pf_Nfol_TriState
Mark Whitley3680c582000-12-20 22:35:12 +0000200
201 Preferred:
202
203 hit_list
204 total_chars
205 file_name
Mark Whitleya5b55ca2001-01-24 00:18:13 +0000206 sensible_name
Mark Whitley3680c582000-12-20 22:35:12 +0000207
Mark Whitleya5b55ca2001-01-24 00:18:13 +0000208Exceptions:
209
210 - Enums, macros, and constant variables should all be in upper-case with
211 words optionally seperatedy by underscores (i.e. FIFOTYPE, ISBLKDEV()).
212
213 - Nobody is going to get mad at you for using 'pvar' as the name of a
214 variable that is a pointer to 'var'.
Mark Whitley3680c582000-12-20 22:35:12 +0000215
Mark Whitley40bfc762000-07-24 22:36:06 +0000216Note: The Busybox codebase is very much a mixture of code gathered from a
Mark Whitley9028e2c2000-11-17 21:28:39 +0000217variety of sources. This explains why the current codebase contains such a
218hodge-podge of different naming styles (Java, Pascal, K&R, just-plain-weird,
Mark Whitley40bfc762000-07-24 22:36:06 +0000219etc.). The K&R guideline explained above should therefore be used on new files
220that are added to the repository. Furthermore, the maintainer of an existing
Mark Whitley3680c582000-12-20 22:35:12 +0000221file that uses alternate naming conventions should -- at his own convenience
222-- convert those names over to K&R style; converting variable names is a very
223low priority task. Perhaps in the future we will include some magical Perl
224script that can go through and convert variable names, left as an exercise for
225the reader for now.
Mark Whitley40bfc762000-07-24 22:36:06 +0000226
Mark Whitleya5b55ca2001-01-24 00:18:13 +0000227For the time being, if you want to do a search-and-replace of a variable name
228in different files, do the following in the busybox directory:
229
230 $ perl -pi -e 's/\bOldVar\b/new_var/g' *.[ch]
231
Mark Whitley40bfc762000-07-24 22:36:06 +0000232
Mark Whitley40bfc762000-07-24 22:36:06 +0000233
Mark Whitleyd58ff872000-11-22 19:25:39 +0000234Avoid The Preprocessor
235----------------------
Mark Whitley40bfc762000-07-24 22:36:06 +0000236
Mark Whitleyd58ff872000-11-22 19:25:39 +0000237At best, the preprocessor is a necessary evil, helping us account for platform
238and architecture differences. Using the preprocessor unnecessarily is just
239plain evil.
Mark Whitley2368a382000-08-22 00:20:21 +0000240
Mark Whitley40bfc762000-07-24 22:36:06 +0000241
Mark Whitleyd58ff872000-11-22 19:25:39 +0000242The Folly of #define
243~~~~~~~~~~~~~~~~~~~~
Mark Whitley40bfc762000-07-24 22:36:06 +0000244
Mark Whitleyd58ff872000-11-22 19:25:39 +0000245Use 'const <type> var' for declaring constants.
Mark Whitley40bfc762000-07-24 22:36:06 +0000246
247 Don't do this:
248
Mark Whitleyd58ff872000-11-22 19:25:39 +0000249 #define var 80
250
251 Do this instead, when the variable is in a header file and will be used in
252 several source files:
253
254 const int var = 80;
255
256 Or do this when the variable is used only in a single source file:
257
258 static const int var = 80;
259
260Declaring variables as '[static] const' gives variables an actual type and
261makes the compiler do type checking for you; the preprocessor does _no_ type
262checking whatsoever, making it much more error prone. Declaring variables with
263'[static] const' also makes debugging programs much easier since the value of
264the variable can be easily queried and displayed.
265
266
267The Folly of Macros
268~~~~~~~~~~~~~~~~~~~
269
270Use 'static inline' instead of a macro.
271
272 Don't do this:
273
274 #define mini_func(param1, param2) (param1 << param2)
Mark Whitley40bfc762000-07-24 22:36:06 +0000275
276 Do this instead:
277
Mark Whitleyd58ff872000-11-22 19:25:39 +0000278 static inline int mini_func(int param1, param2)
Mark Whitley40bfc762000-07-24 22:36:06 +0000279 {
Mark Whitleyd58ff872000-11-22 19:25:39 +0000280 return (param1 << param2);
281 }
Mark Whitley40bfc762000-07-24 22:36:06 +0000282
Mark Whitleyd58ff872000-11-22 19:25:39 +0000283Static inline functions are greatly preferred over macros. They provide type
284safety, have no length limitations, no formatting limitations, and under gcc
285they are as cheap as macros. Besides, really long macros with backslashes at
286the end of each line are ugly as sin.
287
288
289The Folly of #ifdef
290~~~~~~~~~~~~~~~~~~~
291
292Code cluttered with ifdefs is difficult to read and maintain. Don't do it.
293Instead, put your ifdefs in a header, and conditionally define 'static inline'
294functions, (or *maybe* macros), which are used in the code.
295
296 Don't do this:
297
298 ret = my_func(bar, baz);
299 if (!ret)
300 return -1;
301 #ifdef BB_FEATURE_FUNKY
302 maybe_do_funky_stuff(bar, baz);
303 #endif
304
305 Do this instead:
306
307 (in .h header file)
308
309 #ifndef BB_FEATURE_FUNKY
310 static inline void maybe_do_funky_stuff (int bar, int baz) {}
311 #endif
312
313 (in the .c source file)
314
315 ret = my_func(bar, baz);
316 if (!ret)
317 return -1;
318 maybe_do_funky_stuff(bar, baz);
319
320The great thing about this approach is that the compiler will optimize away
321the "no-op" case when the feature is turned off.
322
323Note also the use of the word 'maybe' in the function name to indicate
324conditional execution.
325
326
327
328Notes on Strings
329----------------
330
331Strings in C can get a little thorny. Here's some guidelines for dealing with
332strings in Busybox. (There is surely more that could be added to this
333section.)
334
335
336String Files
337~~~~~~~~~~~~
338
339Put all help/usage messages in usage.c. Put other strings in messages.c.
340Putting these strings into their own file is a calculated decision designed to
341confine spelling errors to a single place and aid internationalization
342efforts, if needed. (Side Note: we might want to use a single file - maybe
343called 'strings.c' - instead of two, food for thought).
344
345
346Testing String Equivalence
347~~~~~~~~~~~~~~~~~~~~~~~~~~
348
349There's a right way and a wrong way to test for sting equivalence with
350strcmp():
351
352 The wrong way:
353
354 if (!strcmp(string, "foo")) {
355 ...
356
357 The right way:
358
359 if (strcmp(string, "foo") == 0){
360 ...
361
362The use of the "equals" (==) operator in the latter example makes it much more
363obvious that you are testing for equivalence. The former example with the
364"not" (!) operator makes it look like you are testing for an error. In a more
365perfect world, we would have a streq() function in the string library, but
366that ain't the world we're living in.
367
368
Mark Whitleya5b55ca2001-01-24 00:18:13 +0000369Avoid Dangerous String Functions
370~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
371
372Unfortunately, the way C handles strings makes them prone to overruns when
373certain library functions are (mis)used. The following table offers a summary
374of some of the more notorious troublemakers:
375
376function overflows preferred
377----------------------------------------
378strcpy dest string strncpy
379strcat dest string strncat
380gets string it gets fgets
381getwd buf string getcwd
382[v]sprintf str buffer [v]snprintf
383realpath path buffer use with pathconf
384[vf]scanf its arguments just avoid it
385
386
387The above is by no means a complete list. Be careful out there.
388
389
390
391Avoid Big Static Buffers
392------------------------
393
394First, some background to put this discussion in context: Static buffers look
395like this in code:
396
397 /* in a .c file outside any functions */
398 static char *buffer[BUFSIZ]; /* happily used by any function in this file,
399 but ick! big! */
400
401The problem with these is that any time any busybox app is run, you pay a
402memory penalty for this buffer, even if the applet that uses said buffer is
403not run. This can be fixed, thusly:
404
Eric Andersend35c2152001-01-25 23:49:09 +0000405 static char *buffer;
Mark Whitleya5b55ca2001-01-24 00:18:13 +0000406 ...
407 other_func()
408 {
409 strcpy(buffer, lotsa_chars); /* happily uses global *buffer */
410 ...
411 foo_main()
412 {
413 buffer = xmalloc(sizeof(char)*BUFSIZ);
414 ...
415
416However, this approach trades bss segment for text segment. Rather than
417mallocing the buffers (and thus growing the text size), buffers can be
418declared on the stack in the *_main() function and made available globally by
419assigning them to a global pointer thusly:
420
Eric Andersend35c2152001-01-25 23:49:09 +0000421 static char *pbuffer;
Mark Whitleya5b55ca2001-01-24 00:18:13 +0000422 ...
423 other_func()
424 {
425 strcpy(pbuffer, lotsa_chars); /* happily uses global *pbuffer */
426 ...
427 foo_main()
428 {
429 char *buffer[BUFSIZ]; /* declared locally, on stack */
430 pbuffer = buffer; /* but available globally */
431 ...
432
Eric Andersend35c2152001-01-25 23:49:09 +0000433This last approach has some advantages (low code size, space not used until
434it's needed), but can be a problem in some low resource machines that have
435very limited stack space (e.g., uCLinux). busybox.h declares a macro that
436implements compile-time selection between xmalloc() and stack creation, so
437you can code the line in question as
438 RESERVE_BB_BUFFER(buffer, BUFSIZ);
439and the right thing will happen, based on the customer's configuration.
Mark Whitleya5b55ca2001-01-24 00:18:13 +0000440
441
Mark Whitleyd58ff872000-11-22 19:25:39 +0000442
443Miscellaneous Coding Guidelines
444-------------------------------
445
446The following are important items that don't fit into any of the above
447sections.
448
449
450Model Busybox Applets After GNU Counterparts
451~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
452
453When in doubt about the proper behavior of a Busybox program (output,
454formatting, options, etc.), model it after the equivalent GNU program.
455Doesn't matter how that program behaves on some other flavor of *NIX; doesn't
456matter what the POSIX standard says or doesn't say, just model Busybox
457programs after their GNU counterparts and nobody has to get hurt.
458
459The only time we deviate from emulating the GNU behavior is when:
460
461 - We are deliberately not supporting a feature (such as a command line
462 switch)
463 - Emulating the GNU behavior is prohibitively expensive (lots more code
464 would be required, lots more memory would be used, etc.)
Eric Andersen07309432000-11-29 22:12:19 +0000465 - The difference is minor or cosmetic
Mark Whitleyd58ff872000-11-22 19:25:39 +0000466
467A note on the 'cosmetic' case: Output differences might be considered
468cosmetic, but if the output is significant enough to break other scripts that
469use the output, it should really be fixed.
470
471
472Scope
473~~~~~
474
475If a const variable is used only in a single source file, put it in the source
476file and not in a header file. Likewise, if a const variable is used in only
477one function, do not make it global to the file. Instead, declare it inside
Eric Andersen07309432000-11-29 22:12:19 +0000478the function body. Bottom line: Make a conscious effort to limit declarations
Mark Whitleyd58ff872000-11-22 19:25:39 +0000479to the smallest scope possible.
480
481Inside applet files, all functions should be declared static so as to keep the
482global name space clean. The only exception to this rule is the "applet_main"
483function which must be declared extern.
484
485If you write a function that performs a task that could be useful outside the
486immediate file, turn it into a general-purpose function with no ties to any
487applet and put it in the utility.c file instead.
488
489
490Brackets Are Your Friends
491~~~~~~~~~~~~~~~~~~~~~~~~~
492
493Please use brackets on all if and else statements, even if it is only one
494line. Example:
Mark Whitley40bfc762000-07-24 22:36:06 +0000495
496 Don't do this:
497
498 if (foo)
Mark Whitley3680c582000-12-20 22:35:12 +0000499 stmt1;
500 stmt2
501 stmt3;
Mark Whitley40bfc762000-07-24 22:36:06 +0000502
503 Do this instead:
504
505 if (foo) {
Mark Whitley3680c582000-12-20 22:35:12 +0000506 stmt1;
Mark Whitley40bfc762000-07-24 22:36:06 +0000507 }
Mark Whitley3680c582000-12-20 22:35:12 +0000508 stmt2
509 stmt3;
Mark Whitley40bfc762000-07-24 22:36:06 +0000510
Mark Whitleyd58ff872000-11-22 19:25:39 +0000511The "bracketless" approach is error prone because someday you might add a line
512like this:
Mark Whitley40bfc762000-07-24 22:36:06 +0000513
514 if (foo)
Mark Whitley3680c582000-12-20 22:35:12 +0000515 stmt1;
Mark Whitley40bfc762000-07-24 22:36:06 +0000516 new_line();
Mark Whitley3680c582000-12-20 22:35:12 +0000517 stmt2
518 stmt3;
Mark Whitley40bfc762000-07-24 22:36:06 +0000519
Mark Whitleyd58ff872000-11-22 19:25:39 +0000520And the resulting behavior of your program would totally bewilder you. (Don't
521laugh, it happens to us all.) Remember folks, this is C, not Python.
522
523
524Function Declarations
525~~~~~~~~~~~~~~~~~~~~~
526
527Do not use old-style function declarations that declare variable types between
528the parameter list and opening bracket. Example:
529
530 Don't do this:
531
532 int foo(parm1, parm2)
533 char parm1;
534 float parm2;
535 {
536 ....
537
538 Do this instead:
539
540 int foo(char parm1, float parm2)
541 {
542 ....
543
544The only time you would ever need to use the old declaration syntax is to
Eric Andersen07309432000-11-29 22:12:19 +0000545support ancient, antediluvian compilers. To our good fortune, we have access
Mark Whitleyd58ff872000-11-22 19:25:39 +0000546to more modern compilers and the old declaration syntax is neither necessary
547nor desired.
548
Mark Whitley3680c582000-12-20 22:35:12 +0000549
550Emphasizing Logical Blocks
551~~~~~~~~~~~~~~~~~~~~~~~~~~
552
553Organization and readability are improved by putting extra newlines around
554blocks of code that perform a single task. These are typically blocks that
555begin with a C keyword, but not always.
556
557Furthermore, you should put a single comment (not necessarily one line, just
558one comment) before the block, rather than commenting each and every line.
559There is an optimal ammount of commenting that a program can have; you can
560comment too much as well as too little.
561
562A picture is really worth a thousand words here, so here is an example that
563illustrates emphasizing logical blocks:
564
565 while (line = get_line_from_file(fp)) {
566
567 /* eat the newline, if any */
568 if (line[strlen(line)-1] == '\n') {
569 line[strlen(line)-1] = '\0';
570 }
571
572 /* ignore blank lines */
573 if (strlen(file_to_act_on) == 0) {
574 continue;
575 }
576
577 /* if the search string is in this line, print it,
578 * unless we were told to be quiet */
579 if (strstr(line, search) && !be_quiet) {
580 puts(line);
581 }
582
583 /* clean up */
584 free(line);
585 }