August 2010 – Page 2 – The Rails Drop

Programming in C: Important Points on Data Types

Understanding `int` and `short`

The size of int in C can be either 16-bit or 32-bit, depending on the machine architecture and compiler implementation. Generally:

On older 16-bit systems, int is 16 bits.
On modern 32-bit and 64-bit systems, int is 32 bits.
short is often 16 bits, though it can vary based on the compiler.

Signed and Unsigned Characters

The signed and unsigned qualifiers can be applied to char. However, whether plain char (i.e., without signed or unsigned specified) is signed or unsigned is machine-dependent. This can affect operations involving negative values. For portability, it’s best to explicitly declare signed char or unsigned char when working with character data.

Implicit Type Conversions

When assigning between different data types, implicit conversions occur:

If x is float and i is int, the assignment x = i; converts i to float.
Conversely, i = x; truncates x to an integer, losing the decimal portion.

Understanding these implicit conversions is crucial to avoid unexpected results.

Specifying Types Correctly in Expressions

When writing expressions, you must ensure that constant arithmetic follows correct type rules. Consider the following example:

(float) a = (b - 18) * 7 / 9;

In this case, the constant division 7 / 9 is treated as integer division, which results in 0 (since both operands are integers). To ensure correct floating-point computation, use:

(float) a = (b - 18) * 7.0 / 9.0;

or explicitly cast part of the expression:

(float) a = (float)(b - 18) * 7 / 9;

This prevents unintended truncation and ensures proper floating-point arithmetic.

Integer Ranges Depend on Machine Architecture

The range of int and float types is machine-dependent. For example, on a system where int is 16 bits:

The total number of values is 2^16 = 65536.
Signed integers range from -32,768 to 32,767 (65536 / 2).

On a 32-bit system, the signed integer range extends to -2,147,483,648 to 2,147,483,647 (2^31).

Conditional Expressions and Type Conversion

Consider the following expression:

(n > 0) ? f : n;

where f is float and n is int. According to C’s type promotion rules, the entire expression evaluates to float regardless of the condition because float has a higher rank than int. This implicit promotion ensures consistency in expression evaluation.

Additional Considerations

Use sizeof() to determine data type sizes on different systems.
Be mindful of type conversions in mixed arithmetic operations.
Explicit casting is preferred when converting between types to avoid surprises.
Understand integer overflows, especially when working with large values.

By following these principles, you can write more predictable and portable C code!

Programming in C: Important Points to Remember About Variables

When working with variables in C, it’s crucial to follow best practices and be aware of certain language-specific behaviors. Here are some key points to keep in mind:

1. Avoid Variable Names That Start with an Underscore (`_`)

Variable names beginning with an underscore are often reserved for system and library routines. Using them can lead to unexpected conflicts.
Example (should be avoided): int _count = 10; // Might conflict with system-level identifiers
Instead, use meaningful names without underscores at the beginning: int count = 10;

2. Case Sensitivity in Variable Names

C distinguishes between uppercase and lowercase letters in variable names.
Example: int value = 10; int Value = 20; // Different from 'value' printf("%d %d", value, Value); // Output: 10 20

3. Significance of Name Length

At least the first 31 characters of an internal identifier (such as a variable or function name) are significant. This means that names longer than 31 characters might be truncated depending on the compiler.
Example: int thisIsAVeryLongVariableNameButOnlyFirst31CharactersMatter = 100;

4. External Variable Names and Linkers

External names (used in global scope) may be subject to restrictions imposed by the assembler or linker, rather than the C language itself.
Example: extern int globalCounter;

5. Character Set and Signedness

The C standard guarantees that characters in the machine’s standard printing character set will never have a negative value when stored in a char variable. However, whether char is signed or unsigned by default depends on the compiler and architecture.
Example: char c = 'A'; printf("%d", c); // Will always be non-negative for printable characters

Additional Tips:

Use meaningful and descriptive variable names to improve code readability.
Follow naming conventions, such as using snake_case or camelCase depending on coding standards.
Initialize variables before use to prevent undefined behavior.
Prefer const or enum over #define for defining constants.

By keeping these points in mind, you can write more robust and maintainable C programs.

Programming in C: A Small Note on Functions

Functions play a fundamental role in C programming, providing modularity, reusability, and better code organization. This article explores some key aspects of C functions, including their behavior, return values, and compilation.

Basic Function Concepts

Standard library functions like printf(), getchar(), and putchar() are commonly used in C. A C function cannot be split across multiple files; each function must be fully defined in one file.

The `main` Function and Return Values

The main function is the entry point of any C program. It returns an integer value, typically:

0 for normal termination
A nonzero value for erroneous termination

Example:

#include <stdio.h>

int main() {
    printf("Hello, World!\n");
    return 0; // Indicates successful execution
}

Function Prototypes and Argument Handling

A function prototype must match its definition and usage. If the number of actual arguments exceeds the number of formal parameters, the extra arguments are ignored. Conversely, if fewer arguments are passed, the missing parameters may contain garbage values.

Example:

#include <stdio.h>

void greet(char *name) {
    printf("Hello, %s!\n", name);
}

int main() {
    greet("Alice", "ExtraArg"); // Compiler warning: too many arguments
    return 0;
}

Variable Scope and Persistence

Automatic variables (local variables) do not retain their values across function calls unless declared as static.
Static variables retain their values between function calls.

Example:

#include <stdio.h>

void counter() {
    static int count = 0; // Retains value across calls
    count++;
    printf("Count: %d\n", count);
}

int main() {
    counter();
    counter();
    counter();
    return 0;
}

Output:

Count: 1
Count: 2
Count: 3

Return Statements and Garbage Values

A function must return a value if it is declared with a non-void return type. Failing to return a value results in undefined behavior.

Example:

int faultyFunction() {
    // No return statement (causes garbage value to be returned)
}

int main() {
    int value = faultyFunction();
    printf("Returned value: %d\n", value); // Unpredictable result
    return 0;
}

Compilation Across Multiple Files

C allows functions to be spread across multiple source files. Compilation can be done using the gcc command:

$ gcc main.c fun1.c fun2.c -o my_program

This links all object files together to produce the final executable.

Undefined Order of Function Execution

In an expression like:

x = function1() + function2();

The order of execution of function1() and function2() is unspecified. The C standard does not dictate which function gets evaluated first, leading to potential unpredictability in results.

Example:

#include <stdio.h>

int function1() {
    printf("Executing function1\n");
    return 5;
}

int function2() {
    printf("Executing function2\n");
    return 10;
}

int main() {
    int x = function1() + function2();
    printf("x = %d\n", x);
    return 0;
}

Output order may vary, so avoid relying on execution sequence in such cases.

Conclusion

Understanding C functions, their behavior, and proper usage is crucial for writing robust and portable code. Following best practices such as defining proper prototypes, handling return values correctly, and being aware of evaluation order can help avoid unexpected bugs in C programs.

Programming in a UNIX Environment: Essential Commands

Understanding Basic UNIX Commands

UNIX provides a powerful command-line environment where users interact with the system via a shell. Here, we explore some fundamental commands and concepts that every UNIX user should know.

User Identification Commands

`who am i`

This command displays the current user’s name along with the terminal (TTY), date, and time of login:

$ who am i
john_doe    tty1    Mar 19 10:15

`whoami`

This command prints only the username of the currently logged-in user:

$ whoami
john_doe

Directory Navigation

In UNIX systems:

The parent directory is denoted by ..
The current directory is denoted by .

Removing a Directory

rmdir <directory_name> removes an empty directory. To remove non-empty directories, use rm -r <directory_name>.

About the Shell

The shell is an ordinary program that interprets commands and executes them. It processes wildcards (like * and ?) before passing arguments to commands.

Using Wildcards in Commands

Wildcards allow flexible pattern matching in filenames.

`cat` Command and Wildcards

cat c* prints the contents of all files starting with ‘c’.
The * wildcard represents any string of characters.

`echo` Command

The echo command prints arguments to the terminal:

$ echo Hello, UNIX!
Hello, UNIX!

Wildcards with echo:

$ echo ch*
changelog checklists chapter1.txt

$ echo *
file1.txt file2.txt script.sh test.c

`rm` (Remove Files)

rm * deletes all files in the current directory (use with caution!).
rm *.txt deletes all .txt files.
rm te[a-z]* removes all files starting with te followed by a lowercase letter.

Advanced Wildcard Usage

The [ ] brackets specify a range of characters:

$ pr r[1234]*

Prints all files starting with ‘r’ followed by 1, 2, 3, or 4.

The ? wildcard matches exactly one character:

$ ls ?.txt

Lists files that have a single-character name followed by .txt.

Input and Output Redirection

UNIX allows redirecting command outputs using > and <.

Redirecting Output

ls > filelist saves the list of files into filelist.
cat file1.c file2.c > file3.c merges file1.c and file2.c into file3.c.
cat file4.c >> file3.c appends file4.c to file3.c.

Redirecting Input

pr -3 < filelist prints the contents of filelist in three-column format.
grep main < source.c searches for occurrences of ‘main’ in source.c.

Conclusion

Understanding these essential UNIX commands enhances productivity and efficiency when working in a UNIX-based environment. Wildcards, redirection, and basic command utilities provide a powerful toolkit for managing files, directories, and data. Master these, and you’ll navigate UNIX with ease!

Programming in the UNIX Environment: Essential Commands (ed, cat, ls, pr)

The UNIX environment provides a vast number of commands to help users manage files, edit text, and organize output efficiently. In this post, we will explore four fundamental commands: ed, cat, ls, and pr.

The `ed` Command: Line Editor

The ed command is a simple, line-based text editor that allows you to create and modify files.

Example Usage:

$ ed file1
no such file or directory

If the file does not exist, UNIX will display an error message. You can then create and edit it using the following steps:

Type a to enter append mode.
Enter the text you want to add.
Type . (a single period) on a new line to indicate that input is finished.
Save the file using w file1.
Exit ed using q.

$ ed file1
a
Enter the text
.
w file1
q

The `cat` Command: Viewing File Contents

The cat command is used to display the content of a file or concatenate multiple files.

Example Usage:

$ cat filename

To view multiple files together:

$ cat file1 file2

The `pr` Command: Formatting File Output

The pr command paginates the output of a file, making it easier to read.

Example Usage:

$ pr filename

To display multiple files side by side in parallel columns:

$ pr -m file1 file2

To format output into multiple columns:

$ pr -3 filename

The `ls` Command: Listing Files and Directories

The ls command is used to list files in the current directory.

Common Options:

ls – Lists all files in the directory.
ls DIRNAME – Lists files within a specified directory.
ls * – Lists all files, including those in subdirectories.
ls -t – Lists files sorted by modification time (newest first).
ls -l – Displays detailed information about each file.
ls -lt – Combines -l and -t to list files with details, sorted by most recent first.
ls -u – Shows files sorted by last access time.
ls -ult – Lists files by last accessed time with details.

Example Usage:

$ ls
$ ls -l
$ ls -lt
$ ls -u
$ ls -ult

Additional Notes

The ls command can be combined with other UNIX utilities like grep to filter specific results.
Modern alternatives to ed include vi, nano, and vim for a more interactive text editing experience.
The pr command can be useful when preparing text for printing.

These commands provide a foundation for working in the UNIX environment. Mastering them will help improve efficiency when managing files and navigating the system.

Programming in C: A Small Note on Arrays

Arrays in C are collections of elements of the same data type, stored in contiguous memory locations. They are indexed starting from 0, and the subscript (index) used to access an array element can be an expression that evaluates to an integer.

Accessing Arrays and Bounds

It is important to note that accessing an array outside its declared bounds does not necessarily produce an error, but it leads to undefined behavior. This means that the program may read or write unintended memory locations, potentially causing crashes or unexpected results.

Example of an Array Declaration

If an array is declared as:

int array[10] = {10};

The first element (array[0]) is initialized to 10.
All remaining elements (array[1] to array[9]) are automatically initialized to 0.

Incorrect Declaration Example

A common mistake in character array initialization:

char alpha[3] = {a, b, c};  // Incorrect

Here, a, b, and c are not enclosed in single quotes, so the compiler will not recognize them as character literals.

Correct Declaration Example

To correctly initialize a character array, use single quotes for characters:

char alpha[3] = {'a', 'b', 'c'};

Alternatively, a string (null-terminated character array) can be declared as:

char alpha[] = "abc";  // Automatically allocates space for 'a', 'b', 'c', and '\0'

Array Indexing with Expressions

C allows the use of expressions as array indices. For example:

int numbers[5] = {10, 20, 30, 40, 50};
int index = 2;
printf("%d", numbers[index + 1]);  // Output: 40

This flexibility allows dynamic indexing in programs.

Avoiding Out-of-Bounds Access

To prevent accessing elements outside the valid range, always ensure that indices are within the defined size of the array:

int arr[5] = {1, 2, 3, 4, 5};
int idx = 6;  // Out of bounds

if (idx >= 0 && idx < 5) {
    printf("%d", arr[idx]);
} else {
    printf("Index out of bounds!\n");
}

Summary

Arrays in C have zero-based indexing.
Accessing an index outside the declared range results in undefined behavior.
Partial initialization of an array fills the remaining elements with zeros (for static or global arrays).
Character arrays should use single quotes for characters ('a', 'b') and double quotes for strings ("abc").
Always validate array indices to prevent unintended memory access.

Understanding these fundamentals helps in writing safe and efficient C programs.

Understanding int and short

Signed and Unsigned Characters

Implicit Type Conversions

Specifying Types Correctly in Expressions

Integer Ranges Depend on Machine Architecture

Conditional Expressions and Type Conversion

Additional Considerations

1. Avoid Variable Names That Start with an Underscore (_)

2. Case Sensitivity in Variable Names

3. Significance of Name Length

4. External Variable Names and Linkers

5. Character Set and Signedness

Additional Tips:

Basic Function Concepts

The main Function and Return Values

Function Prototypes and Argument Handling

Variable Scope and Persistence

Return Statements and Garbage Values

Compilation Across Multiple Files

Undefined Order of Function Execution

Conclusion

Understanding Basic UNIX Commands

User Identification Commands

who am i

whoami

Directory Navigation

Removing a Directory

About the Shell

Using Wildcards in Commands

cat Command and Wildcards

echo Command

rm (Remove Files)

Advanced Wildcard Usage

Input and Output Redirection

Redirecting Output

Redirecting Input

Conclusion

The ed Command: Line Editor

Example Usage:

The cat Command: Viewing File Contents

Example Usage:

The pr Command: Formatting File Output

Example Usage:

The ls Command: Listing Files and Directories

Common Options:

Example Usage:

Additional Notes

Accessing Arrays and Bounds

Example of an Array Declaration

Incorrect Declaration Example

Correct Declaration Example

Array Indexing with Expressions

Avoiding Out-of-Bounds Access

Summary

Understanding `int` and `short`

1. Avoid Variable Names That Start with an Underscore (`_`)

The `main` Function and Return Values

`who am i`

`whoami`

`cat` Command and Wildcards

`echo` Command

`rm` (Remove Files)

The `ed` Command: Line Editor

The `cat` Command: Viewing File Contents

The `pr` Command: Formatting File Output

The `ls` Command: Listing Files and Directories