mesg, wall, w, kill, killall

on October 29, 2009 at 2:56 pm

msg

The Original Instant Messenger System
With the advent of instant messaging or chat programs we seem to have conquered the ubiquitous challenge of maintaining a real time exchange of information while not getting distracted by voice communication. But are these only in the domain of the fancy programs?

The instant messaging or chat concept has been available on *nix for quite a while. In fact, you have a full fledged secure IM system built right into Linux. It allows you to securely talk to anyone connected to the system; no internet connection is required. The chat is enabled through the commands â€“ write, mesg, wall and talk. Letâ€™s examine each of them.

The write command can write to a userâ€™s terminal. If the user has logged in more than one terminal, you can address a specific terminal. Here is how you write a message â€œBeware of the virusâ€ to the user â€œoracleâ€ logged in on terminal â€œpts/3â€:

# write oracle pts/3
Beware of the virus
ttyl

#
The Control-D key combination ends the message, returns the shell prompt (#) to the user and sends to the userâ€™s terminal. When the above is sent, the user â€œoracleâ€ will see on terminal pts/3 the messages:

Beware of the virus
ttyl
Each line will come up as the sender presses ENTER after the lines. When the sender presses Control-D, marking the end of transmission, the receiver sees EOF on the screen. The message will be displayed regardless of the current action of the user. If the user is editing a file in vi, the message comes and the user can clear it by pressing Control-L. If the user is on SQL*Plus prompt, the message still comes but does not affect the keystrokes of the user.

What if you donâ€™t want that slight inconvenience? You donâ€™t want anyone to send a message to you â€“ akin to â€œleave the phone off the hookâ€. You can do that via the mesg command. This command disables others ability to send you a message. The command without any arguments shows the ability:

# mesg
is y
It shows that others can write to you. To turn it off:

# mesg n
Now to confirm:

# mesg
is n
When you attempt to write to the usersâ€™ terminals, you may want to know which terminals have disabled this writing from others. The who -T command (described earlier in this installment) shows you that:

# who -TH
NAMEÂ Â Â Â Â Â LINEÂ Â Â Â Â Â Â Â Â TIMEÂ Â Â Â Â Â Â Â COMMENT
oracleÂ Â + pts/2Â Â Â Â Â Â Â Jan 11 12:08 (10.23.32.10)
oracleÂ Â + pts/3Â Â Â Â Â Â Â Jan 11 12:08 (10.23.32.10)
oracleÂ Â – pts/4Â Â Â Â Â Â Â Jan 11 12:08 (10.23.32.10)
rootÂ Â Â Â + pts/1Â Â Â Â Â Â Â Dec 26 13:42 (:0.0)
rootÂ Â Â Â ? :0Â Â Â Â Â Â Â Â Â Â Oct 23 15:32
The + sign before the terminal name indicates that it accepts write commands from others; the â€œ-â€œ sign indicates that it doesnâ€™t. The â€œ?â€ indicates that the terminal does not support writing to it, e.g. an X-window session.

What if you want to write to all the logged in users? Instead of typing to each user, use the wall command:

# wall
hello everyone
When sent, the following shows up on the terminals of all logged in users:

Broadcast message from oracleÂ (pts/2) (Thu JanÂ 8 16:37:25 2009):
hello everyone
This is very useful for root user. When you want to shutdown the system, unmount a filesystem or perform similar administrative functions you may want all users to log off. Use this command to send a message to all.

Finally, the program talk allows you to chat in real time.Â Just type the following:

# talk oracle pts/2
If you want to talk to a user on a different server â€“ prolin2 â€“ you can use

# talk oracle@prolin2 pts/2
It brings up a chat window on the other terminal and now you can chat in real time. Is it that different from a â€œprofessionalâ€ chat program you are using now? Probably not. Oh, by the way, to make the talk work, you should make sure the talkd daemon is running, which may not have been installed.

w
Yes, itâ€™s a command, even if itâ€™s just one letter long! The command w is a combination of uptime and who commands given one immediately after the other, in that order. Letâ€™s see a very common output without any arguments and options.

# w
Â 17:29:22 up 672 days, 18:31,Â Â 2 users,Â load average: 4.52,Â 4.54, 4.59
USERÂ Â Â Â TTYÂ Â Â Â Â FROMÂ Â Â Â Â Â Â Â Â Â Â Â Â LOGIN@Â Â IDLEÂ Â JCPUÂ Â PCPU WHAT
oracleÂ Â pts/1Â Â Â Â 10.14.105.139Â Â Â 16:43Â Â Â 0.00sÂ Â 0.06sÂ 0.01s w
oracleÂ Â pts/2Â Â Â Â 10.14.105.139Â Â Â 17:26Â Â 57.00sÂ Â 3.17sÂ 3.17s sqlplusÂ Â as sysdba
â€¦ and soÂ on â€¦
The output has two distinct parts. The first part shows the output of the uptime command (described above in this installment) which shows how long the server has been up, how many users have logged in and the load average for last 1, 5 and 15 minutes. The parts of the output have been explained under the uptime command. The second part of the output shows the output of the who command with the option -H (also explained in this installment). Again, these various columns have been explained under the who command.

If you rather not display the header, use the -h option.

#Â w -h
oracleÂ Â pts/1Â Â Â Â 10.14.105.139Â Â Â 16:43Â Â Â 0.00sÂ Â 0.02sÂ 0.01s w -h
This removes the header from the output. Itâ€™s useful in shell scripts where you want to read and act on the output without the additional burden of skipping the header.

The -s option produces a compact (short) version of the output, removing the login time, JPCU and PCPU times.

# w -s
Â 17:30:07 up 672 days, 18:32,Â Â 2 users,Â load average: 5.03,Â 4.65, 4.63
USERÂ Â Â Â TTYÂ Â Â Â Â FROMÂ Â Â Â Â Â Â Â Â Â Â Â Â Â IDLE WHAT
oracleÂ Â pts/1Â Â Â Â 10.14.105.139Â Â Â Â 0.00s w -s
oracleÂ Â pts/2Â Â Â Â 10.14.105.139Â Â Â Â 1:42Â sqlplusÂ Â as sysdba
You might find that the â€œFROMâ€ field is really not very useful. It shows the IP address of the same server, since the logins are all local. To save the space on the output, you may want to suppress that. The -f option disables printing of the FROM field:

# w -f
Â 17:30:53 up 672 days, 18:33,Â Â 2 users,Â load average: 4.77,Â 4.65, 4.63
USERÂ Â Â Â TTYÂ Â Â Â Â Â Â LOGIN@Â Â IDLEÂ Â Â JCPUÂ Â PCPU WHAT
oracleÂ Â pts/1Â Â Â Â Â 16:43Â Â Â 0.00sÂ 0.06sÂ Â 0.00s w -f
oracleÂ Â pts/2Â Â Â Â Â 17:26Â Â Â 2:28Â Â 3.17sÂ Â 3.17s sqlplusÂ Â as sysdba
The command accepts only one parameter: the name of a user. By default w shows the process and logins for all users. If you put a username, it shows the logins for that user only. For instance, to show logins for root only, issue:

# w -h root
rootÂ Â Â Â pts/1Â Â Â :0.0Â Â Â Â Â Â Â Â Â Â Â Â 26Dec08 13days 0.01sÂ Â 0.01s bash
rootÂ Â Â Â :0Â Â Â Â Â Â -Â Â Â Â Â Â Â Â Â Â Â Â Â Â Â 23Oct08 ?xdm?Â Â 21:13mÂ 1.81sÂ /usr/bin/gnome-session
The -h option was used to suppress displaying header.

kill
A process is running and you want the process to be terminated. What should you do? The process runs in the background so there is no going to the terminal and pressing Control-C; or, the process belongs to another user (using the same userid, such as â€œoracleâ€) and you want to terminate it. The kill command comes to rescue; it does what its name suggests â€“ it kills the process. The most common use is:

# kill
Suppose you want to kill a process called sqlplus issued by the user oracle, you need to know its processid, or PID:

# ps -aef|grep sqlplus|grep ananda
oracleÂ Â Â 8728 23916Â 0 10:36 pts/3Â Â Â 00:00:00 sqlplus
oracleÂ Â Â 8768 23896Â 0 10:36 pts/2Â Â Â 00:00:00Â grep sqlplus
Now, to kill the PID 8728:

# kill 8728
Thatâ€™s it; the process is killed. Of course, you have to be the same user (oracle) to kill a process kicked off by oracle. To kill processes kicked off by other users you have to be super user â€“ root.

Sometimes you may want to merely halt the process instead of killing it. You can use the option -SIGSTOP with the kill command.

# kill -SIGSTOP 9790
# ps -aef|grep sqlplus|grep oracle
oracleÂ Â Â 9790 23916Â Â 0 10:41 pts/3Â Â Â 00:00:00 sqlplusÂ Â as sysdba
oracleÂ Â Â 9885 23896Â 0 10:41 pts/2Â Â Â 00:00:00Â grep sqlplus
This is good for background jobs but with the foreground processes, it merely stops the process and removes the control from the user. So, if you check for the process again after issuing the command:

# ps -aef|grep sqlplus|grep oracle
oracleÂ Â Â 9790 23916Â 0 10:41 pts/3Â Â Â 00:00:00 sqlplusÂ Â as sysdba
oracleÂ Â 10144 23896Â 0 10:42 pts/2Â Â Â 00:00:00Â grep sqlplus
You see that the process is still running. It has not been terminated. To kill this process, and any stubborn processes that refuse to be terminated, you have to pass a new signal called SIGKILL. The default signal is SIGTERM.

# kill -SIGKILL 9790
# ps -aef|grep sqlplus|grep oracle
oracleÂ Â 10092 23916Â 0 10:42 pts/3Â Â Â 00:00:00 sqlplusÂ Â as sysdba
oracleÂ Â 10198 23896Â 0 10:43 pts/2Â Â Â 00:00:00Â grep sqlplus
Note the options -SIGSTOP and -SIGKILL, which pass a specific signal (stop and kill, respectively) to the process. Likewise there are several other signals you can use. To get a listing of all the available signals, you can use the -l (thatâ€™s the letter â€œLâ€, not the numeral â€œ1â€) option:

# kill -l
Â 1) SIGHUPÂ Â Â Â Â Â 2) SIGINTÂ Â Â Â Â Â 3) SIGQUITÂ Â Â Â Â 4) SIGILL
Â 5) SIGTRAPÂ Â Â Â Â 6) SIGABRTÂ Â Â Â Â 7) SIGBUSÂ Â Â Â Â Â 8) SIGFPE
Â 9) SIGKILLÂ Â Â Â 10) SIGUSR1Â Â Â Â 11) SIGSEGVÂ Â Â Â 12) SIGUSR2
13) SIGPIPEÂ Â Â Â 14) SIGALRMÂ Â Â Â 15) SIGTERMÂ Â Â Â 17) SIGCHLD
18) SIGCONTÂ Â Â Â 19) SIGSTOPÂ Â Â Â 20) SIGTSTPÂ Â Â Â 21) SIGTTIN
22) SIGTTOUÂ Â Â Â 23) SIGURGÂ Â Â Â Â 24) SIGXCPUÂ Â Â Â 25) SIGXFSZ
26) SIGVTALRMÂ Â 27) SIGPROFÂ Â Â Â 28) SIGWINCHÂ Â Â 29) SIGIO
30) SIGPWRÂ Â Â Â Â 31) SIGSYSÂ Â Â Â Â 34) SIGRTMINÂ Â Â 35) SIGRTMIN+1
36) SIGRTMIN+2Â 37) SIGRTMIN+3Â 38) SIGRTMIN+4Â 39) SIGRTMIN+5
40) SIGRTMIN+6Â 41) SIGRTMIN+7Â 42) SIGRTMIN+8Â 43) SIGRTMIN+9
44) SIGRTMIN+10 45) SIGRTMIN+11 46) SIGRTMIN+12 47) SIGRTMIN+13
48) SIGRTMIN+14 49) SIGRTMIN+15 50) SIGRTMAX-14 51) SIGRTMAX-13
52) SIGRTMAX-12 53) SIGRTMAX-11 54) SIGRTMAX-10 55) SIGRTMAX-9
56) SIGRTMAX-8Â 57) SIGRTMAX-7Â 58) SIGRTMAX-6Â 59) SIGRTMAX-5
60) SIGRTMAX-4Â 61) SIGRTMAX-3Â 62) SIGRTMAX-2Â 63) SIGRTMAX-1
64) SIGRTMAX
You can also use the numeral equivalent of the signal in place of the actual signal name. For instance, instead of kill -SIGKILL 9790, you can use kill -9 9790.

By the way, this is an interesting command. Remember, almost all Linux commands are usually executable files located in /bin, /sbin/, /user/bin and similar directories. The PATH executable determines where these command files can be found. Some other commands are an actually â€œbuilt-inâ€ command, i.e. they are part of the shell itself. One such example is kill. To demonstrate, give the following:

# kill -h
-bash: kill: h: invalid signalÂ specification
Note the output that came back from the bash shell. The usage is incorrect since the -h argument was not expected. Now use the following:

# /bin/kill -h
usage: kill [ -s signal | -p ]Â [ -a ] pid …
Â Â Â Â Â Â kill -l [ signal ]
Aha! This version of the command kill as an executable in the /bin directory accepted the option -h properly. Now you know the subtle difference between the shell built-in commands and their namesake utilities in the form of executable files.

Why is it important to know the difference? Itâ€™s important because the functionality varies significantly across these two forms. The kill built-in has lesser functionality than its utility equivalent. When you issue the command kill, you are actually invoking the built-in, not the utility. To add the other functionality, you have to use the /bin/kill utility.

The kill utility has many options and arguments. The most popular is the kill command used to kill the processes with process names, rather than PIDs. Here is an example where you want to kill all processes with the name sqlplus:

# /bin/kill sqlplus
[1]Â Â TerminatedÂ Â Â Â Â Â Â Â Â Â Â Â Â sqlplus
[2]Â Â TerminatedÂ Â Â Â Â Â Â Â Â Â Â Â Â sqlplus
[3]Â Â TerminatedÂ Â Â Â Â Â Â Â Â Â Â Â Â sqlplus
[4]Â Â TerminatedÂ Â Â Â Â Â Â Â Â Â Â Â Â sqlplus
[5]Â Â TerminatedÂ Â Â Â Â Â Â Â Â Â Â Â Â sqlplus
[6]Â Â TerminatedÂ Â Â Â Â Â Â Â Â Â Â Â Â sqlplus
[7]-Â TerminatedÂ Â Â Â Â Â Â Â Â Â Â Â Â sqlplus
[8]+Â TerminatedÂ Â Â Â Â Â Â Â Â Â Â Â Â sqlplus
Sometimes you may want to see all the process IDs kill will terminate. The -p option accomplishes that. It prints all the PIDs it would have killed, without actually killing them. It serves as a confirmation prior to action:

#Â /bin/kill -p sqlplus
6798
6802
6803
6807
6808
6812
6813
6817
The output shows the PIDs of the processes it would have killed. If you reissue the command without the -p option, it will kill all those processes.

At this time you may be tempted to know which other commands are â€œbuilt-inâ€ in the shell, instead of being utilities.

# man -k builtin
. [builtins]Â Â Â Â Â Â Â Â (1)Â Â – bash built-in commands, see bash(1)
: [builtins]Â Â Â Â Â Â Â Â (1)Â Â – bash built-in commands, see bash(1)
[ [builtins]Â Â Â Â Â Â Â Â (1)Â Â – bash built-in commands, see bash(1)
alias [builtins]Â Â Â Â (1)Â Â – bash built-in commands, see bash(1)
bash [builtins]Â Â Â Â Â (1)Â Â – bash built-in commands, see bash(1)
bg [builtins]Â Â Â Â Â Â Â (1)Â Â – bash built-in commands, see bash(1)
â€¦ and so on â€¦
Some entries seem familiar â€“ alias, bg and so on. Some are purely built-ins, e.g. alias. There is no executable file called alias.

Usage for Oracle Users
Killing a process has many uses â€“ mostly to kill zombie processes, processes that are in the background and others that have stopped responding to the normal shutdown commands. For instance, the Oracle database instance is not shutting down as a result of some memory issue. You have to bring it down by killing one of the key processes like pmon or smon. This should not be an activity to be performed all the time, just when you donâ€™t have much choice.

You may want to kill all sqlplus sessions or all rman jobs using the utility kill command. Oracle Enterprise Manager processes run as perl processes; or DBCA or DBUA processes run, which you may want to kill quickly:

# /bin/kill perl rman perl dbcaÂ dbua java
There is also a more common use of the command. When you want to terminate a user session in Oracle Database, you typically do this:

Find the SID and Serial# of the session
Kill the session using ALTER SYSTEM command
Letâ€™s see what happens when we want to kill the session of the user SH.

SQL> select sid, serial#,Â status
Â 2Â from v$session
Â 3* where username = ‘SH’;
Â Â Â Â Â Â SIDÂ Â Â SERIAL# STATUS
———- ———- ——–
Â Â Â Â Â Â 116Â Â Â Â Â Â 5784Â INACTIVE
Â
SQL> alter system killÂ session ‘116,5784’
Â 2Â /
Â
System altered.
Â
Itâ€™s killed; but when you check the status of the session:
Â
Â Â Â Â Â Â SIDÂ Â Â SERIAL# STATUS
———- ———- ——–
Â Â Â Â Â Â 116Â Â Â Â Â Â 5784 KILLED
It shows as KILLED, not completely gone. It happens because Oracle waits until the user SH gets to his session and attempts to do something, during which he gets the message â€œORA-00028: your session has been killedâ€. After that time the session disappears from V$SESSION.

A faster way to kill a session is to kill the corresponding server process at the Linux level. To do so, first find the PID of the server process:

SQL> select spid
Â 2Â from v$process
Â 3Â where addr =
Â 4Â (
Â 5Â Â Â Â select paddr
Â 6Â Â Â Â from v$session
Â 7Â Â Â Â where username =Â ‘SH’
Â 8Â );
SPID
————————
30986
The SPID is the Process ID of the server process. Now kill this process:

# kill -9 30986
Now if you check the view V$SESSION, it will be gone immediately. The user will not get a message immediately; but if he attempts to perform a database query, he will get:

ERROR at line 1:
ORA-03135: connection lostÂ contact
Process ID: 30986
Session ID: 125 Serial number:Â 34528
This is a faster method to kill a session but there are some caveats. The Oracle database has to perform a session cleanup–rollback changes and so on. So this should be performed only when the sessions are idle. Otherwise you can use one of the two other ways to kill a session immediately:

alter system disconnect sessionÂ ‘125,35447’ immediate;
alter system disconnect sessionÂ ‘125,35447’ post_transaction;

killall
Unlike the dual nature of kill, killall is purely a utility, i.e. this is an executable program in the /usr/bin directory. The command is similar to kill in functionality but instead of killing a process based on its PID, it accepts the process name as an argument. For instance, to kill all sqlplus processes, issue:

# killall sqlplus
This kills all processes named sqlplus (which you have the permission to kill, of course). Unlike the kill built-in command, you donâ€™t need to know the Process ID of the processes to be killed.

If the command does not terminate the process, or the process does not respond to a TERM signal, you can send an explicit SIGKILL signal as you saw in the kill command using the -s option.

# killall -s SIGKILL sqlplus
Like kill, you can use -9 option in lieu of -s SIGKILL. For a list of all available signals, you can use the -l option.

# killall -l
HUP INT QUIT ILL TRAP ABRT IOTÂ BUS FPE KILL USR1 SEGV USR2 PIPE ALRM TERM
STKFLT CHLD CONT STOP TSTP TTINÂ TTOU URG XCPU XFSZ VTALRM PROF WINCH IO PWR SYS
UNUSED
To get a verbose output of the killall command, use the -v option:

# killall -v sqlplus
Killed sqlplus(26448) with signal 15
Killed sqlplus(26452) with signal 15
Killed sqlplus(26456) with signal 15
Killed sqlplus(26457) with signal 15
â€¦ and so on â€¦
Sometimes you may want to examine the process before terminating it. The -i option allows you run it interactively. This option prompts for your input before killing it:

# killall -i sqlplus
Kill sqlplus(2537) ? (y/n) n
Kill sqlplus(2555) ? (y/n) n
Kill sqlplus(2555) ? (y/n) y
Killed sqlplus(2555) with signal 15
What happens when you pass a wrong process name?

# killall wrong_process
wrong_process: no processÂ killed
There is no such running process called wrong_process so nothing was killed and the output clearly showed that. To suppress this complaint â€œno process killedâ€, use the -q option. That option comes handy in shell scripts where you canâ€™t parse the output. Rather, you want to capture the return code from the command:

# killall -q wrong_process
# echo $?
1
The return code (shown by the shell variable $?) is â€œ1â€, instead of â€œ0â€, meaning failure. You can check the return code to examine whether the killall process was successful, i.e. the return code was â€œ0â€.

One interesting thing about this command is that it does not kill itself. Of course, it kills other killall commands given elsewhere but not itself.

Usage for Oracle Users
Like the kill command, the killall command is also used to kill processes. The biggest advantage of killall is the ability to display the processid and the interactive nature. Suppose you want to kill all perl, java, sqlplus, rman and dbca processes but do it interactively; you can issue:

# killall -i -p perl sqlplusÂ java rman dbca
Kill sqlplus(pgid 7053) ? (y/n) n
Kill perl(pgid 31233) ? (y/n) n
… and so on …
This allows you to view the PID before you kill them, which can be very useful.

Conclusion
In this installment you learned about these commands (shown in alphabetical order)

dig Â Â Â Â Â Â Â Â Â Â Â A newer version of nslookup
ifconfigÂ Â Â Â Â Â To display information on network interfaces
killÂ Â Â Â Â Â Â Â Â Â Â Â Â Â Kill a specific process
killallÂ Â Â Â Â Â Â Â Â Â Â Kill a specific process, a group of processes and names matching a pattern
mesgÂ Â Â Â Â Â Â Â Â To turn on or off the ability of others to display something on oneâ€™s terminal.
netstatÂ Â Â Â Â Â To display statistics and other metrics on network interface usage
nslookupÂ Â Â Â To lookup a hostname for its IP address or lookup IP address for its hostname on the DNS
talkÂ Â Â Â Â Â Â Â Â Â Â To establish an Instant Message system between two users for realtime chat
uptimeÂ Â Â Â Â Â Â How long the system has been up and its load average for 1, 5 and 15 minutes
wÂ Â Â Â Â Â Â Â Â Â Â Â Â Â Combination of uptime and who
wallÂ Â Â Â Â Â Â Â Â Â Â To display some text on the terminals of all the logged in users
whoÂ Â Â Â Â Â Â Â Â Â To display the users logged into the system and what they are doing
write Â Â Â Â Â Â Â Â To instantly display something on a specific userâ€™s terminal session

(Extracted from oracle technet notes author Arup Nanda)

isitdevops.com/databases

Categories

Categories

Meta

mesg, wall, w, kill, killall

Discussion ¬

Comment ¬ Cancel reply