analyzing the use of voip technology in collaborative modeling
TRANSCRIPT
-
8/12/2019 Analyzing the Use of VoIP Technology in Collaborative Modeling
1/6
4
Analyzing the use of VoIP Technology in Collaborative Modeling
Mauro C. Pichiliani1, Celso M. Hirata11Instituto Tecnologico de Aeronautica, Sao Paulo, Brazil
{pichilia,hirata}@ita.br
Abstract
Collaborative Editing Systems often use
communication channels in order to make thecollaboration more effective. In this work, we present a
case study of pair communication using audio via VoIP
(Voice over Internet Protocol) technology during
collaborative modeling sessions. The study provides an
analysis of audio and textual chat as communication
medium and presents data on usage patterns, user
interaction and attitudes when using collaborative
editing systems. The qualitative and quantitativeanalysis suggests that VoIP technology does have
advantages over textual chat in collaborative modeling
when used for communication.
Keywords: Audio conferencing, VOIP, Chat,
Collaboration.
1. Introduction
The ability to communicate is a well known factor
that directly affects how people collaborate in order toperform tasks which demand group work. Manycommunication channels have been used in real-time
synchronous collaborative editing systems (CES) to
promote the interaction among multiple users while
editing a shared document simultaneously.The common communication channels used to
promote the interaction during group work are text and
audio, which are provided by text-based chat and audio
conferencing tools, respectively. Despite the widespread
research in communication in multimedia conferencing
systems, as presented in [5, 6, 10, 11], few efforts have
been made to evaluate how the audio conferencing
features provided by the VoIP (Voice over InternetProtocol) technology can be used in applications which
are context dependent on collaborative requirements,
such as collaborative modeling. The features that the
VoIP technology offer include high-fidelity stereoaudio, low bandwidth cost and the ability to change
audio settings such as volume, echo feedback and level
of distortion.
The goal of this paper is to investigate how
communication provided by the VoIP technology
affects the collaboration in CES. The CES is a UML
(Unified Modeling Language) collaborative editor. In
order to investigate the benefits of VoIP in CES
sessions, we compare text and audio usage. A case
study was conducted in which users were divided into
pairs that had either an audio or a text-based channel to
communicate. Qualitative and quantitative analysis ofdata from the case study is presented to support the
findings that VoIP technology, when used in this
context, does have advantages over the text when it is
used for communication.
The rest of the paper is organized as follows. Related
work and background studies are presented in Section 2.
Section 3 describes a case study conducted to collect
empirical data about the use of text and audio as a
channel of communication. The section also presentsthe qualitative and quantitative analysis. Finally,
Section 4 presents the conclusions, comments and the
future work.
2. Related Work
In order to make the work more productive and
reduce the costs of physical group meetings, real-timesynchronous collaborative editing systems (CES) have
been employed to allow multiple users edit a document
simultaneously. Using CES users collaborate to
accomplish tasks which would otherwise be difficult forindividuals, such as those that require the synchronous
interaction of users located in distributed geographically
areas. However, the group work provided by CES can
become unproductive and expensive without
communication. In CES, communication is necessary,
among other reasons, to coordinate users in order to
define when they work and how they work.
According to Kraut et al. [8], coordination can bedefined as the activity of directing individuals' efforts
towards achieving common and explicitly recognized
goals. To study the coordination aspects of
communication researchers divide it into two types:
formal and informal.
On formal communication a coordination
mechanism is used in many degrees of formality. In this
type of communication the coordination is
accomplished by adherence to common rules,
regulations, and standard operating procedures, through
pre-established plans, schedules, forecasts and other
standardized communications. The formal coordinationmechanisms have in common communication that is
specified in advance, is unidirectional, and is relatively
impoverished [8].
Informal communication is defied by Whittaker et al.
[14] as a long intermittent conversation containing
multiple unplanned fragments that often lack openings
and closings. In their work Whittaker et al. indicate that
Proceedings of the 2009 13th International Conference on Computer Supported Cooperative Work in Design
978-1-4244-3535-7/09/$25.00 2009 IEEE
-
8/12/2019 Analyzing the Use of VoIP Technology in Collaborative Modeling
2/6
5
informal communication supports a number of different
functions, such as the execution of work-related tasks,
coordination of group activity, transmission of office
culture, and social functions.
The formal and informal types of communication are
related to the nature of interaction in terms of
scheduling, content or protocol, but do not necessarilycharacterize a particular modality of communication. It
means that either audio or text-based communication
can be formal or informal. However, the informalcommunication has been reported to account for over
30% of total work time, with over 90% of this time
being spent on unplanned conversations [8, 14].
In terms of functional characteristics, formal and
informal communications are best suited to different
types of activities. Formal communication tend to be
used for coordinating relatively routine transactions
within groups and organizations while informalcommunication support group coordination, especially
under conditions of uncertainty and novel or unplannedevents, which are likely to occur during the use of CES.
The common used communication channels in CESinclude the textual chat and the audio conference. There
are some CES that use video conferencing over Internet
as well; however, it is traditionally supported by
dedicated conferencing rooms, which cut out the
possibility of supporting informal communication, as
indicated by [11]. Our conjecture is that a combined use
of CES and video conference may be compromised
because both systems require a heavy communicationbandwidth in order to provide an acceptable quality of
service. We also conjecture that the usage of both
systems requires some formal coordination since thesystems usually have their own coordinationmechanisms whose usages may conflict with each
other.
A textual chat is defined as a "live" text-based
synchronous communication where a participant type a
message and it is immediately available to the group of
participants; other participants eventually read the
message and then write and send a response, which is
also immediately available to the group. Textual chatscan happen via instant messaging or via a virtual space,
called a "chat room". Audio conferencing is analogous:
a participant says a message on a microphone that
immediately is reproduced on the sound speaker of theother participants and vice-versa.
In general, textual chat is easier to install and set-up
than audio communication because of the weaker
requirements, i.e. low bandwidth and no need for
microphones or speakers. Conversely, the audio
conferencing systems are more complex to both
implement and use because they require more resources
[11]. In order to facilitate the use of audio conference incollaborative applications, many CSCW (Computer
Supported Cooperative Work) designers are employing
VoIP technology.
VoIP is a technology that allows the transmission of
audio over the IP network. The audio is encoded by a
codec, the software that define how the analog audio
signal is converted to digital stream and then backagain, and whether it is compressed along the way.
When using standard and mobile telephones it is only
possible to transmit audio at 8 kilohertz mono (8k)
while using VoIP technology it is technically possible
to transmit voice at CD quality (44.1k) or higher instereo.
The main reason to choose VoIP over telephone
technology to promote audio conference is the quality
of the audio. Yankelovich et al. [13] state that human
brains are tuned to understand speech, even under the
worst of audio conditions, but the clearer the audio
signal, the easier it is to understand. When an audiosignal is degraded due to low fidelity or background
noise, as in telephone calls, some problems occur. First,
listeners have to strain to hear the speech, thus
expending considerable mental effort to understand the
words. The effort makes it more difficult to focus on the
content of what is being said. Also, a degraded audiosignal makes considerably more difficult to understand
soft-spoken and accented speakers. Therefore, if theclarity of the audio signal is improved the effort needed
to understand the meaning of what a remote person says
decreases.
The VoIP technology became popular when VoIPproviders, such as Skype [12], offered telephony
services through Internet with low delay, high fidelity
voice quality, no jitter and low costs. Other VoIP
service providers developed applications that can be
used in any network that supports the IP protocol,
which are known as softphones in opposition to the real
world physical telephones.The Telecommunications and CSCW literature
contains many studies comparing the effects of audio
and text in different settings [3, 7, 10]. In most of those
studies the textual chat is often reported as an inferior
form of communication based on the arguments that itis less effective for building trust than audio or video
and that users prefer video and audio instead of text to
communicate. Whereas those studies evaluate many
collaborative settings, none of them takes in
consideration the context dependent nature of
collaborative modeling.
There are few studies in the CSCW literature that
evaluate the context dependent nature of collaborativemodeling without remote communication features.
Damm et al. [2] present a qualitative evaluation of
workspace awareness by analyzing the use of
Distributed Knight, a gesture-based diagramming toolthat supports distributed collaboration for modeling of
UML diagrams. Fidas et al. [4] examine the effect of
heterogeneous resources during computer-supported
problem solving promoted by the collaborative
modeling of an educational activity in a secondary
school. However, both studies do not evaluate the use of
audio as a communication channel.
-
8/12/2019 Analyzing the Use of VoIP Technology in Collaborative Modeling
3/6
-
8/12/2019 Analyzing the Use of VoIP Technology in Collaborative Modeling
4/6
7
3.2. Data Analysis
Qualitative and quantitative data were obtained from
questionnaires, interviews and observations conducted
during the experiment. By analyzing the answers of the
questionnaires from both groups we found that the
commentaries made by the students on the VoIP groupindicate the effect of the communication on the
collaboration. They also imply that the communicationwas somehow beneficial. A student commented the
following:
We can see that there was collaboration betweenthe pair mainly because of the VoIP. The doubts
between us were resolved in real time and the work was
done more quickly.
All the students from the VoIP group made brief and
positive commentaries about the quality and the ease of
use of the audio communication. In fact, the opinionthat the audio conferencing is beneficial was shared byall our respondents in the VoIP group, even those that
consider themselves as less active. Although the
questionnaire does not contain any question about the
direct effect of the communication channel, we couldnot perceive any positive mention about the effects of
the communication channel in the chat group. This
absence indicated that most students do not consider it
worth mentioning or do not like the textual chat.
The analysis of the chat transcripts suggests that
there were some coordination and awareness issues
caused by the nature of the textual chat. One specific
problem of the textual chat that affected thecoordination is the high number of mistyped words
found in the transcript. These mistyped words increased
the effort needed to understand the messages exchanged
by the students. The understanding problem did nothappen on the VoIP group, since the few misspelled
words were understood correctly by them. The reason
why the understanding problem does not happen in the
VoIP group probably is due to the high-quality audio
provided by the softphone that clarified the students
pronunciation of the words.
Some dialogs show that sometimes a student in the
chat group stopped working while waiting an answer
for a question that he had made. This behavior did nothappen in the VoIP groups probably due to the
synchronous property of the communication and the
short time spent to answer the questions made.
The students of the chat group demonstrated
difficulties to coordinate their actions. One of the
reasons for that is the lack of sufficient communication
related to the task they performed concurrently. An
example of this behavior is presented on the excerpt
below that was typed immediately after one student
realized that he duplicated the work of his partner:
On the next time we could try to coordinate our
actions to avoid the risk of one of us does the work thatthe other is doing.
Coordination problems did not happen in the VoIP
group. This may be explained by the fact that this group
not only exchanged more messages than the chat groupbut also communicated in a more elaborated way.
Analyzing the audio of the VoIP group we found that
students discussed alternative scenarios, created
hypothetical situations, taught each other on topics
related to UML modeling and even made assumptions
about the fictitious scenarios presented in each task,
among other discussions. The following suggestion,obtained from a student of the VoIP group, illustrates
how elaborated was the communication.
Lets suppose that in the class that you suggest we
create an attribute to store this property ()
This is evidence that the conversation in the VoIP
group was more elaborated than the conversation that
occurred in the chat group. In the chat group, however,
the students did not suggest any alternative solution or
possible scenarios. Instead of an elaborated
conversation, the transcript of the chat group shows thatduring the first and second tasks, for two distinct chat
pairs, one student assumed the role of a leader, stating
his ideas and saying to the other student what to do.
When this behavior happened the other student
followed the directions of the leader and occasionally
made some observations, thus assuming a passive role
in the collaboration. Other chat pairs showed a moredemocratic decision making process allowing studentsto exchange their strategies to model the diagrams.
When the leader behavior happened, it was possible
to note that the activities were not divided equally
between the pair. The measure used to make this claimis the number of elements created, modified and deleted
by each student in each pair of each group. In the VoIP
pairs, however, the activities were divided almost
equally. There is no data suggesting that the leader
behavior happened in the VoIP pairs.
In general, the qualitative analysis shows strong
evidence to advocate that the communication in the
VoIP group was more elaborated, contributed to a morericher discussion of ideas and promoted a more smooth
and useful conversation than the communication
produced by the chat group.
At the end of each task the students indicated to theobservers when they finished the diagrams. The average
time spent in the three tasks performed by the chat
group was almost 30% less then the average time that
the VoIP group spent in the same three tasks. One
reason to explain this difference is that talking is easier
and faster to communicate than typing [11], thus
students tend to express themselves more in the audio
channel than in the text channel. This is confirmed by
the fact that the chat groups do not communicate as
-
8/12/2019 Analyzing the Use of VoIP Technology in Collaborative Modeling
5/6
8
much as the VoIP groups, since the average word count
per minute of both students of the chat group, for the
three tasks, was 10.82 against the 26.15 value for the
same metric in the VoIP group, which is more than
twice.
Although the VoIP pairs took longer to communicate
and their communication was more elaborated than thepairs of the chat group, we cannot conclude that the
VoIP group was more efficient. To compare the
productivity it is necessary to evaluate the groupsperformances considering the effects of the
communication channel and the quality of the work
produced. With this in mind, we presented the diagrams
modeled in the experiment to two UML specialists that
evaluated them by assigning grades to objective
characteristics, such as the errors on the diagrams, the
clarity of the ideas and the representativeness of the
model.We are still analyzing the grades produced by the
specialists, but preliminary quantity analysis indicatesthat the collaborative models produced by the students
with a high level of interaction during the sessions (theVoIP group) received better grades than those made by
students that do not interact much with their partners
(the chat group).
Another metric calculated for this analysis is the
number of elements created, modified and deleted by
each student in each pair of each group. In order to
analyze these data it is useful to take into account the
nature of the task. For instance, in the third task, thestudents already have a diagram with a few elements, so
in this task the number of elements created and deleted
by the pair is less, on the proportion of 2:1, than thenumber for other tasks on the two groups. In the task 1and 2, which both require the creation of a diagram
from scratch, the number of elements created, modified
and deleted by the pair, as a whole, was almost equal in
both groups, with less than 5% of variation.
In order to further investigate the effort needed to
use the VoIP and the textual chat, questions were added
regarding the perception on how hard was the
communication during the experiment. At the end ofeach task the students were asked to rate their effort on
a 5 point Likert scale (1 is very hard 5 is very easy).
The data collected with the Likert scale suggest that
discussion was harder in the chat group. Thequantitative difference can be seen in the graph
presented in Figure 1, showing the average effort
perception in the three tasks of the experiment for each
group.In the first task, most of the students of the chat
group assigned the medium value of the Likert scale,
while the students of the VoIP group indicated that the
communication was easier, with one point of the Likert
scale of difference. In the second task, students of both
groups indicate the same values on the average
perception effort question. In the third task, the average
effort perception of the VoIP group was a little higher
than the chat group.
The analysis of the quantitative data suggest that the
VoIP group took longer to complete the tasks, discussed
more and produced more work than the chat group
during the tasks of the case study. The analysis also
shows that students of the VoIP group, in two of the
three tasks, spent less effort to communicate than the
group that used the textual chat.
Figure 1. Effort perception average for each
task separated by group.
Although only pairs of students were evaluated inthe experiment, we were able to simulate and observe
the interactions produced by small software
development teams when they meet to elaborate UML
diagrams. As our main focus in the experiment is toobserve the communication, we note that when the
quantity of users increase in the collaboration it is likely
that both the workload in the system and the required
mental effort to communicate also increase. To mitigate
these issues it is necessary to employ methods to
coordinate the dialogs and avoid the strain to hear and
understand the speech.
4. Conclusions, Comments and Future
Work
In this paper we presented a case study conducted to
analyze how communication, provided by the VoIPtechnology, affects the collaboration during
collaborative modeling sessions that require the
manipulation of UML diagrams.
The case study corresponds to a controlledexperiment that collected data about the usage patterns,
user interaction and attitudes. The qualitative analysis
of the data showed that the communication with high
quality audio was more elaborated, contributed to a
richer discussion of the ideas and promoted a more
smooth and useful conversation. The quantitative data
produced evidences that the participants discussed and
worked better when the audio is used. The study alsoprovides an analysis of the perception of effort by
reporting that users spent less effort when the audio
channel was used.
The evidences and the analysis of the data presentedin the case study led to the conclusion that high quality
audio does have advantages over text when promoting
communication during modeling with CES in terms of
-
8/12/2019 Analyzing the Use of VoIP Technology in Collaborative Modeling
6/6
9
quality of the discussions, the amount of work
produced, and effort made to communicate.
Whereas the evidence produced by the analysis of
the data suggest that high quality audio have more
benefits than text during collaborative modeling, it is
important to consider the limitations of the case study.
The main limitations of the case study are thedesignation of only two students to co-work in order to
produce UML diagrams, the lack of real world
scenarios to model, the restriction of two types of UMLdiagrams used on the experiment and the focus on the
analysis of tools effect instead of a focus on the effect
of functionality affordance. Also, the number of pairs
used for the groups in this experiment is too small and
may contribute to some bias in the results.
Future work includes the observation of how users
coordinate their activities on other semantic modeling
tasks and more detailed analysis of the data collectedduring the experiment, such as facial expressions and
quality of the diagrams that were produced. Theevaluation of the data in other collaborative scenarios
such as: (i) group session with more than twoparticipants, (ii) other kind of tasks, for instance,
collaborative drawing or text editing is also a possible
work, since the characteristics of other cases studies can
increase the knowledge of usage patterns, user
interaction and attitudes in a collaborative context.
The research proposed in this paper creates
precedent for a evaluation of high fidelity voice quality
in collaborative modeling. The quantitative andqualitative results presented can encourage the
developers of collaborative applications to consider the
use of VoIP technology in their prototypes andexperiments.
References[1] R.M. Araujo, F.M. Santoro and M.R.S. Borges, The
CSCW Lab for groupware evaluation, Proc. of the 8thCollaboration Research International Workshop onGroupware, La Serena, Chile, Sept. 1-4, 2002, pp. 222-
231.
[2] C. Damm and K. Hansen, An Evaluation of WorkspaceAwareness in Collaborative Gesture-based DiagrammingTools, Proc. of the 2004 HCI Conference, Leeds, UK,Sept. 6-10, 2004, pp. 25-50
[3] X. Ding, T. Erickson, W.A. Kellogg, S. Levy, J.E.Christensen, J. Sussman, T.V. Wolf and W.E. Bennett,An Empirical Study of the Use of Visually Enhanced
VoIP Audio Conferencing: The Case of IEAC, Proc. ofthe 2007 Conference on Human Factors in Computing
Systems, California, USA, Apr. 28 - May 3, 2007, pp.1019-1028.
[4] C. Fidas, V. Komis, S. Tzanavaris and N. Avouris,Heterogeneity of Learning Material in SynchronousComputer-supported Collaborative Modelling,
Computers & Education, 2005, 44(2), 135-154.
[5] R.E. Grinter and M.A. Eldrigde, y do tngrs luv 2 txtmsg?, Proc. of the 7th European Conference on
Computer Supported Cooperative Work, Bonn, Germany,
Sept. 16-20, 2001, pp. 219-238.
[6] M. Handel and J.D. Herbsleb, What Is Chat Doing in theWorkplace?, Proc. of the 9th ACM Conference onComputer Supported Cooperative Work, Lousiana, USA,
Nov. 16-20 ,2002, pp. 1-10.
[7] J.D. Herbsleb, D. Atkins, D.G. Boyer, M. Handel andT.A. Finholt, Introducing Instant Messaging and Chatinto the Workplace, Proc. of the 2002 HCI Conference,
Minneapolis, USA, Apr. 20-25, 2002, pp. 171-178.
[8] R.E. Kraut, R. Fish, R. Root and B. Chalfonte, InformalCommunication in Organizations: Form, Function, and
Technology. Claremont Symposium on Applied SocialPsychology, California, USA, Feb. 10, 1990, pp. 145-199.
[9] M.C. Pichiliani and C.H. Hirata, A Guide to MapApplication Components to Support Multi-user Real-timeCollaboration,Proc. of the 2nd International Conference
on Collaborative Computing: Networking, Applicationsand Worksharing,Georgia, USA, Nov. 17-20, 2006.
[10] T. Schliemann, T. Asting, A. Folstad and J. Heim,Medium Preference and Medium Effects in Person-person Communication, Proc. of the 20th ACM
Conference on Human Factors in Computing Systems,Minnesota, USA, Apr. 20-25, 2002, pp. 710-711.
[11] J. Scholl, J.D. McCarthy and R. Harr, AComparison of Chat and Audio in Media Rich
Environments, Proc. of the 11th ACM Conference onComputer Supported Cooperative Work, Alberta, Canada,Nov. 4-8, 2006, pp. 323-332.
[12] Skype, 2008, http://www.skype.com, Visited at04/02/2009.
[13] N. Yankelovich, J. Kaplan, J. Provino, M. Wesslerand J.M. DiMicco, Improving Audio Conferencing: AreTwo Ears Better than One? Proc. of the 11th ACMConference on Computer Supported Cooperative Work,Alberta, Canada, Nov. 4-8, 2006, pp. 333-342.
[14] S. Whittaker, D. Frohlich and O. Daly-Jones,Informal Workplace Communication: What is it Like andHow Might we Support it?, Proc. of the 12th ACMConference on Human Factors in Computing Systems,Massachusetts, USA, Apr. 24-28 , 1994, pp. 131-137.