Possible Core Theories for Software Engineeringdistinguishing two iterative loops: evolutionary prototyping, i.e., iteratively improving a software artifact over weeks or months; and

Possible Core Theories for Software EngineeringShort Paper

Paul RalphLancaster University

Lancaster, [email protected]

Abstract— Following recent calls for greater attention to theory in software engineering, this paper reviews five theories that provide insight into software engineering behavior – Complexity Theory, Sensemaking-Coevolution-Implementation Theory, the Theory of Boundary Objects, Transactive Memory Theory and the Theory of Cognitive Biases. Rather than providing contradictory explanations, these theories apply at different units of analysis and may therefore be used simultaneously to understand the same software engineering phenomena.

Index Terms— Process theory, complexity, SCI Theory, boundary objects, cognitive psychology, general theory.

I. INTRODUCTION

Following several calls for developing a general theory of software engineering (e.g., [1], [2]), Ralph et al. [3] suggested that a general theory may be developed by identifying and integrating a range of “core” theories. The purpose of this paper, then, is to review theories that are useful for analyzing software engineering (SE) behavior, but may be unfamiliar to many in the SE community. Five such theories are identified: the Theory of Cognitive Biases, Sensemaking-Coevolution-Implementation Theory, the Theory of Boundary Objects, Transactive Memory Theory and Complexity Theory. Rather than competing explanations, these theories can be applied simultaneously to the same context as they apply to different units of analysis, specifically, individual, group, process, artifact and project.

The reviewed theories are selected as they have been useful to the author in previous research. They are neither the only relevant theories nor necessarily the best ones. Moreover, this paper sharply distinguishes between theories and methods. SE Methods (e.g., Spiral, Waterfall, Scrum, Lean) purport to explain good ways of building software. SE process theories, in contrast, purport to explain how a phenomenon occurs, good or bad. Consequently, methods in general provide inappropriate foundations for theories [4] and this paper therefore focuses on theories, not methods.

II. FIVE POSSIBLE CORE THEORIES

A. Cognitive BiasesBeginning at the individual unit of analysis, a cognitive bias

is a systematic deviation from optimal judgment [5]. Some existing research has examined the role of cognitive biases in software engineering e.g. [6], [7]. Many cognitive biases are extraordinarily robust – they apply to diverse individuals in heterogenous situations and persist despite explicit attempts to mitigate them [8]. Biases are caused by various cognitive

phenomena including heuristics (mental shortcuts) and illusions (systematic misperceptions of reality) [9]. Some biases interact and reinforce each other, forming complexes of biases, or biasplexes [10]. Biasplexes, in turn, lead to behavioral antipatterns (i.e., systematic, repeated errors) [10], which may deleteriously affect software engineering projects [6]. Debiasing involves inhibiting biases or mitigating their effects[8], sometimes through sociotechnical intervention in the person-task system (Figure 1).

+ + –

Cognitive Phenomena

Heuristics

Illusions

Other

Behavioral Antipatterns

SE Project Success

Sociotechnical Interventions

Biases

Biasplexes

CausalityModerationComposition

Fig. 1. Cognitive Biases (adapted from [10])

Cognitive biases may affect software project participants in many ways. For example, suppose a team is asked to produce a video game for the iPhone. Due to miserly information processing (the tendency to avoid thinking too much) [11] the team simply accepts that the client requires the iPhone platform, rather than Android, Windows or a game console. In the first design meeting, the lead developer suggests making an adventure game. Due to the bandwagon effect (the tendency for beliefs to spread rapidly through groups), team members quickly accept the adventure game option without due consideration of alternatives. Now confirmation bias (the tendency to notice evidence supporting one’s beliefs and ignore evidence contrary to one’s beliefs) [11] takes hold, and the team misses warning signs that neither an adventure game nor the iPhone platform were good choices. When evidence of a mistake becomes overwhelming, team members irrationally defend their choices due to status quo bias (the tendency to irrationally prefer the status quo). Miserly information processing, the bandwagon effect, confirmation bias and status quo bias thus combine to create the inertia biasplex, which systematically reduces solution space exploration [10].

Practically speaking, then, the Theory of Cognitive Biases may be used to identify the causal mechanisms underlying problematic behavior in SE teams and identify simple, inexpensive techniques to debias the team and reduce bias-related anti-patterns. Prominent examples include requesting

several alternative solution concepts to improve solution space exploration and using planning poker [12] to inhibit insufficient adjustment bias during effort estimation. A less known technique involves assigning a Devil’s Advocate [13] to increase critical reflection in design or retrospective meetings.

B. Sensemaking-Coevolution-Implementation TheoryMoving from the individual to the process unit of analysis,

Sensemaking-Coevolution-Implementation Theory (SCI) posits that software development comprises three primary activities (Figure 2): forming and organizing beliefs about the project context (sensemaking); the iterative, mutual refinement of one’s understandings of the context and the software artifact (coevolution); and the realization of one’s beliefs in the form of a software artifact (implementation) [14]. SCI assumes a single design agent, which may be an individual or a team, but not multiple, conflicting teams. The design agent may engage in the three major activities in any sequence. This leads to distinguishing two iterative loops: evolutionary prototyping, i.e., iteratively improving a software artifact over weeks or months; and coevolution, i.e., “developing and refining together both the formulation of a problem and ideas for a solution, with constant iteration of analysis, synthesis and evaluation processes between the … problem space and solution space” {Dorst:2001tq, p. 434}, over minutes or hours. SCI is consistent with the assumption that system requirements are often an illusion [15] and the empirical model of design [16]. A survey of over 1300 developers supported SCI’s core propositions [17].

Mental Picture of Context

Sensemaking

Goals

Design Agent

Mental Picture of Design

Object

Implementation

Design Object

Primitives

Coevolution

Context

Constraints

InputOutputCompositionExecutesUnbounded Entity

Object

Mental Entity

Activity

Key

Fig. 2. SCI Overview (adapted from [14])

SCI has many important implications for SE research and practice. For example, the realization that problem framing and problem solving are interconnected undermines the logic of outsourcing development through tendering and fixed-price/-schedule contracts. Similarly, it suggests that dividing team members into “analysts”, “designers”, “programmers” and “testers” may engender a misleading sense of identity that hinders effective working as these role-names reflect a poor classification of activities. Furthermore, as SE projects depend on the effectiveness of coevolutionary processes, SCI motivates research on tools, techniques and methods for enhancing coevolution. In addition, SCI suggests that SE education programs should cover sensemaking, coevolution and implementation as core topics; however, an analysis of SE2004 (the ACM SE undergraduate model curriculum) revealed that it includes neither coevolution nor any other method of generating design candidates [18].

C. Boundary ObjectsSCI does not explicitly model artifacts produced during SE

other than the software itself. Boundary objects are a class of artifacts, which are simultaneously plastic enough to be used for different purposes in different domains but solid enough to retain their identities [19]. For example, a UML class diagram may be used to guide programming and to elicit feedback from a client, but obviously remains a class diagram in both cases. The term “boundary object” comes from the use of such diagrams to facilitate information sharing across the multifarious boundary between an SE team and a client organization. Although the concept is broad, not all objects are boundary objects. For example, source code is not usually plastic enough to be considered a boundary object, while the employee-management power dynamic is too effervescent.

Analyzing SE artifacts and processes using boundary objects reveals many interesting patterns. For example, designers use boundary objects to promote shared representation, mobilize for action and transform and legitimize design knowledge [19]. In a recent (unpublished) study we found that SE project participants externalize their cognition into boundary objects to remember (e.g., to-do list), communicate (e.g., informal diagram on whiteboard during design meeting) and manage complexity (e.g., entity-relationship model showing tables but not attributes). This suggests a fourth core activity in SCI where design agents externalize their cognition about the project context into conceptual models and their cognition about the design space into design models (Figure 3).

While the Theory of Boundary Objects does not imply specific guidelines or practices for SE, it is useful for both researchers and practitioners as an analytical lens. For example, we found that some project participants feel pressured to externalize their understanding of the context or design space so that, when they are away, their teammates could still access their knowledge. However, their understanding is often so complex that any external representation detailed enough to use effectively is too voluminous to use efficiently. This boundary objects paradox is an example of a behavioral antipattern discussed above.

Boundary Objects

Mental Picture of Context

Design Agent

Mental Picture of the Design

SpaceExternalises

Cognition

Design Models

Conceptual Models

Fig. 3. Adding Boundary Objects to SCI

D. Transactive Memory TheorySCI assumes that software is developed by a design agent,

which may be a team if the team acts in coordinated fashion toward a common agenda. This raises questions regarding the

relationship between coordination, cognition and performance, which Transactive Memory Theory may address.

Shifting, then, to the group unit of analysis, Transactive Memory is a theory of group mind, which posits that individuals have both memory and meta-memory [20]. It further posits that that team members become specialists (Figure 4), maintaining detailed knowledge of their specialism (memory) and superficial links to the specialisms of other team members (meta-memory). Team performance therefore depends on both task familiarity (memory) and team familiarity (meta-memory) [21], i.e., high performance necessitates that team members need to know not only how to do their part but also to whom to route tasks and requests outside of their speciality.

Specialist Specialist

Specialist Specialist

Specialist

meta-memory

memory

Fig. 4. Transactive Memory

Viewing a SE team as a transactive memory system implies that team performance fundamentally depends on individual skill and familiarity between team members. It suggests that interactions including peer programming, peer code reviews and daily stand-up meetings will have longterm positive performance impacts by increasing team familiarity [21], cf. [22]. Furthermore, in a recent (unpublished) study, we that transactive memory degradation helped explain the abandonment of an initiative to transition from an ad hoc methodology to Scrum. Practically speaking, Transactive Memory Theory implies that more direct attempts to share areas of expertise, rather than expertise itself, will increase team performance.

E. Complexity TheorySCI seeks to explain how complex software systems are

designed [14]. In Complexity Theory, “complexity” refers to the extent to which a system manifests behavioral properties not evident from its constituent parts [23], [24]. These “emergent” behaviors may arise from the (large) number of interconnections between components, from non-linear interactions between components or from poor understanding of the domain. Complex systems are also self-organizing, in that a high-level order spontaneously emerges from low-level interactions among initially unordered components. Complexity Theory has been used to explain phenomena as diverse as increasing returns (in economics) and how more complex organisms evolve from less complex ones [25]. Complexity Theory refers to the scientific study of complex

systems. (Like game theory, it is a collection of concepts and an area of study rather than a set of hypotheses.)

Moving up to the project unit of analysis, SE projects and the organizations that execute them can be modeled as complex systems. More specifically, SE projects are complex adaptive systems [26], [27], i.e., complex systems that modify themselves based on feedback (Figure 5). Doing so entails several implications for SE practice including a controversial outlook on the relationship between project complexity and planning. Complexity theory posits that higher complexity implies lower predictability; therefore, planning is more difficult and less valuable in more complex projects. This further implies that more linear, plan-driven methods will be more effective for simple, routine SE projects while more iterative, reactive methods will be more effective for more complex, innovative SE projects. The idea that the utility of planning is inversely related to project complexity directly opposes much of the methods literature where proponents of both Agile and plan-driven methods agree that plan-driven methods are more appropriate for larger, more complex projects while Agile methods are more appropriate for smaller, less complex projects [12], [28]. In a recent (unpublished) study, we observed an SE team confronted with an initially simple project that appeared to gain complexity almost daily. Consistent with Complexity Theory, the team abandoned planning initiatives and adopted practices to become more responsive rather than proactive.

Fig. 5. Model of a Complex Adaptive System (from Wikimedia Commons)

III. DISCUSSION AND CONCLUSION

The five theories discussed above may be used simultaneously to analyze SE phenomenon at different units of analysis. Cognitive biases occur mostly at the individual level, transactive memory describes group-level phenomena, SCI sits at the process level, Boundary Objects are artifacts, and Complexity Theory is especially useful for understanding projects.

Moreover, many interconnections between the theories are evident. For example, cognitive biases including myside bias may inhibit efficient processing in a transactive memory system. Furthermore, the effectiveness of different kinds of boundary objects (e.g., formal vs. informal models) may

depend on the complexity of the project. In addition, Transactive Memory Theory clarifies SCI’s design agent assumption; Complexity Theory clarifies SCI’s complex design object assumption; cognitive biases explain common problems encountered during coevolution (e.g., poor solution space exploration) and boundary objects suggest a fourth core SCI process (i.e., design agents externalize their cognition using boundary objects).

In conclusion, this paper describes five theories from SE and reference disciplines, which are useful for explaining and understanding SE behavior. These theories do not individually or jointly constitute a general theory of SE; rather, they are part of the theoretical foundation necessary to construct a more general theory.

Therefore, this paper’s key contribution is to advance the general theory of software engineering agenda by 1) reaffirming that SE theory is important; 2) acknowledging that a general theory may be developed by integrating existing relevant theories from SE and reference disciplines; and 3) discussing five relevant theories that may be new to many SE researchers. Future research is needed to identify more relevant theories and better integrate them to form a more general theory. This paper begins this process by noting that the reviewed theories are broadly consistent but apply at different units of analysis.

ACKNOWLEDGMENT

Thanks are due to Gerasimos Balis, Ammar Hamamra, Daniel Kershaw, Eduardo Narros, Petr Shportun and Ben Shreeve for their efforts on the three unpublished studies mentioned in this paper.

REFERENCES

[1] P. Johnson, M. Ekstedt, and I. Jacobson, “Where's the Theory for Software Engineering?,” IEEE Software, vol. 29, no. 5, pp. 94–96, 2012.

[2] SEMAT, “SEMAT Call for Action,” semat.org. [Online]. Available: http://semat.org/?page_id=2. [Accessed: 21-Nov-2012].

[3] P. Ralph, P. Johnson, and H. Jordan, “Report on the First SEMAT Workshop on a General Theory of Software Engineering (GTSE 2012),” ACM SIGSOFT Software Engineering Notes, vol. 38, no. 2, 2013.

[4] P. E. Vermaas and K. Dorst, “On the Conceptual Framework of John Gero's FBS-Model and the Prescriptive Aims of Design Methodology,” Design Studies, vol. 28, no. 2, pp. 133–157, 2007.

[5] A. Tversky and D. Kahneman, “1. Judgment under uncertainty: Heuristics and biases,” 2000.

[6] J. Parsons and C. Saunders, “Cognitive Heuristics in Software Engineering: Applying and Extending Anchoring and Adjustment to Artifact Reuse,” IEEE Transactions on Software Engineering, vol. 30, pp. 873–888, 2004.

[7] W. Stacy and J. MacMillan, “Cognitive bias in software engineering,” Communications of the ACM, vol. 38, no. 6, pp. 57–63, 1995.

[8] R. P. Larrick, “Debiasing,” in Blackwell Handbook of Judgment and Decision Making, Blackwell Publishing Ltd, 2008, pp. 316–338.

[9] R. F. Pohl, Ed., Cognitive Illusions. East Sussex, UK: Psychology Press, 2004.

[10] P. Ralph, “Toward a Theory of Debiasing Software Development,” in Research in Systems Analysis and Design: Models and Methods: 4th SIGSAND/PLAIS EuroSymposium 2011, no. 93, S. Wrycza, Ed. Gdansk, Poland: Springer, 2011, pp. 92–105.

[11] K. Stanovich, What Intelligence Tests Miss: The Psychology of Rational Thought. New Haven, CT, USA: Yale University Press, 2009.

[12] K. Beck, Extreme Programming eXplained: Embrace Change, 2nd ed. Boston, MA, USA: Addison Wesley, 2005.

[13] C. R. Schwenk, “Effects of devil's advocacy and dialectical inquiry on decision making: A meta-analysis,” Organizational Behavior and Human Decision Processes, vol. 47, no. 1, pp. 161–176, 1990.

[14] P. Ralph, “The Sensemaking-Coevolution-Implementation Theory of Software Design,” arXiv.org, vol. cs.SE. 17-Feb-2013.

[15] P. Ralph, “The Illusion of Requirements in Software Development,” Requirements Engineering, 2012.

[16] P. Ralph, “Introducing an Empirical Model of Design,” In Proceedings of MCIS 2011, Limassol, Cyprus, 2011.

[17] P. Ralph, “Comparing Two Software Design Process Theories,” in Proceedings of DESRIST, LNCS 6105, St. Gallen, Switzerland, 2010, vol. 6105, pp. 139–153.

[18] P. Ralph, “Improving coverage of design in information systems education,” in Proceedings of ICIS 2012, Orlando, FL, USA, 2012.

[19] M. Bergman, K. Lyytinen, and G. Mark, “Boundary Objects in Design: An Ecological View of Design Artifacts,” Journal of the AIS, vol. 8, no. 11, pp. 546–568, 2007.

[20] D. M. Wegner, “Transactive memory: A contemporary analysis of the group mind,” in Theories of group behavior, B. Mullen and G. R. Goethals, Eds. New York: Springer, 1987, pp. 185–208.

[21] J. A. Espinosa, A. S. Sandra, E. K. Robert, and D. H. James, “Familiarity, Complexity and Team Performance in Geographically Distributed Software Development,” Organization Science, vol. 18, no. 4, pp. 613–630, 2007.

[22] C. Schmidt, K. Spohrer, T. Kude, and A. Heinzl, “The Impact of Peer-Based Software Reviews on Team Performance: The Role of Feedback and Transactive Memory Systems,” in Proceedings of ICIS 2012, Orlando, FL, USA, 2012.

[23] J. H. Holland, “Complex Adaptive Systems,” Daedalus, vol. 121, no. 1, pp. 17–30, Jan. 1992.

[24] M. Gell-Mann, “Complex adaptive systems,” in Complexity: Metaphors, models and reality, Westview Press, 1999, pp. 17–45.

[25] M. M. Waldrop, Complexity: the emerging science at the edge of order and chaos. New York, USA: Simon and Schuster, 1992.

[26] P. Anderson, “Complexity Theory and Organization Science,” Organization Science, vol. 10, no. 3, pp. 216–232, 1999.

[27] K. Kautz, “Beyond Simple Classifications: Contemporary Information Systems Development Projects as Complex Adaptive Systems,” in Proceedings of ICIS 2012, Orlando, Florida, USA, 2012, pp. 1–20.

[28] I. Jacobson, G. Booch, and J. Rumbaugh, The Unified Software Development Process. Boston, MA, USA: Addison-Wesley Longman Publishing Co., Inc., 1999.

http://semat.org/?page_id=2

http://semat.org/?page_id=2

Documents

Possible Core Theories for Software Engineeringdistinguishing two iterative loops: evolutionary prototyping, i.e., iteratively improving a software artifact over weeks or months; and