
Synthetic Literature. Writing Science Fiction in a Co-Creative Process

Enrique Manjavacas [1, 3], [email protected]

Folgert Karsdorp [2], [email protected]

Ben Burtenshaw [1, 3], [email protected]

Mike Kestemont [1, 3], [email protected]

Computational Linguistics & Psycholinguistics Research Center [1]
The University of Antwerp, Lange Winkelstraat 40-42, Antwerp, Belgium

Meertens Instituut [2]
Oudezijds Achterburgwal 185, 1012 DK Amsterdam, The Netherlands

Antwerp Centre for Digital Humanities and Literary Criticism [3]
The University of Antwerp, Prinsstraat 13, Antwerp, Belgium

Abstract

This paper describes a co-creative text generation system applied within a science fiction setting, to be used by an established novelist. The project was initiated as part of the Dutch Book Week, and the generated text will be published within a volume of science fiction stories. We explore the ramifications of applying Natural Language Generation within a co-creative process, and examine where the co-creative setting challenges both writer and machine. We employ a character-level language model trained on a large corpus of Dutch novels, which exposes a number of tunable parameters to the user. The system is used through a custom graphical user interface, which helps the writer to elicit, modify and incorporate suggestions by the text generation system. Besides a literary work, the output of the present project also includes user-generated meta-data that is expected to contribute to the quantitative evaluation of the text generation system and the co-creative process involved.

1 Introduction

In this paper we present ongoing work towards developing a text editing application, through which an established author of Dutch-language literary fiction will use an AI-based text generation system with the goal of producing a valuable piece of literature. Our aim is to create a stimulating environment that fosters co-creation: ideally, the machine should offer valuable suggestions, while the author retains a significant stake in the creative process.

The present project is part of a large-scale initiative by the CPNB (‘Stichting Collectieve Propaganda van het Nederlandse Boek’, ‘Collective Promotion for the Dutch Book’). In Fall 2017, CPNB will launch their annual media campaign, which this year focuses on robotics. To this end, CPNB will distribute a re-edition of the Dutch translation of Isaac Asimov’s I, Robot that is planned to include an additional piece written as part of a human-machine collaboration.

This project report is structured as follows. We first introduce the CPNB in greater detail, with special emphasis on their annual media campaign. We go on to introduce this year’s ‘Robotics’ theme, and the way it centers around Asimov’s I, Robot. Then, we concisely survey the state of the art in text generation from the point of view of co-creation between human and machine. Next, we describe our current text generation system, starting with the large body of Dutch-language fiction (4,000+ novels) that is at the basis of our experiments, as well as its preprocessing. We describe our choice of architecture for Natural Language Generation (NLG), a character-level Language Model (LM) based on Recurrent Neural Networks (RNN) with attractive properties for the present task, and discuss how author- and genre-specific voices can be implemented through fine-tuning of pre-trained LMs. We present ample examples to illustrate the model’s output for various settings. We also discuss possible ways to evaluate our system empirically, a common bottleneck of text generation systems, through the monitoring of the user’s behavior and selectional preferences. Finally, we discuss the design of the interface of our application, emphasizing various ways in which the author will be able to interact with the software.

1.1 Trust for the Collective Promotion of the Dutch-language Book

The CPNB¹ is a trust and PR agency based in The Netherlands that aims to promote the visibility of books and the publishing sector in Dutch society at large. The agency is responsible for a number of high-visibility annual initiatives, such as the ‘Boekenbal’ (‘Book ball’) and ‘Boekenweek’ (‘Book week’).

These initiatives often center around specific themes. For the 2017 campaign Nederland leest (‘The Netherlands reads’), the CPNB chose ‘robotics’ as the overarching theme, thereby further stimulating the debate about the societal opportunities and challenges that come with the increasing presence of artificial intelligence in everyday life, as well as in literature. The campaign, for instance, includes the distribution of promotional material for children (see Figure 1), as well as copies of I, Robot (1950), the well-known science fiction novel by Isaac Asimov, in its Dutch translation Ik, Robot (1966) by Leo Zelders, which serves as the focal point of the 2017 campaign. The novel is composed of interrelated short stories, prepublished in the journal Astounding Science Fiction between 1940 and 1950. They revolve around the fictional character of robot-psychologist Dr. Susan Calvin. The novel is especially famous because of the ‘Three Laws of Robotics’ which feature as an intriguing ethical backdrop.

¹ Stichting Collectieve Propaganda van het Nederlandse Boek: https://www.cpnb.nl.

The CPNB wanted to encourage debate about the role of AI and robotics in literature through the addition of a 10th short story co-created by an established fiction writer and a machine. An award-winning Dutch author, Ronald Giphart, agreed to take part in this experiment.

Figure 1: Make-it-yourself cardboard robot. Promotional material distributed as part of the 2017 ‘Nederland leest!’ campaign by the CPNB on robotics and books.

1.2 Co-creativity

Co-creativity is a collaborative process between multiple agents, where, in this context, one agent is a computational system. Davis sees co-creativity as the ‘blending’ of improvisational forces (Davis, 2013). This goes against the pragmatic distribution of labor that we might see in creative support systems, or in how computers are treated in everyday life, and invites them into an indistinct and overlapping process of creativity where, crucially, the result of the output is greater than ‘the sum of its parts’ (Davis, 2013).

Interestingly, as pointed out by existing literature (Lubart, 2005; Davis, 2013; Jordanous, 2017), the public are suspicious of systems that purport to be autonomous while in fact involving human participation. For Lubart, however, the opposite is the case: he reorients the scientific perception of these systems into one aligned with Human-Computer Interaction, where they are examples of successful facilitators of improvisation (Lubart, 2005). Lubart classifies co-creativity into four distinct roles for a computational system: ‘Computer as nanny’, ‘Computer as penpal’, ‘Computer as coach’, and ‘Computer as colleague’ (Lubart, 2005, p. 366). In this project we are most interested in achieving the last, though in practice, much of what our system does could be considered under the second. For a more thorough overview of co-creativity and its role within computational creativity research, see the proceedings of The International Conference on Computational Creativity 2012 (Maher, 2012), and for a broader view of the term in relation to computing, look to the work of Edmonds and Candy (Edmonds et al., 2005; Candy and Edmonds, 2002).

Developing NLG systems within a co-creative environment allows researchers to utilize the human agent within the system’s workflow, allowing for approaches that are potentially too experimental for a solely computational approach. Furthermore, co-creation adds a collaborative and challenging dimension to the process of writing, which in turn encourages the human writer. That said, though collaboration is commonplace in writing, it is not always welcome. The creative process of writing is associated with a fluidity that can easily be hindered or broken; Umberto Eco’s renowned How to Write a Thesis asserts that writers should nurture their process (Eco, 2015). In developing this system alongside novelist Ronald Giphart, we sought to apply our work within his established methodology in a way that enriches both parties.

From a technical point of view, it would be possible to limit the collaborative NLG system to an assistive role, solely aiding the writer. However, a valid collaboration should provoke and challenge the writer. It should test them, push them, and ask them to reconsider their approach. To achieve this balance we chose to treat the writer as a competent handler of text, completely capable of dealing with generated language, and unlikely to be overwhelmed. This approach certainly would not work for all applications, but seems appropriate for a professional science fiction writer.

As Natural Language Generation develops into a useful instrument in the creation of fictional prose, inherent questions arise around how computational systems relate to human writers. Nowhere are these questions more at home than in science fiction literature, where readers and writers are eager to explore the speculative limits of technology. This willingness allowed us to consider the practical implications and qualities of co-creative writing, and how they manifest within the interface itself (see Section 4).

2 Related Work

Natural Language Generation within a collaborative writing environment is an active area of research. The co-creative setting gives scope to apply experimental approaches within the dynamic context of a working process. Here we outline two established approaches: the structural diagrammatic approach, and the auto-completion approach. Ahn, Morbini and Gordon use causal graphs to map the narrative steps of a story, which the writer can manipulate into the eventual story structure; the system then uses probabilistic modeling to generate language around that skeleton (Ahn et al., 2016). This approach gives the system access to the abstract narrative core of a story’s structure; arguably, in doing so the system imposes upon the writer a far more structured approach than they are likely familiar with, whereas a collaborative system should be able to fit within a writer’s existing working process. Roemmele and Gordon offer a more hands-on approach to assistive writing. Their system, Creative Help, acts as a ‘Narrative Auto-Completion’, where the writer is prompted with possible sentences (Roemmele and Gordon, 2015). Though straightforward, this approach is highly intuitive and unobtrusive; however, the system risks fulfilling the role of tool rather than collaborator. Moreover, Creative Help is a retrieval-based system, as opposed to the generative approach presented below.

Narrative generation has been a central topic of computational creativity for decades. One of the first examples is Tale-Spin, a system that generates Aesop’s Fables guided by a user’s keyword suggestions (Meehan, 1977). More recently, and nearer to this project, McIntyre and Lapata developed a probabilistic narrative generator that uses user input to retrieve related phrases (McIntyre and Lapata, 2009). Our system differentiates itself from these by working at the character level: generated text reproduces the style and voice of its training material, but does not directly sample quotes verbatim from that material.

3 Method

3.1 Collection and Preprocessing

The first step in constructing our NLG system was to compile a sufficiently large corpus of literary works. In the present study, we employ a large collection of Dutch novels in epub format (Williams, 2011), which contains a diverse set of novels in terms of genre, and is heterogeneous in style. In total, the collection consists of 4,392 novels, written by approximately 1,600 different authors. The average number of novels written by each author is 2.5. The large standard deviation of 6.5 is caused by the skewed distribution in which a few authors contribute relatively large oeuvres, such as detective writer Appie Baantjer. The novels were tokenized into words, sentences and paragraphs using the tokenizer Ucto, which was configured for the Dutch language (Van Gompel et al., 2012). The total number of sentences, words and characters in the tokenized collection (including punctuation) amounts to approximately 24.6M, 425.5M, and 2.1G, respectively. On average, each novel consists of 3k sentences, 59k words, and 309.5k characters.
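As a rough illustration of this preprocessing step, the sketch below shells out to the Ucto command-line tokenizer. The file layout and paths are our assumptions for illustration (the epubs are assumed to have been converted to plain text beforehand), and the exact language code accepted by `-L` may vary between Ucto versions.

```python
import subprocess
from pathlib import Path

def tokenize_novels(txt_dir: str, out_dir: str) -> None:
    """Tokenize plain-text exports of the novels with Ucto (Dutch rules).

    A minimal sketch: one input file per novel; `-L nld` selects the
    Dutch tokenization configuration.
    """
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    for novel in sorted(Path(txt_dir).glob("*.txt")):
        subprocess.run(
            ["ucto", "-L", "nld", str(novel), str(out / novel.name)],
            check=True,
        )
```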

3.2 Character-level Language Models for NLG

The aim of this project is to contribute to literary writing in a co-creative environment, as opposed to solely narrative generation. We therefore approach NLG using character-level Language Models (LMs), which typically reason at a local level, in the order of a few hundred characters. Because of this, the LM is only implicitly aware of the global narrative structure, but it is still powerful enough to capture sentence semantics in an unsupervised fashion.

An LM is a probabilistic model of linguistic sequences that estimates a probability distribution over a given vocabulary conditioned on the previous text (a left-to-right model). More formally, at a given step t, an LM defines a conditional probability, expressing the likelihood that a certain vocabulary item (typically a word or character) will appear next:

LM(w_t) = P(w_t \mid w_1, w_2, \ldots, w_{t-2}, w_{t-1})   (1)

Different LM implementations exist, which diverge in the manner in which they model the previous text. Given their probabilistic nature, LMs are straightforward to deploy for NLG. The generative process is defined by sampling a character from the output distribution at step t, which is then recursively fed back into the model, potentially preceded by the previous output of the model, to condition the next generation at step t + 1. Several decoding approaches can be implemented based on different sampling strategies. For instance, a rather naive approach is to select each character so as to maximize the generated sequence’s overall probability. However, for a large vocabulary size (e.g. in the case of a word-level model), the exact search soon becomes infeasible; therefore, approximate decoding methods, such as beam search, are used to find a near-optimal solution. When used for generation, the naive argmax decoding strategy has a tendency towards relatively repetitive sentences that are too uninspiring to be of much use in a creative setting. For the present work, we therefore decode new characters by sampling from the multinomial output distribution at each step.
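A minimal sketch of this decoding loop, under our own naming assumptions: `model(history)` stands in for any LM that returns a next-character distribution as in Equation 1, and `itos` is an assumed index-to-character mapping.

```python
import numpy as np

def sample_text(model, itos, seed, n_chars, rng=None):
    """Generate `n_chars` characters by repeated multinomial sampling.

    `model(history)` is assumed to return a 1-D array of next-character
    probabilities; `itos` maps vocabulary indices back to characters.
    """
    rng = rng or np.random.default_rng()
    history = seed
    for _ in range(n_chars):
        p = model(history)             # P(w_t | w_1, ..., w_{t-1})
        idx = rng.choice(len(p), p=p)  # sample rather than argmax
        history += itos[idx]           # feed the sample back into the model
    return history
```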

It is interesting to note that the different decoding approaches stand in a trade-off relationship between diversity and correctness. For example, whereas argmax decoding will tend to generate sentences that are very similar, general and monotonous, yet formally correct (i.e. more similar to the sentences observed in the training corpus), multinomial sampling will make the output diverge more from the original training data, and therefore produce a more varied output, with a tendency towards formally incorrect sentences. Focusing on our chosen approach, multinomial sampling, the described trade-off can be operationalized and used to our advantage by letting the author explore model parameters. This is implemented by exposing a parameter τ, commonly referred to as “temperature”, that controls the skewness of the model’s output distribution. Given the output distribution p = (p_1, p_2, \ldots, p_V) at a given step, a vocabulary size V, and the temperature value τ, we can compute a transformation p^τ of the original p through Equation 2:

p^{\tau}_i = \frac{p_i^{1/\tau}}{\sum_{j=1}^{V} p_j^{1/\tau}}   (2)

p^τ flattens the original distribution for higher values of τ, thereby ensuring more variability in the output. Conversely, for lower values of τ it skews the distribution, thereby favoring the originally more probable symbols. For τ values approaching zero, we recover the simple argmax decoding procedure of picking the highest-probability symbol at each step, whereas for high enough τ the LM degenerates into a random process in which, at any given step, all symbols are equally probable regardless of the history.
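Equation 2 is cheap to apply at decoding time. A direct numpy rendering (our sketch, not the system’s actual code) computes the transformation in log space for numerical stability:

```python
import numpy as np

def apply_temperature(p, tau):
    """Transform a probability distribution with temperature tau (Equation 2).

    tau > 1 flattens the distribution (more diverse output); tau < 1 skews
    it towards the most probable symbols; tau -> 0 approaches argmax decoding.
    """
    logits = np.log(np.clip(p, 1e-12, 1.0)) / tau  # p ** (1/tau), in log space
    p_tau = np.exp(logits - logits.max())          # stabilize before exponentiating
    return p_tau / p_tau.sum()                     # renormalize to sum to 1
```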

In terms of implementation, there are two major approaches to statistical language modeling: ngram-based LMs and RNN-based LMs.

3.2.1 Ngram Language Models

Ngram-based LMs (NGLMs) go back to at least the early 1980s, in the context of Statistical Machine Translation and Speech Recognition (Rosenfeld, 2000). An NGLM is a direct application of the Markov assumption to the task of estimating the next-character probability distribution, i.e. it uses a fixed-length ngram prefix to estimate the next-character probability distribution. An NGLM is basically a conditional probability table for Equation 1 that is estimated on the basis of the count data for ngrams of a given length n. Typically, NGLMs suffer from a data sparsity problem, because with larger values of n many possible conditioning prefixes will not be observed in the training data, and the corresponding probability distribution cannot be estimated. To alleviate the sparsity problem, techniques such as smoothing and back-off models (Chen and Goodman, 1999) can be used, to either reserve some probability mass to redistribute across unobserved ngrams (smoothing), or resort to a lower-order model to provide an approximation to the conditional distribution of an unobserved ngram (back-off models).
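To make the count-based formulation concrete, here is a toy character NGLM. Our system does not use this model; the add-one (Laplace) smoothing is one of the simplest of the smoothing techniques referenced above, chosen here purely for brevity.

```python
from collections import Counter, defaultdict

class CharNGramLM:
    """Toy character-level NGLM with add-one (Laplace) smoothing."""

    def __init__(self, n, text):
        self.n = n
        self.vocab = sorted(set(text))
        self.counts = defaultdict(Counter)
        # Count how often each character follows each (n-1)-character prefix.
        for i in range(len(text) - n + 1):
            prefix, nxt = text[i:i + n - 1], text[i + n - 1]
            self.counts[prefix][nxt] += 1

    def prob(self, prefix, char):
        """P(char | last n-1 characters), smoothed over the vocabulary."""
        c = self.counts[prefix[-(self.n - 1):]]
        return (c[char] + 1) / (sum(c.values()) + len(self.vocab))
```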

3.2.2 RNN-based Language Models

More recently, a new class of LMs based on Recurrent Neural Networks (Elman, 1990) has been introduced (Bengio et al., 2003; Mikolov, 2012) and has quickly increased in popularity due to better theoretical properties (no Markov assumption), expressive capabilities (information flow through very long sequences) and performance gains. An RNNLM processes an input sequence one step t at a time, feeding the input symbol x_t through three affine transformations with their corresponding non-linearities. First, the one-hot encoded input vector is projected into an embedding space of dimensionality M through w_t = W^m x_t, where W^m ∈ R^{M×V} is an embedding matrix. Secondly, the resulting character embedding w_t is fed into an RNN layer that computes a hidden activation h_t as a combination of w_t with the hidden activation of the previous step h_{t-1}, as shown formally in Equation 3:

h_t = \sigma(W^{ih} w_t + W^{hh} h_{t-1} + b_h)   (3)

where W^{ih} ∈ R^{H×M} and W^{hh} ∈ R^{H×H} are respectively the input-to-hidden and hidden-to-hidden projection matrices, b_h is a bias vector, and σ is the sigmoid non-linearity. Finally, the hidden activation h_t is projected into the vocabulary space of size V, followed by a softmax function that turns the output vector into a valid probability distribution. Formally, the probability of character j at step t is defined by

P_{t,j} = \frac{e^{o_{t,j}}}{\sum_{k=1}^{V} e^{o_{t,k}}}   (4)

where o_{t,j} is the j-th entry in the output vector o_t = W^{ho} h_t, and W^{ho} ∈ R^{V×H} is the hidden-to-output projection.
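A literal numpy transcription of Equations 3 and 4 may help fix the shapes; this is a pedagogical sketch of a single forward step, not our training code (which uses gated cells, see below).

```python
import numpy as np

def rnn_step(x_onehot, h_prev, W_m, W_ih, W_hh, W_ho, b_h):
    """One step of the vanilla RNNLM of Equations 3-4.

    x_onehot: (V,) one-hot input; h_prev: (H,) previous hidden state.
    W_m: (M, V) embedding; W_ih: (H, M); W_hh: (H, H); W_ho: (V, H).
    """
    w_t = W_m @ x_onehot                                          # embedding lookup
    h_t = 1.0 / (1.0 + np.exp(-(W_ih @ w_t + W_hh @ h_prev + b_h)))  # Eq. 3 (sigmoid)
    o_t = W_ho @ h_t                                              # project to vocabulary
    p_t = np.exp(o_t - o_t.max())                                 # Eq. 4 (stable softmax)
    return h_t, p_t / p_t.sum()
```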

In practice, training an RNN is difficult due to the vanishing gradient problem (Hochreiter, 1998), which makes it hard to apply the back-propagation learning algorithm (Rumelhart et al., 1986) for parameter learning over very long sequences. Therefore, it is common to implement the recurrent layer using an enhanced RNN version to compute h_t, such as the Long Short-Term Memory (LSTM) (Hochreiter and Schmidhuber, 1997) or the Gated Recurrent Unit (GRU) (Cho et al., 2014), which add an explicit gating mechanism to the traditional RNN in order to control the preservation of information in the hidden state over very long sequences.

3.2.3 Model

For the present study, we implement several variations of the RNNLM, varying the type of the recurrent cell (LSTM, GRU) as well as the values of different parameters, such as the dimensionality of the character embedding matrix W^m (24, 46, ...) and, more importantly, the size of the hidden layer H (1024, 2048, ...). We train our models through back-propagation using Stochastic Gradient Descent (SGD), clipping the gradients before each batch update to a maximum norm value of 5 to avoid the exploding gradient problem (Pascanu et al., 2013), and truncating the gradient during back-propagation to a maximum of 200 recurrent steps to ensure sufficiently long dependencies in the sequence processing. Finally, dropout (Srivastava et al., 2014) is applied after each recurrent layer, following Zaremba et al. (2015), to prevent overfitting during full-model training.
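In outline, one such update step could look as follows in PyTorch. Only the clipping norm of 5 is taken from the description above; the model interface (a forward pass returning per-step logits plus a hidden state), the data layout, and the optimizer settings are placeholder assumptions.

```python
import torch
import torch.nn.functional as F

def train_step(model, optimizer, batch_x, batch_y, clip_norm=5.0):
    """One SGD update on a batch of character sequences.

    batch_x, batch_y: LongTensors of shape (batch, seq_len); batch_y is
    batch_x shifted one character to the left (next-character targets).
    """
    optimizer.zero_grad()
    logits, _ = model(batch_x)                     # (batch, seq_len, V)
    loss = F.cross_entropy(logits.reshape(-1, logits.size(-1)),
                           batch_y.reshape(-1))
    loss.backward()
    # Rescale gradients whose global norm exceeds 5 (exploding gradients).
    torch.nn.utils.clip_grad_norm_(model.parameters(), clip_norm)
    optimizer.step()
    return loss.item()
```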

3.2.4 Overfitting

After training the full model, we experiment with further fine-tuning on different author- and genre-specific subsets to steer the NLG towards a particular style. To enforce this effect, we drive the training towards overfitting, introducing an intentional bias in the model’s predictions towards sequences that are more likely in that particular subset of books. We achieve overfitting by zeroing the dropout rate and running numerous passes through the subset training data with a sufficiently small learning rate. We have already observed interesting stylistic and genre properties in the fine-tuned models’ output; see Section 4.2 for an illustration. That said, how differing degrees of overfitting affect the quality of the generated output still has to be evaluated. The degree of overfitting can easily be quantified and monitored by plotting batch-averaged perplexity values achieved by the model for both the training data and the validation split, as shown in Figure 2.
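The recipe reduces to a short loop: disable dropout, lower the learning rate, and keep cycling through the subset while watching train versus validation perplexity diverge. The sketch below assumes the `train_step` helper from the previous sketch plus an `evaluate` function returning mean cross-entropy; all names and hyperparameter values are illustrative, not our actual settings.

```python
import math
import torch

def finetune_to_overfit(model, subset_batches, val_batches, epochs=50, lr=1e-4):
    """Deliberately overfit a pre-trained LM on an author/genre subset."""
    for module in model.modules():                 # zero all dropout rates
        if isinstance(module, torch.nn.Dropout):
            module.p = 0.0
    optimizer = torch.optim.SGD(model.parameters(), lr=lr)  # small learning rate
    for epoch in range(epochs):
        train_loss = sum(train_step(model, optimizer, x, y)
                         for x, y in subset_batches) / len(subset_batches)
        val_loss = evaluate(model, val_batches)    # assumed helper: mean cross-entropy
        # Diverging curves signal the intended overfitting (cf. Figure 2).
        print(f"epoch {epoch}: train ppl {math.exp(train_loss):.1f}, "
              f"val ppl {math.exp(val_loss):.1f}")
```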

Figure 2: Example of overfitting learning curves during fine-tuning of a full model on a subset of novels by Isaac Asimov.

4 User Interface

Uncharacteristically for an NLP project, the visual interface of the system is paramount to its success. Though ultimately the system will be assessed on the language it produces, such language can only be generated if the writer is able to use the system. Therefore, we have focused on functionalities that give the user a clear representation of how text is generated, and allow them to understand how their own writing is affected by the process. This allows them to play a defined role within the process of writing, whilst also encouraging them to use generated text. The user is able to select which model to generate text from, so that they can use multiple voices and approaches within the same text (see Section 3.2.4). The user can set the ‘temperature’ for any model using a slider bar (see Section 3.2). The generated text itself is shown as a list of suggestions, along with the model’s own probability scores, below the main text area. This allows the writer to choose between a set of options, and to get a broader idea of the model’s voice. With the help of user-edit meta-data, computed by a string diffing algorithm, we can track user changes and present them with a visualization of each fragment’s source and the degree of its modification.

Figure 3 is a visual representation of how text annotation functions. Text annotation reveals to the writer how generated text is affecting the final text. As the writer works on the text, they could easily lose track of its source; therefore, the interface is enhanced with visual feedback that highlights text based on the edit distance between the originally generated text and its current state. Generated text is initially highlighted in green; as the writer edits that text, its color fades into white. At the same time, whole words swapped by the writer are underlined in purple to differentiate lexical changes (see Figure 3).

Figure 3: Visual feedback on the co-creation process used by Ronald Giphart. Highlighted fragments are synthetic in origin, with the brightness indicating the amount of modification introduced by the user.
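The fade can be driven by any string-similarity measure. A minimal sketch using Python’s standard difflib (our illustration only; the interface’s actual diffing code is not shown here):

```python
from difflib import SequenceMatcher

def modification_level(generated: str, current: str) -> float:
    """Return 0.0 for untouched generated text, 1.0 for fully rewritten.

    The complement of difflib's similarity ratio serves as a cheap
    edit-distance proxy to drive the green-to-white fade.
    """
    return 1.0 - SequenceMatcher(None, generated, current).ratio()

# e.g. map the score onto the opacity of the green highlight
alpha = 1.0 - modification_level("De dokter keek op", "De dokter keek even op")
```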

4.1 Monitoring the Author

Our project intends to explore the co-creative process of science fiction literature on a quantitative and objective basis. In line with Roemmele and Gordon (2015), we acknowledge that such a co-creative interface opens up the possibility of automatic evaluation of generative systems based on user edits of generated strings. Our interface is therefore designed to store all user edits along with the source of the string (human or machine generated). This will enable us to study individual user behavior in relation to the particular properties of the generative system, as well as the aptness of different model variants and their parameter settings (e.g. degree of overfit, temperature, voice) for co-creation, taking user edit behavior as a proxy for output quality.
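One plausible shape for such a logged edit record is sketched below; the field names and the JSON-lines storage are our assumptions, not the system’s actual schema.

```python
from dataclasses import dataclass, asdict
import json

@dataclass
class EditRecord:
    """A single logged edit, tagged with the provenance of the text."""
    fragment_id: int
    source: str          # "human" or "machine"
    model: str           # e.g. "full", "asimov", "giphart" (hypothetical labels)
    temperature: float   # tau used when the fragment was generated
    original: str        # text as generated or first typed
    edited: str          # text after the user's edit
    timestamp: float

def log_edit(record: EditRecord, path: str = "edits.jsonl") -> None:
    """Append one edit record as a JSON line."""
    with open(path, "a", encoding="utf-8") as f:
        f.write(json.dumps(asdict(record)) + "\n")
```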

4.2 Examples

As explained in the previous section, the evaluation of an NLG system in a co-creative setting involving both human and machine amounts to assessing the generated material incorporated (either explicitly or implicitly) by the author in the final work. Suggestions on how to formally and informally evaluate this co-production process were given in the previous section. Here, we provide an exploratory demonstration of the generation system, where the goal is to highlight some typical behavior of the system under different parameter settings, author-based fine-tuning, and text seeds.

We begin with a survey of how different temperature values τ impact the generated text. In Table 1 we list a number of generated sentences for different temperatures, given the famous opening sentence “Mijn vrouw is dood en al begraven” (‘My wife is dead and already buried’) from Marcellus Emants’ Een nagelaten bekentenis. These examples have been generated on the basis of a model trained on the complete collection as described in Section 3.1. Several observations can be made from these examples. First, the examples showcase the model’s ability to generate grammatical sentences. Interestingly, the sentences become syntactically more complex as the temperature increases (as explained in Section 3.2), and also exhibit less repetitive grammatical structures (cf. the repetition of possessive van constructions with τ = 0.1). Second, a similar diversifying effect can be observed on the lexical level. Higher temperatures are more likely to generate less frequent and semantically more specific lexical material (cf. the decrease in reusing the same word twice, as well as the increase of names in the final two examples). Third, depending on the reader’s willingness to search for semantic coherence, most generated sentences can be read as plausible continuations of the input seed. Finally, with increased temperature values the model composes sentences with higher semantic and cognitive complexity, juxtaposing and combining less common concepts and thoughts. Within certain limits, we expect such increased ‘creativity’ to encourage and provoke the human writer in the process of writing.


Seed: “Mijn vrouw is dood en al begraven.” (‘My wife is dead and already buried.’)

τ = 0.1: De volgende dag werd er een bericht van de politie gevonden. De verklaring van de politie was een van de belangrijkste verklaringen van de man die het verhaal van de politie had gehoord. (‘The next day, a police message was found. The police’s statement was one of the most important statements of the man who heard the story by the police.’)

τ = 0.3: De dokter keek op zijn horloge. Ze was er zeker van dat hij die dag niet meer zou komen. Hij was niet gek. Hij was niet van plan om zijn dochter te vertellen dat hij haar een keer had gezien. (‘The doctor looked at his watch. She was sure he wouldn’t come anymore that day. He wasn’t crazy. He did not intend to tell his daughter he’d seen her once.’)

τ = 0.5: De tranen weerklonken over haar gezicht en voelde ze in haar keel. Ze was er niet. Ze was er zeker van dat ze alleen maar een affaire had gehad met de meisjes. (‘The tears resounded over her face and she felt them in her throat. She wasn’t there. She was sure she had only had an affair with the girls.’)

τ = 0.7: Ze zegt dat het weer in San Sebastian naar Californië haar deel van de wereld is. Ze moet van de wereld naar de hemel hebben gelopen. (‘She says that the weather in San Sebastian to California is her part of the world. She must have walked from the world to heaven.’)

τ = 0.9: Lucas kon niet met zichzelf trouwen en toen hij achteloos met zijn gebaren zat te praten maakte hij deel uit van de lessen van de waarheid. (‘Lucas couldn’t marry himself, and when he spoke carelessly with his gestures, he was part of the lessons of truth.’)

Table 1: Output of our current NLG system, with translations, seeded by “Mijn vrouw is dood en al begraven” (‘My wife is dead and already buried’) for different temperature values τ.

Having explored the impact of temperature on the full model’s output, we now proceed with a brief illustration and informal evaluation of the generated output of two fine-tuned models. As explained in Section 3.2.4, we experiment with constructing fine-tuned models for specific styles, genres or authors by post-training on a subset of the collection and driving the training towards overfitting. In this section, we demonstrate the effect of overfitting two models post-trained on novels by Isaac Asimov and Ronald Giphart, who form the heart of the CPNB’s robotics campaign. Using the same seed as in Table 1, we observe a clear style shift when generating sentences using either the Asimov or the Giphart model. For example, with a temperature setting of τ = 0.4, the Asimov model produces utterances such as: “Mijn vrouw is dood en al begraven. ‘Het is de grootste misdaad die ik ooit heb gezien.’ ‘Weet u dat zeker?’ ‘Ja.’ ‘En als dat zo is, wat is dan wel de waarheid?’” (‘My wife is dead and already buried. “It is the biggest crime I’ve ever seen.” “Are you sure?” “Yes.” “And if so, what is the truth?”’). By contrast, a model overfitted on novels by Giphart generates output such as: “Mijn vrouw is dood en al begraven. Ik heb het over een door mij gefotografeerde vrouw, een hoofd dat met haar borsten over mijn schouder ligt. Ik heb de film geschreven die ik mijn leven lang heb geleefd.” (‘My wife is dead and already buried. I’m talking about a woman I photographed, a head that lies with her breasts over my shoulder. I wrote the film that I have lived all my life.’) Both continuations are semantically plausible, yet written in completely different styles, and focus on different concepts (e.g. ‘crime’ versus ‘erotics’), both typical of their respective training material.

5 Conclusion

In this paper we have outlined an applied text generation system and a graphical user interface that together facilitate a co-creative environment in which to write science fiction literature. We have highlighted an existing challenge for state-of-the-art systems: balancing a challenging intervention in the writing process against the risk of becoming solely a writing tool. The character-level recurrent neural network for NLG that we have used is experimental within a solely computational approach, and we have therefore leveraged the specific advantages of working with a professional writer to maximize the system’s applicability. We have shown how to facilitate a writer in using a language model, and we have outlined evaluation procedures for the current NLG system, utilizing user-generated meta-data and quantifying the extent of retained synthetic text.

Acknowledgments

We would like to thank Ronald Giphart for his time and energy, and Stichting Collectieve Propaganda van het Nederlandse Boek for initiating the collaboration.

References

Emily Ahn, Fabrizio Morbini, and Andrew S. Gordon. 2016. Improving Fluency in Narrative Text Generation with Grammatical Transformations and Probabilistic Parsing. In Proceedings of the 9th International Natural Language Generation Conference, pages 70–74, Edinburgh, UK. ACL.

Yoshua Bengio, Réjean Ducharme, Pascal Vincent, and Christian Janvin. 2003. A Neural Probabilistic Language Model. The Journal of Machine Learning Research, 3:1137–1155.

Linda Candy and Ernest Edmonds. 2002. Modeling co-creativity in art and technology. In Proceedings of the 4th Conference on Creativity & Cognition, pages 134–141, Loughborough, UK. ACM.

Stanley F. Chen and Joshua Goodman. 1999. An empirical study of smoothing techniques for language modeling. Computer Speech and Language, 13:359–394.

Kyunghyun Cho, Bart van Merriënboer, Dzmitry Bahdanau, and Yoshua Bengio. 2014. On the Properties of Neural Machine Translation: Encoder–Decoder Approaches. In Proceedings of SSST 2014, pages 103–111.

Nicholas Davis. 2013. Human-computer co-creativity: Blending human and computational creativity. In Ninth Artificial Intelligence and Interactive Digital Entertainment Conference. AAAI Press.

Umberto Eco. 2015. How to Write a Thesis. MIT Press.

Ernest A. Edmonds, Alastair Weakley, Linda Candy, Mark Fell, Roger Knott, and Sandra Pauletto. 2005. The studio as laboratory: Combining creative practice and digital technology research. International Journal of Human-Computer Studies, 63(4-5):452–481.

Jeffrey L. Elman. 1990. Finding structure in time. Cognitive Science, 14(2):179–211.

Sepp Hochreiter and Jürgen Schmidhuber. 1997. Long Short-Term Memory. Neural Computation, 9(8):1735–1780.

Sepp Hochreiter. 1998. The Vanishing Gradient Problem During Learning Recurrent Neural Nets and Problem Solutions. International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems, 6(2):107–116.

Anna Jordanous. 2017. Co-creativity and perceptions of computational agents in co-creativity. In Proceedings of the Eighth International Conference on Computational Creativity, Atlanta, US. ACC.

Todd Lubart. 2005. How can computers be partners in the creative process: Classification and commentary on the Special Issue. International Journal of Human-Computer Studies, 63(4-5):365–369.

Mary Lou Maher. 2012. Computational and Collective Creativity: Who's Being Creative? In Proceedings of the 3rd International Conference on Computational Creativity, pages 67–71, Dublin, Ireland. ACC.

Neil McIntyre and Mirella Lapata. 2009. Learning to tell tales: A data-driven approach to story generation. In Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP, pages 217–225, Singapore. Association for Computational Linguistics.

James R. Meehan. 1977. TALE-SPIN: An interactive program that writes stories. In Proceedings of the 5th International Joint Conference on Artificial Intelligence, pages 91–98.

Tomas Mikolov. 2012. Statistical Language Models Based on Neural Networks. Ph.D. thesis, Brno University of Technology.

Razvan Pascanu, Tomas Mikolov, and Yoshua Bengio. 2013. On the Difficulty of Training Recurrent Neural Networks. In Proceedings of ICML, pages 1–9.

Melissa Roemmele and Andrew S. Gordon. 2015. Creative Help: A story writing assistant. In International Conference on Interactive Digital Storytelling, pages 81–92. Springer.

Ronald Rosenfeld. 2000. Two decades of statistical language modeling: Where do we go from here? Proceedings of the IEEE, 88(8):1270–1278.

David E. Rumelhart, Geoffrey E. Hinton, and Ronald J. Williams. 1986. Learning representations by back-propagating errors. Nature, 323(6088):533–536.

Nitish Srivastava, Geoffrey Hinton, Alex Krizhevsky, Ilya Sutskever, and Ruslan Salakhutdinov. 2014. Dropout: A Simple Way to Prevent Neural Networks from Overfitting. Journal of Machine Learning Research, 15:1929–1958.

Maarten van Gompel, Ko van der Sloot, and Antal van den Bosch. 2012. Ucto: Unicode Tokeniser. Reference Guide. Technical report.

Greg Williams. 2011. EPUB: Primer, Preview, and Prognostications. Collection Management, 36(3):182–191.

Wojciech Zaremba, Ilya Sutskever, and Oriol Vinyals. 2015. Recurrent Neural Network Regularization. ICLR, pages 1–8.