Dekker & Barok
Copying as a Way to Start Something New A Conversation with Dusan Barok about Monoskop
2017


COPYING AS A WAY TO START SOMETHING NEW
A Conversation with Dusan Barok about Monoskop

Annet Dekker

Dusan Barok is an artist, writer, and cultural activist involved
in critical practice in the fields of software, art, and theory. After founding and organizing the online culture portal
Koridor in Slovakia from 1999–2002, in 2003 he co-founded
the BURUNDI media lab where he organized the Translab
evening series. A year later, the first ideas about building an
online platform for texts and media started to emerge and
Monoskop became a reality. More than a decade later, Barok
is well-known as the main editor of Monoskop. In 2016, he
began a PhD research project at the University of Amsterdam. His project, titled Database for the Documentation of
Contemporary Art, investigates art databases as discursive
platforms that provide context for artworks. In an extended
email exchange, we discuss the possibilities and restraints
of an online ‘archive’.
ANNET DEKKER

You started Monoskop in 2004, already some time ago. What
does the name mean?
DUSAN BAROK

‘Monoskop’ is the Slovak equivalent of the English ‘monoscope’, which means an electric tube used in analogue TV
broadcasting to produce images of test cards, station logotypes, error messages but also for calibrating cameras. Monoscopes were automatized television announcers designed to
speak to both live and machine audiences about the status
of a channel, broadcasting purely phatic messages.
AD
Can you explain why you wanted to do the project and how it
developed to what it is now? In other words, what were your
main aims and have they changed? If so, in which direction
and what caused these changes?
DB

I began Monoskop as one of the strands of the BURUNDI
media lab in Bratislava. Originally, it was designed as a wiki
website for documenting media art and culture in the eastern part of Europe, whose backbone consisted of city entries
composed of links to separate pages about various events,

212

LOST AND LIVING (IN) ARCHIVES

initiatives, and individuals. In the early days it was modelled
on Wikipedia (which had been running for two years when
Monoskop started) and contained biographies and descriptions of events from a kind of neutral point of view. Over
the years, the geographic and thematic boundaries have
gradually expanded to embrace the arts and humanities in
their widest sense, focusing primarily on lesser-known
1
phenomena.1 Perhaps the biggest change is the ongoing
See for example
shift from mapping people, events, and places towards
https://monoskop.org/
Features. Accessed
synthesizing discourses.
28 May 2016.
A turning point occurred during my studies at the
Piet Zwart Institute, in the Networked Media programme
from 2010–2012, which combined art, design, software,
and theory with support in the philosophy of open source
and prototyping. While there, I was researching aspects of
the networked condition and how it transforms knowledge,
sociality and economics: I wrote research papers on leaking
as a technique of knowledge production, a critique of the
social graph, and on the libertarian values embedded in the
design of digital currencies. I was ready for more practice.
When Aymeric Mansoux, one of the tutors, encouraged me
to develop my then side-project Monoskop into a graduation
work, the timing was good.
The website got its own domain, a redesign, and most
crucially, the Monoskop wiki was restructured from its
2
focus on media art and culture towards the much wider
https://monoskop.org/
embrace
of the arts and humanities. It turned to a media
Symposium. Accessed
28 May 2016.
library of sorts. The graduation work also consisted of
a symposium about personal collecting and media ar3
chiving,2 which saw its loose follow-ups on media aeshttps://monoskop.org/
thetics (in Bergen)3 and on knowledge classification and
The_Extensions_of_
Many. Accessed
archives (in Mons)4 last year.
28 May 2016.

AD

https://monoskop.org/
Ideographies_of_
Knowledge. Accessed
28 May 2016.

Did you have a background in library studies, or have
you taken their ideas/methods of systemization and categorization (meta data)? If not, what are your methods
and how did you develop them?

213

COPYING AS A WAY TO START SOMETHING NEW

4

been an interesting process, clearly showing the influence
of a changing back-end system. Are you interested in the
idea of sharing and circulating texts as a new way not just
of accessing and distributing but perhaps also of production—and publishing? I’m thinking how Aaaaarg started as
a way to share and exchange ideas about a text. In what
way do you think Monoskop plays (or could play) with these
kinds of mechanisms? Do you think it brings out a new
potential in publishing?

DB

Besides the standard literature in information science (I
have a degree in information technologies), I read some
works of documentation scientists Paul Otlet and Suzanne
Briet, historians such as W. Boyd Rayward and Ronald E.
Day, as well as translated writings of Michel Pêcheux and
other French discourse analysts of the 1960s and 1970s.
This interest was triggered in late 2014 by the confluence
of Femke’s Mondotheque project and an invitation to be an
artist-in-residence in Mons in Belgium at the Mundaneum,
home to Paul Otlet’s recently restored archive.
This led me to identify three tropes of organizing and
navigating written records, which has guided my thinking
about libraries and research ever since: class, reference,
and index. Classification entails tree-like structuring, such
as faceting the meanings of words and expressions, and
developing classification systems for libraries. Referencing
stands for citations, hyperlinking and bibliographies. Indexing ranges from the listing of occurrences of selected terms
to an ‘absolute’ index of all terms, enabling full-text search.
With this in mind, I have done a number of experiments.
There is an index of selected persons and terms from
5
across the Monoskop wiki and Log.5 There is a growing
https://monoskop.org/
list of wiki entries with bibliographies and institutional
Index. Accessed
28 May 2016.
infrastructures of fields and theories in the humanities.6
There is a lexicon aggregating entries from some ten
6
dictionaries of the humanities into a single page with
https://monoskop.org/
hyperlinks to each full entry (unpublished). There is an
Humanities. Accessed
28 May 2016.
alternative interface to the Monoskop Log, in which entries are navigated solely through a tag cloud acting as
a multidimensional filter (unpublished). There is a reader
containing some fifty books whose mutual references are
turned into hyperlinks, and whose main interface consists
of terms specific to each text, generated through tf-idf algorithm (unpublished). And so on.

DB

The publishing market frames the publication as a singular
body of work, autonomous from other titles on offer, and
subjects it to the rules of the market—with a price tag and
copyright notice attached. But for scholars and artists, these
are rarely an issue. Most academic work is subsidized from
public sources in the first place, and many would prefer to
give their work away for free since openness attracts more
citations. Why they opt to submit to the market is for quality
editing and an increase of their own symbolic value in direct
proportion to the ranking of their publishing house. This
is not dissimilar from the music industry. And indeed, for
many the goal is to compose chants that would gain popularity across academia and get their place in the popular
imagination.
On the other hand, besides providing access, digital
libraries are also fit to provide context by treating publications as a corpus of texts that can be accessed through an
unlimited number of interfaces designed with an understanding of the functionality of databases and an openness
to the imagination of the community of users. This can
be done by creating layers of classification, interlinking
bodies of texts through references, creating alternative
indexes of persons, things and terms, making full-text
search possible, making visual search possible—across
the whole of corpus as well as its parts, and so on. Isn’t
this what makes a difference? To be sure, websites such
as Aaaaarg and Monoskop have explored only the tip of

AD

Indeed, looking at the archive in many alternative ways has

214

LOST AND LIVING (IN) ARCHIVES

215

COPYING AS A WAY TO START SOMETHING NEW

the iceberg of possibilities. There is much more to tinker
and hack around.

within a given text and within a discourse in which it is
embedded. What is specific to digital text, however, is that
we can search it in milliseconds. Full-text search is enabled
by the index—search engines operate thanks to bots that
assign each expression a unique address and store it in a
database. In this respect, the index usually found at the
end of a printed book is something that has been automated
with the arrival of machine search.
In other words, even though knowledge in the age of the
internet is still being shaped by the departmentalization of
academia and its related procedures and rituals of discourse
production, and its modes of expression are centred around
the verbal rhetoric, the flattening effects of the index really
transformed the ways in which we come to ‘know’ things.
To ‘write’ a ‘book’ in this context is to produce a searchable
database instead.

AD

It is interesting that whilst the accessibility and search potential has radically changed, the content, a book or any other
text, is still a particular kind of thing with its own characteristics and forms. Whereas the process of writing texts seems
hard to change, would you be interested in creating more
alliances between texts to bring out new bibliographies? In
this sense, starting to produce new texts, by including other
texts and documents, like emails, visuals, audio, CD-ROMs,
or even un-published texts or manuscripts?
DB

Currently Monoskop is compiling more and more ‘source’
bibliographies, containing digital versions of actual texts
they refer to. This has been very much in focus in the past
two or three years and Monoskop is now home to hundreds
of bibliographies of twentieth-century artists, writers, groups,
and movements as well as of various theories and human7
ities disciplines.7 As the next step I would like to move
See for example
on to enabling full-text search within each such biblioghttps://monoskop.
org/Foucault,
raphy. This will make more apparent that the ‘source’
https://monoskop.
bibliography
is a form of anthology, a corpus of texts
org/Lissitzky,
https://monoskop.
representing a discourse. Another issue is to activate
org/Humanities.
cross-references
within texts—to turn page numbers in
All accessed
28 May 2016.
bibliographic citations inside texts into hyperlinks leading
to other texts.
This is to experiment further with the specificity of digital text. Which is different both to oral speech and printed
books. These can be described as three distinct yet mutually
encapsulated domains. Orality emphasizes the sequence
and narrative of an argument, in which words themselves
are imagined as constituting meaning. Specific to writing,
on the other hand, is referring to the written record; texts
are brought together by way of references, which in turn
create context, also called discourse. Statements are ‘fixed’
to paper and meaning is constituted by their contexts—both

216

LOST AND LIVING (IN) ARCHIVES

AD

So, perhaps we finally have come to ‘the death of the author’,
at least in so far as that automated mechanisms are becoming active agents in the (re)creation process. To return to
Monoskop in its current form, what choices do you make
regarding the content of the repositories, are there things
you don’t want to collect, or wish you could but have not
been able to?
DB

In a sense, I turned to a wiki and started Monoskop as
a way to keep track of my reading and browsing. It is a
by-product of a succession of my interests, obsessions, and
digressions. That it is publicly accessible is a consequence
of the fact that paper notebooks, text files kept offline and
private wikis proved to be inadequate at the moment when I
needed to quickly find notes from reading some text earlier.
It is not perfect, but it solved the issue of immediate access
and retrieval. Plus there is a bonus of having the body of
my past ten or twelve years of reading mutually interlinked
and searchable. An interesting outcome is that these ‘notes’
are public—one is motivated to formulate and frame them

217

COPYING AS A WAY TO START SOMETHING NEW

as to be readable and useful for others as well. A similar
difference is between writing an entry in a personal diary
and writing a blog post. That is also why the autonomy
of technical infrastructure is so important here. Posting
research notes on Facebook may increase one’s visibility
among peers, but the ‘terms of service’ say explicitly that
anything can be deleted by administrators at any time,
without any reason. I ‘collect’ things that I wish to be able
to return to, to remember, or to recollect easily.
AD

Can you describe the process, how do you get the books,
already digitized, or do you do a lot yourself? In other words,
could you describe the (technical) process and organizational aspects of the project?
DB

In the beginning, I spent a lot of time exploring other digital
libraries which served as sources for most of the entries on
Log (Gigapedia, Libgen, Aaaaarg, Bibliotik, Scribd, Issuu,
Karagarga, Google filetype:pdf). Later I started corresponding with a number of people from around the world (NYC,
Rotterdam, Buenos Aires, Boulder, Berlin, Ploiesti, etc.) who
contribute scans and links to scans on an irregular basis.
Out-of-print and open-access titles often come directly from
authors and publishers. Many artists’ books and magazines
were scraped or downloaded through URL manipulation
from online collections of museums, archives and libraries.
Needless to say, my offline archive is much bigger than
what is on Monoskop. I tend to put online the files I prefer
not to lose. The web is the best backup solution I have
found so far.
The Monoskop wiki is open for everyone to edit; any user
can upload their own works or scans and many do. Many of
those who spent more time working on the website ended up
being my friends. And many of my friends ended up having
an account as well :). For everyone else, there is no record
kept about what one downloaded, what one read and for
how long... we don’t care, we don’t track.

218

LOST AND LIVING (IN) ARCHIVES

AD

In what way has the larger (free) publishing context changed
your project, there are currently several free texts sharing
initiatives around (some already before you started like Textz.
com or Aaaaarg), how do you collaborate, or distinguish
from each other?
DB

It should not be an overstatement to say that while in the
previous decade Monoskop was shaped primarily by the
‘media culture’ milieu which it intended to document, the
branching out of its repository of highlighted publications
Monoskop Log in 2009, and the broadening of its focus to
also include the whole of the twentieth and twenty-first
century situates it more firmly in the context of online
archives, and especially digital libraries.
I only got to know others in this milieu later. I approached
Sean Dockray in 2010, Marcell Mars approached me the
following year, and then in 2013 he introduced me to Kenneth Goldsmith. We are in steady contact, especially through
public events hosted by various cultural centres and galleries.
The first large one was held at Ljubljana’s hackerspace Kiberpipa in 2012. Later came the conferences and workshops
organized by Kuda at a youth centre in Novi Sad (2013), by
the Institute of Network Cultures at WORM, Rotterdam (2014),
WKV and Akademie Schloss Solitude in Stuttgart (2014),
Mama & Nova Gallery in Zagreb (2015), ECC at Mundaneum,
Mons (2015), and most recently by the Media Department
8
of the University of Malmo (2016).8
For more information see,
The leitmotif of all these events was the digital library
https://monoskop.org/
Digital_libraries#
and their atmosphere can be described as the spirit of
Workshops_and_
early
hacker culture that eventually left the walls of a
conferences.
Accessed 28 May 2016.
computer lab. Only rarely there have been professional
librarians, archivists, and publishers among the speakers, even though the voices represented were quite diverse.
To name just the more frequent participants... Marcell
and Tom Medak (Memory of the World) advocate universal
access to knowledge informed by the positions of the Yugoslav

219

COPYING AS A WAY TO START SOMETHING NEW

Marxist school Praxis; Sean’s work is critical of the militarization and commercialization of the university (in the
context of which Aaaaarg will always come as secondary, as
an extension of The Public School in Los Angeles); Kenneth
aims to revive the literary avant-garde while standing on the
shoulders of his heroes documented on UbuWeb; Sebastian
Lütgert and Jan Berger are the most serious software developers among us, while their projects such as Textz.com and
Pad.ma should be read against critical theory and Situationist cinema; Femke Snelting has initiated the collaborative
research-publication Mondotheque about the legacy of the
early twentieth century Brussels-born information scientist
Paul Otlet, triggered by the attempt of Google to rebrand him
as the father of the internet.
I have been trying to identify implications of the digital-networked textuality for knowledge production, including humanities research, while speaking from the position
of a cultural worker who spent his formative years in the
former Eastern Bloc, experiencing freedom as that of unprecedented access to information via the internet following
the fall of Berlin Wall. In this respect, Monoskop is a way
to bring into ‘archival consciousness’ what the East had
missed out during the Cold War. And also more generally,
what the non-West had missed out in the polarized world,
and vice versa, what was invisible in the formal Western
cultural canons.
There have been several attempts to develop new projects,
and the collaborative efforts have materialized in shared
infrastructure and introductions of new features in respective platforms, such as PDF reader and full-text search on
Aaaaarg. Marcell and Tom along with their collaborators have
been steadily developing the Memory of the World library and
Sebastian resuscitated Textz.com. Besides that, there are
overlaps in titles hosted in each library, and Monoskop bibliographies extensively link to scans on Libgen and Aaaaarg,
while artists’ profiles on the website link to audio and video
recordings on UbuWeb.

220

LOST AND LIVING (IN) ARCHIVES

AD

It is interesting to hear that there weren’t any archivist or
professional librarians involved (yet), what is your position
towards these professional and institutional entities and
persons?
DB

As the recent example of Sci-Hub showed, in the age of
digital networks, for many researchers libraries are primarily free proxies to corporate repositories of academic
9
journals.9 Their other emerging role is that of a digital
For more information see,
repository of works in the public domain (the role piowww.sciencemag.org/
news/2016/04/whosneered in the United States by Project Gutenberg and
downloading-piratedInternet Archive). There have been too many attempts
papers-everyone.
Accessed 28 May 2016.
to transpose librarians’ techniques from the paperbound
world into the digital domain. Yet, as I said before, there
is much more to explore. Perhaps the most exciting inventive approaches can be found in the field of classics, for
example in the Perseus Digital Library & Catalog and the
Homer Multitext Project. Perseus combines digital editions
of ancient literary works with multiple lexical tools in a way
that even a non-professional can check and verify a disputable translation of a quote. Something that is hard to
imagine being possible in print.
AD

I think it is interesting to see how Monoskop and other
repositories like it have gained different constituencies
globally, for one you can see the kind of shift in the texts
being put up. From the start you tried to bring in a strong
‘eastern European voice’, nevertheless at the moment the
content of the repository reflects a very western perspective on critical theory, what are your future goals. And do
you think it would be possible to include other voices? For
example, have you ever considered the possibility of users
uploading and editing texts themselves?
DB

The site certainly started with the primary focus on east-central European media art and culture, which I considered

221

COPYING AS A WAY TO START SOMETHING NEW

myself to be part of in the early 2000s. I was naive enough
to attempt to make a book on the theme between 2008–2010.
During that period I came to notice the ambivalence of the
notion of medium in an art-historical and technological
sense (thanks to Florian Cramer). My understanding of
media art was that it is an art specific to its medium, very
much in Greenbergian terms, extended to the more recent
‘developments’, which were supposed to range from neo-geometrical painting through video art to net art.
At the same time, I implicitly understood art in the sense
of ‘expanded arts’, as employed by the Fluxus in the early
1960s—objects as well as events that go beyond the (academic) separation between the arts to include music, film,
poetry, dance, design, publishing, etc., which in turn made
me also consider such phenomena as experimental film,
electro-acoustic music and concrete poetry.
Add to it the geopolitically unstable notion of East-Central
Europe and the striking lack of research in this area and
all you end up with is a headache. It took me a while to
realize that there’s no point even attempting to write a coherent narrative of the history of media-specific expanded
arts of East-Central Europe of the past hundred years. I
ended up with a wiki page outlining the supposed mile10
stones along with a bibliography.10
https://monoskop.
For this strand, the wiki served as the main notebook,
org/CEE. Accessed
28 May 2016. And
leaving behind hundreds of wiki entries. The Log was
https://monoskop.
more or less a ‘log’ of my research path and the presence
org/Central_and_
Eastern_Europe_
of ‘western’ theory is to a certain extent a by-product of
Bibliography.
my search for a methodology and theoretical references.
Accessed 28 May 2016.
As an indirect outcome, a new wiki section was
launched recently. Instead of writing a history of mediaspecific ‘expanded arts’ in one corner of the world, it takes
a somewhat different approach. Not a sequential text, not
even an anthology, it is an online single-page annotated
index, a ‘meta-encyclopaedia’ of art movements and styles,
intended to offer an expansion of the art-historical canonical
prioritization of the western painterly-sculptural tradition

222

LOST AND LIVING (IN) ARCHIVES

11

https://monoskop.
org/Art. Accessed
28 May 2016.

to also include other artists and movements around the
world.11
AD

Can you say something about the longevity of the project?
You briefly mentioned before that the web was your best
backup solution. Yet, it is of course known that websites
and databases require a lot of maintenance, so what will
happen to the type of files that you offer? More and more
voices are saying that, for example, the PDF format is all
but stable. How do you deal with such challenges?
DB

Surely, in the realm of bits, nothing is designed to last
forever. Uncritical adoption of Flash had turned out to be
perhaps the worst tragedy so far. But while there certainly
were more sane alternatives if one was OK with renouncing its emblematic visual effects and aesthetics that went
with it, with PDF it is harder. There are EPUBs, but scholarly publications are simply unthinkable without page
numbers that are not supported in this format. Another
challenge the EPUB faces is from artists' books and other
design- and layout-conscious publications—its simplified
HTML format does not match the range of possibilities for
typography and layout one is used to from designing for
paper. Another open-source solution, PNG tarballs, is not
a viable alternative for sharing books.
The main schism between PDF and HTML is that one represents the domain of print (easily portable, and with fixed
page size), while the other the domain of web (embedded
within it by hyperlinks pointing both directions, and with
flexible page size). EPUB is developed with the intention of
synthetizing both of them into a single format, but instead
it reduces them into a third container, which is doomed to
reinvent the whole thing once again.
It is unlikely that there will appear an ultimate convertor
between PDF and HTML, simply because of the specificities
of print and the web and the fact that they overlap only in
some respects. Monoskop tends to provide HTML formats

223

COPYING AS A WAY TO START SOMETHING NEW

next to PDFs where time allows. And if the PDF were to
suddenly be doomed, there would be a big conversion party.
On the side of audio and video, most media files on
Monoskop are in open formats—OGG and WEBM. There
are many other challenges: keeping up-to-date with PHP
and MySQL development, with the MediaWiki software
and its numerous extensions, and the mysterious ICANN
organization that controls the web domain.

as an imperative to us to embrace redundancy, to promote
spreading their contents across as many nodes and sites
as anyone wishes. We may look at copying not as merely
mirroring or making backups, but opening up for possibilities to start new libraries, new platforms, new databases.
That is how these came about as well. Let there be Zzzzzrgs,
Ůbuwebs and Multiskops.

AD

What were your biggest challenges beside technical ones?
For example, have you ever been in trouble regarding copyright issues, or if not, how would you deal with such a
situation?
DB

Monoskop operates on the assumption of making transformative use of the collected material. The fact of bringing
it into certain new contexts, in which it can be accessed,
viewed and interpreted, adds something that bookstores
don’t provide. Time will show whether this can be understood as fair use. It is an opt-out model and it proves to
be working well so far. Takedowns are rare, and if they are
legitimate, we comply.
AD

Perhaps related to this question, what is your experience
with users engagement? I remember Sean (from Aaaaarg,
in conversation with Matthew Fuller, Mute 2011) saying
that some people mirror or download the whole site, not
so much in an attempt to ‘have everything’ but as a way
to make sure that the content remains accessible. It is a
conscious decision because one knows that one day everything might be taken down. This is of course particularly
pertinent, especially since while we’re doing this interview
Sean and Marcell are being sued by a Canadian publisher.
DB

That is absolutely true and any of these websites can disappear any time. Archives like Aaaaarg, Monoskop or UbuWeb
are created by makers rather than guardians and it comes

224

LOST AND LIVING (IN) ARCHIVES

225

COPYING AS A WAY TO START SOMETHING NEW

Bibliography
Fuller, Matthew. ‘In the Paradise of Too Many Books: An Interview with
Sean Dockray’. Mute, 4 May 2011. www.metamute.org/editorial/

articles/paradise-too-many-books-interview-seandockray. Accessed 31 May 2016.
Online digital libraries
Aaaaarg, http://aaaaarg.fail.
Bibliotik, https://bibliotik.me.
Issuu, https://issuu.com.
Karagarga, https://karagarga.in.
Library Genesis / LibGen, http://gen.lib.rus.ec.
Memory of the World, https://library.memoryoftheworld.org.
Monoskop, https://monoskop.org.
Pad.ma, https://pad.ma.
Scribd, https://scribd.com.
Textz.com, https://textz.com.
UbuWeb, www.ubu.com.

226

LOST AND LIVING (IN) ARCHIVES

227

COPYING AS A WAY TO START SOMETHING NEW

Barok
Poetics of Research
2014


_An unedited version of a talk given at the conference[Public
Library](http://www.wkv-stuttgart.de/en/program/2014/events/public-library/)
held at Württembergischer Kunstverein Stuttgart, 1 November 2014._

_Bracketed sequences are to be reformulated._

Poetics of Research

In this talk I'm going to attempt to identify [particular] cultural
algorithms, ie. processes in which cultural practises and software meet. With
them a sphere is implied in which algorithms gather to form bodies of
practices and in which cultures gather around algorithms. I'm going to
approach them through the perspective of my practice as a cultural worker,
editor and artist, considering practice in the same rank as theory and
poetics, and where theorization of practice can also lead to the
identification of poetical devices.

The primary motivation for this talk is an attempt to figure out where do we
stand as operators, users [and communities] gathering around infrastructures
containing a massive body of text (among other things) and what sort of things
might be considered to make a difference [or to keep making difference].

The talk mainly [considers] the role of text and the word in research, by way
of several figures.

A

A reference, list, scheme, table, index; those things that intervene in the
flow of narrative, illustrating the point, perhaps in a more economic way than
the linear text would do. Yet they don't function as pictures, they are
primarily texts, arranged in figures. Their forms have been
standardised[normalised] over centuries, withstood the transition to the
digital without any significant change, being completely intuitive to the
modern reader. Compared to the body of text they are secondary, run parallel
to it. Their function is however different to that of the punctuation. They
are there neither to shape the narrative nor to aid structuring the argument
into logical blocks. Nor is their function spatial, like in visual poems.
Their positions within a document are determined according to the sequential
order of the text, [standing as attachments] and are there to clarify the
nature of relations among elements of the subject-matter, or to establish
relations with other documents. The [premise] of my talk is that these
_textual figures_ also came to serve as the abstract[relational] models
determining possible relations among documents as such, and in consequence [to
structure conditions [of research]].

B

It can be said that research, as inquiry into a subject-matter, consists of
discrete queries. A query, such as a question about what something is, what
kinds, parts and properties does it have, and so on, can be consulted in
existing documents or generate new documents based on collection of data [in]
the field and through experiment, before proceeding to reasoning [arguments
and deductions]. Formulation of a query is determined by protocols providing
access to documents, which means that there is a difference between collecting
data outside the archive (the undocumented, ie. in the field and through
experiment), consulting with a person--an archivist (expert, librarian,
documentalist), and consulting with a database storing documents. The
phenomena such as [deepening] of specialization and throughout digitization
[have given] privilege to the database as [a|the] [fundamental] means for
research. Obviously, this is a very recent [phenomenon]. Queries were once
formulated in natural language; now, given the fact that databases are queried
[using] SQL language, their interfaces are mere extensions of it and
researchers pose their questions by manipulating dropdowns, checkboxes and
input boxes mashed together on a flat screen being ran by software that in
turn translates them into a long line of conditioned _SELECTs_ and _JOINs_
performed on tables of data.

Specialization, digitization and networking have changed the language of
questioning. Inquiry, once attached to the flesh and paper has been
[entrusted] to the digital and networked. Researchers are querying the black
box.

C

Searching in a collection of [amassed/assembled] [tangible] documents (ie.
bookshelf) is different from searching in a systematically structured
repository (library) and even more so from searching in a digital repository
(digital library). Not that they are mutually exclusive. One can devise
structures and algorithms to search through a printed text, or read books in a
library one by one. They are rather [models] [embodying] various [processes]
associated with the query. These properties of the query might be called [the
sequence], the structure and the index. If they are present in the ways of
querying documents, and we will return to this issue, are they persistent
within the inquiry as such? [wait]

D

This question itself is a rupture in the sequence. It makes a demand to depart
from one narrative [a continuous flow of words] to another, to figure out,
while remaining bound to it [it would be even more as a so-called rhetorical
question]. So there has been one sequence, or line, of the inquiry--about the
kinds of the query and its properties. That sequence itself is a digression,
from within the sequence about what is research and describing its parts
(queries). We are thus returning to it and continue with a question whether
the properties of the inquiry are the same as the properties of the query.

E

But isn't it true that every single utterance occurring in a sequence yields a
query as well? Let's consider the word _utterance_. [wait] It can produce a
number of associations, for example with how Foucault employs the notion of
_énoncé_ in his _Archaeology of Knowledge_ , giving hard time to his English
translators wondering whether _utterance_ or _statement_ is more appropriate,
or whether they are interchangeable, and what impact would each choice have on
his reception in the Anglophone world. Limiting ourselves to textual forms for
now (and not translating his work but pursing a different inquiry), let us say
the utterance is a word [or a phrase or an idiom] in a sequence such as a
sentence, a paragraph, or a document.

## (F) The
structure[[edit](/index.php?title=Talks/Poetics_of_Research&action=edit§ion=1
"Edit section: \(F\) The structure")]

This distinction is as old as recorded Western thought since both Plato and
Aristotle differentiate between a word on its own ("the said", a thing said)
and words in the company of other words. For example, Aristotle's _Categories_
[lay] on the [notion] of words on their own, and they are made the subject-
matter of that inquiry. [For him], the ambiguity of connotation words
[produce] lies in their synonymity, understood differently from the moderns--
not as more words denoting a similar thing but rather one word denoting
various things. Categories were outlined as a device to differentiate among
words according to kinds of these things. Every word as such belonged to not
less and not more than one of ten categories.

So it happens to the word _utterance_ , as to any other word uttered in a
sequence, that it poses a question, a query about what share of the spectrum
of possibly denoted things might yield as the most appropriate in a given
context. The more context the more precise share comes to the fore. When taken
out of the context ambiguity prevails as the spectrum unveils in its variety.

Thus single words [as any other utterances] are questions, queries,
themselves, and by occuring in statements, in context, their [means] are being
singled out.

This process is _conditioned_ by what has been formalized as the techniques of
_regulating_ definitions of words.

### (G) The structure: words as
words[[edit](/index.php?title=Talks/Poetics_of_Research&action=edit§ion=2
"Edit section: \(G\) The structure: words as words")]

* [![](/images/thumb/c/c8/Philitas_in_P.Oxy.XX_2260_i.jpg/144px-Philitas_in_P.Oxy.XX_2260_i.jpg)](/File:Philitas_in_P.Oxy.XX_2260_i.jpg)

P.Oxy.XX 2260 i: Oxyrhynchus papyrus XX, 2260, column i, with quotation from
Philitas, early 2nd c. CE. 1(http://163.1.169.40/cgi-
bin/library?e=q-000-00---0POxy--00-0-0--0prompt-10---4------0-1l--1-en-50---
20-about-2260--
00031-001-0-0utfZz-8-00&a=d&c=POxy&cl=search&d=HASH13af60895d5e9b50907367)
2(http://en.wikipedia.org/wiki/File:POxy.XX.2260.i-Philitas-
highlight.jpeg)

* [![](/images/thumb/9/9e/Cyclopaedia_1728_page_210_Dictionary_entry.jpg/88px-Cyclopaedia_1728_page_210_Dictionary_entry.jpg)](/File:Cyclopaedia_1728_page_210_Dictionary_entry.jpg)

Ephraim Chambers, _Cyclopaedia, or an Universal Dictionary of Arts and
Sciences_ , 1728, p. 210. 3(http://digicoll.library.wisc.edu/cgi-
bin/HistSciTech/HistSciTech-
idx?type=turn&entity=HistSciTech.Cyclopaedia01.p0576&id=HistSciTech.Cyclopaedia01&isize=L)

* [![](/images/thumb/b/b8/Detail_from_the_Liddell-Scott_Greek-English_Lexicon_c1843.jpg/160px-Detail_from_the_Liddell-Scott_Greek-English_Lexicon_c1843.jpg)](/File:Detail_from_the_Liddell-Scott_Greek-English_Lexicon_c1843.jpg)

Detail from the Liddell-Scott Greek-English Lexicon, c1843.

Dictionaries have had a long life. The ancient Greek scholar and poet Philitas
of Cos living in the 4th c. BCE wrote a vocabulary explaining the meanings of
rare Homeric and other literary words, words from local dialects, and
technical terms. The vocabulary, called _Disorderly Words_ (Átaktoi glôssai),
has been lost, with a few fragments quoted by later authors. One example is
that the word πέλλα (pélla) meant "wine cup" in the ancient Greek region of
Boeotia; contrasted to the same word meaning "milk pail" in Homer's _Iliad_.

Not much has changed in the way how dictionaries constitute order. Selected
archives of statements are queried to yield occurrences of particular words,
various _criteria[indicators]_ are applied to filtering and sorting them and
in turn the spectrum of [denoted] things allocated in this way is structured
into groups and subgroups which are then given, according to other set of
rules, shorter or longer names. These constitute facets of [potential]
meanings of a word.

So there are at least _four_ sets of conditions [structuring] dictionaries.
One is required to delimit an archive[corpus of texts], one to select and give
preference[weights] to occurrences of a word, another to cluster them, and yet
another to abstract[generalize] the subject-matter of each of these clusters.
Needless to say, this is a craft of a few and these criteria are rarely being
disclosed, despite their impact on research, and more generally, their
influence as conditions for production[making] of a so called _common sense_.

It doesn't take that much to reimagine what a dictionary is and what it could
be, especially having large specialized corpora of texts at hand. These can
also serve as aids in production of new words and new meanings.

### (H) The structure: words as knowledge and the
world[[edit](/index.php?title=Talks/Poetics_of_Research&action=edit§ion=3
"Edit section: \(H\) The structure: words as knowledge and the world")]

* [![](/images/thumb/0/02/Boethius_Porphyrys_Isagoge.jpg/120px-Boethius_Porphyrys_Isagoge.jpg)](/File:Boethius_Porphyrys_Isagoge.jpg)

Boethius's rendering of a classification tree described in Porphyry's Isagoge
(3th c.), [6th c.] 10th c.
4(http://www.e-codices.unifr.ch/en/sbe/0315/53/medium)

* [![](/images/thumb/d/d0/Cyclopaedia_1728_page_ii_Division_of_Knowledge.jpg/94px-Cyclopaedia_1728_page_ii_Division_of_Knowledge.jpg)](/File:Cyclopaedia_1728_page_ii_Division_of_Knowledge.jpg)

Ephraim Chambers, _Cyclopaedia, or an Universal Dictionary of Arts and
Sciences_ , London, 1728, p. II. 5(http://digicoll.library.wisc.edu/cgi-
bin/HistSciTech/HistSciTech-
idx?type=turn&entity=HistSciTech.Cyclopaedia01.p0015&id=HistSciTech.Cyclopaedia01&isize=L)

* [![](/images/thumb/d/d6/Encyclopedie_1751_Systeme_figure_des_connaissances_humaines.jpg/116px-Encyclopedie_1751_Systeme_figure_des_connaissances_humaines.jpg)](/File:Encyclopedie_1751_Systeme_figure_des_connaissances_humaines.jpg)

Système figuré des connaissances humaines, _Encyclopédie ou Dictionnaire
raisonné des sciences, des arts et des métiers_ , 1751.
6(http://encyclopedie.uchicago.edu/content/syst%C3%A8me-figur%C3%A9-des-
connaissances-humaines)

* [![](/images/thumb/9/96/Haeckel_Ernst_1874_Stammbaum_des_Menschen.jpg/96px-Haeckel_Ernst_1874_Stammbaum_des_Menschen.jpg)](/File:Haeckel_Ernst_1874_Stammbaum_des_Menschen.jpg)

Haeckel - Darwin's tree.

Another _formalized_ and [internalized] process being at play when figuring
out a word is its [containment]. Word is not only structured by way of things
it potentially denotes but also by words it is potentially part of and those
it contains.

The fuzz around categorization of knowledge _and_ the world in the Western
thought can be traced back to Porphyry, if not further. In his introduction to
Aristotle's _Categories_ this 3rd century AD Neoplatonist began expanding the
notions of genus and species into their hypothetic consequences. Aristotle's
brief work outlines ten categories of 'things that are said' (legomena,
λεγόμενα), namely substance (or substantive, {not the same as matter!},
οὐσία), quantity (ποσόν), qualification (ποιόν), a relation (πρός), where
(ποῦ), when (πότε), being-in-a-position (κεῖσθαι), having (or state,
condition, ἔχειν), doing (ποιεῖν), and being-affected (πάσχειν). In his
different work, _Topics_ , Aristotle outlines four kinds of subjects/materials
indicated in propositions/problems from which arguments/deductions start.
These are a definition (όρος), a genus (γένος), a property (ἴδιος), and an
accident (συμβεβηϰόϛ). Porphyry does not explicitly refer _Topics_ , and says
he omits speaking "about genera and species, as to whether they subsist (in
the nature of things) or in mere conceptions only"
8(http://www.ccel.org/ccel/pearse/morefathers/files/porphyry_isagogue_02_translation.htm#C1),
which means he avoids explicating whether he talks about kinds of concepts or
kinds of things in the sensible world. However, the work sparked confusion, as
the following passage [suggests]:

> "[I]n each category there are certain things most generic, and again, others
most special, and between the most generic and the most special, others which
are alike called both genera and species, but the most generic is that above
which there cannot be another superior genus, and the most special that below
which there cannot be another inferior species. Between the most generic and
the most special, there are others which are alike both genera and species,
referred, nevertheless, to different things, but what is stated may become
clear in one category. Substance indeed, is itself genus, under this is body,
under body animated body, under which is animal, under animal rational animal,
under which is man, under man Socrates, Plato, and men particularly." (Owen
1853,
9(http://www.ccel.org/ccel/pearse/morefathers/files/porphyry_isagogue_02_translation.htm#C2))

Porphyry took one of Aristotle's ten categories of the word, substance, and
dissected it using one of his four rhetorical devices, genus. Employing
Aristotle's categories, genera and species as means for logical operations,
for dialectic, Porphyry's interpretation resulted in having more resemblance
to the perceived _structures_ of the world. So they began to bloom.

There were earlier examples, but Porphyry was the most influential in
injecting the _universalist_ version of classification [implying] the figure
of a tree into the [locus] of Aristotle's thought. Knowledge became
monotheistic.

Classification schemes [growing from one point] play a major role in
untangling the format of modern encyclopedia from that of the dictionary
governed by alphabet. Two of the most influential encyclopedias of the 18th
century are cases in the point. Although still keeping 'dictionary' in their
titles, they are conceived not to represent words but knowledge. The [upper-
most] genus of the body was set as the body of knowledge. The English
_Cyclopaedia, or an Universal Dictionary of Arts and Sciences_ (1728) splits
into two main branches: "natural and scientifical" and "artificial and
technical"; these further split down to 47 classes in total, each carrying a
structured list (on the following pages) of thematic articles, serving as
table of contents. The French _Encyclopedia: or a Systematic Dictionary of the
Sciences, Arts, and Crafts_ (1751) [unwinds] from judgement ( _entendement_ ),
branches into memory as history, reason as philosophy, and imagination as
poetry. The logic of containers was employed as an aid not only to deal with
the enormous task of naming and not omiting anything from what is known, but
also for the management of labour of hundreds of writers and researchers, to
create a mechanism for delegating work and the distribution of
responsibilities. Flesh was also more present, in the field research, with
researchers attending workshops and sites of everyday life to annotate it.

The world came forward to unshine the word in other schemes. Darwin's tree of
evolution and some of the modern document classification systems such as
Charles A. Cutter's _Expansive Classification_ (1882) set to classify the
world itself and set the field for what has came to be known as authority
lists structuring metadata in today's computing.

### The structure
(summary)[[edit](/index.php?title=Talks/Poetics_of_Research&action=edit§ion=4
"Edit section: The structure \(summary\)")]

Facetization of meaning and branching of knowledge are both the domain of the
unit of utterance.

While lexicographers[dictionarists] structure thought through multi-layered
processes of abstraction of the written record, knowledge growers dissect it
into hierarchies of [mutually] contained notions.

One seek to describe the word as a faceted list of small worlds, another to
describe the world as a structured lists of words. One play prime in the
domain of epistemology, in what is known, controlling the vocabulary, another
in the domain of ontology, in what is, controlling reality.

Every [word] has its given things, every thing has its place, closer or
further from a single word.

The schism between classifying words and classifying the world implies it is
not possible to construct a universal classification scheme[system]. On top of
that, any classification system of words is bound to a corpus of texts it is
operating upon and any classification system of the world again operates with
words which are bound to a vocabulary[lexicon] which is again bound to a
corpus [of texts]. It doesn't mean it would prevent people from trying.
Classifications function as descriptors of and 'inscriptors' upon the world,
imprinting their authority. They operate from [a locus of] their
corpus[context]-specificity. The larger the corpus, the more power it has on
shaping the world, as far as the word shapes it (yes, I do imply Google here,
for which it is a domain to be potentially exploited).

## (J) The
sequence[[edit](/index.php?title=Talks/Poetics_of_Research&action=edit§ion=5
"Edit section: \(J\) The sequence")]

The structure-yielding query [of] the single word [shrinks][zuzuje
sa,spresnuje] with preceding and following words. Inquiry proceeds in the flow
that establishes another kind[mode] of relationality, chaining words into the
sequence. While the structuring property of the query brings words apart from
each other, its sequential property establishes continuity and brings these
units into an ordered set.

This is what is responsible for attaching textual figures mentioned earlier
(lists, schemes, tables) to the body of the text. Associations can be also
stated explicitly, by indexing tables and then referring them from a
particular point in the text. The same goes for explicit associations made
between blocks of the text by means of indexed paragraphs, chapters or pages.

From this follows that all utterances point to the following utterance by the
nature of sequential order, and indexing provides means for pointing elsewhere
in the document as well.

A lot can be said about references to other texts. Here, to spare time, I
would refer you to a talk I gave a few months ago and which is online
10(http://monoskop.org/Talks/Communing_Texts).

This is still the realm of print. What happens with document when it is
digitized?

Digitization breaks a document into units of which each is assigned a numbered
position in the sequence of the document. From this perspective digitization
can be viewed as a total indexation of the document. It is converted into
units rendered for machine operations. This sequentiality is made explicit, by
means of an underlying index.

Sequences and chains are orders of one dimension. Their one-dimensional
ordering allows addressability of each element and [random] access. [Jumps]
between [random] addresses are still sequential, processing elements one at a
time.

## (K) The
index[[edit](/index.php?title=Talks/Poetics_of_Research&action=edit§ion=6
"Edit section: \(K\) The index")]

* [![](/images/thumb/2/27/Summa_confessorum.1310.jpg/103px-Summa_confessorum.1310.jpg)](/File:Summa_confessorum.1310.jpg)

Summa confessorum [1297-98], 1310.
7(http://www.bl.uk/onlinegallery/onlineex/illmanus/roymanucoll/j/011roy000008g11u00002000.html)

[The] sequencing not only weaves words into statements but activates other
temporalities, and _presents occurrences of words from past statements_. As
now when I am saying the word _utterance_ , each time there surface contexts
in which I have used it earlier.

A long quote from Frederick G. Kilgour, _The Evolution of the Book_ , 1998, pp
76-77:

> "A century of invention of various types of indexes and reference tools
preceded the advent of the first subject index to a specific book, which
occurred in the last years of the thirteenth century. The first subject
indexes were "distinctions," collections of "various figurative or symbolic
meanings of a noun found in the scriptures" that "are the earliest of all
alphabetical tools aside from dictionaries." (Richard and Mary Rouse supply an
example: "Horse = Preacher. Job 39: 'Hast thou given the horse strength, or
encircled his neck with whinning?')

>

> [Concordance] By the end of the third decade of the thirteenth century Hugh
de Saint-Cher had produced the first word concordance. It was a simple word
index of the Bible, with every location of each word listed by [its position
in the Bible specified by book, chapter, and letter indicating part of the
chapter]. Hugh organized several dozen men, assigning to each man an initial
letter to search; for example, the man assigned M was to go through the entire
Bible, list each word beginning with M and give its location. As it was soon
perceived that this original reference work would be even more useful if words
were cited in context, a second concordance was produced, with each word in
lengthy context, but it proved to be unwieldy. [Soon] a third version was
produced, with words in contexts of four to seven words, the model for
biblical concordances ever since.

>

> [Subject index] The subject index, also an innovation of the thirteenth
century, evolved over the same period as did the concordance. Most of the
early topical indexes were designed for writing sermons; some were organized,
while others were apparently sequential without any arrangement. By midcentury
the entries were in alphabetical order, except for a few in some classified
arrangement. Until the end of the century these alphabetical reference works
indexed a small group of books. Finally John of Freiburg added an alphabetical
subject index to his own book, _Summa Confessorum_ (1297—1298). As the Rouses
have put it, 'By the end of the [13]th century the practical utility of the
subject index is taken for granted by the literate West, no longer solely as
an aid for preachers, but also in the disciplines of theology, philosophy, and
both kinds of law.'"

In one sense neither subject-index nor concordane are indexes, they are words
or group of words selected according to given criteria from the body of the
text, each accompanied with a list of identifiers. These identifiers are
elements of an index, whether they represent a page, chapter, column, or other
[kind of] block of text. Every identifier is an unique _address_.

The index is thus an ordering of a sequence by means of associating its
elements with a set of symbols, when each element is given unique combination
of symbols. Different sizes of sets yield different number of variations.
Symbol sets such as an alphabet, arabic numerals, roman numerals, and binary
digits have different proportions between the length of a string of symbols
and the number of possible variations it can contain. Thus two symbols of
English alphabet can store 26^2 various values, of arabic numerals 10^2, of
roman numberals 8^2 and of binary digits 2^2.

Indexation is segmentation, a breaking into segments. From as early as the
13th century the index such as that of sections has served as enabler of
search. The more [detailed] indexation the more precise search results it
enables.

The subject-index and concordance are tables of search results. There is a
direct lineage from the 13th-century biblical concordances and the birth of
computational linguistic analysis, they were both initiated and realised by
priests.

During the World War II, Jesuit Father Roberto Busa began to look for machines
for the automation of the linguistic analysis of the 11 million-word Latin
corpus of Thomas Aquinas and related authors.

Working on his Ph.D. thesis on the concept of _praesens_ in Aquinas he
realised two things:

> "I realized first that a philological and lexicographical inquiry into the
verbal system of an author has t o precede and prepare for a doctrinal
interpretation of his works. Each writer expresses his conceptual system in
and through his verbal system, with the consequence that the reader who
masters this verbal system, using his own conceptual system, has to get an
insight into the writer's conceptual system. The reader should not simply
attach t o the words he reads the significance they have in his mind, but
should try t o find out what significance they had in the writer's mind.
Second, I realized that all functional or grammatical words (which in my mind
are not 'empty' at all but philosophically rich) manifest the deepest logic of
being which generates the basic structures of human discourse. It is .this
basic logic that allows the transfer from what the words mean today t o what
they meant to the writer.

>

> In the works of every philosopher there are two philosophies: the one which
he consciously intends to express and the one he actually uses to express it.
The structure of each sentence implies in itself some philosophical
assumptions and truths. In this light, one can legitimately criticize a
philosopher only when these two philosophies are in contradiction."
11(http://www.alice.id.tue.nl/references/busa-1980.pdf)

Collaborating with the IBM in New York from 1949, the work, a concordance of
all the words of Thomas Aquinas, was finally published in the 1970s in 56
printed volumes (a version is online since 2005
12(http://www.corpusthomisticum.org/it/index.age)). Besides that, an
electronic lexicon for automatic lemmatization of Latin words was created by a
team of ten priests in the scope of two years (in two phases: grouping all the
forms of an inflected word under their lemma, and coding the morphological
categories of each form and lemma), containing 150,000 forms
13(http://www.alice.id.tue.nl/references/busa-1980.pdf#page=4). Father
Busa has been dubbed the father of humanities computing and recently also of
digital humanities.

The subject-index has a crucial role in the printed book. It is the only means
for search the book offers. Subjects composing an index can be selected
according to a classification scheme (specific to a field of an inquiry), for
example as elements of a certain degree (with a given minimum number of
subclasses).

Its role seemingly vanishes in the digital text. But it can be easily
transformed. Besides serving as a table of pre-searched results the subject-
index also gives a distinct idea about content of the book. Two patterns give
us a clue: numbers of occurrences of selected words give subjects weights,
while words that seem specific to the book outweights other even if they don't
occur very often. A selection of these words then serves as a descriptor of
the whole text, and can be thought of as a specific kind of 'tags'.

This process was formalized in a mathematical function in the 1970s, thanks to
a formula by Karen Spärck Jones which she entitled 'inverse document
frequency' (IDF), or in other words, "term specificity". It is measured as a
proportion of texts in the corpus where the word appears at least once to the
total number of texts. When multiplied by the frequency of the word _in_ the
text (divided by the maximum frequency of any word in the text), we get _term
frequency-inverse document frequency_ (tf-idf). In this way we can get an
automated list of subjects which are particular in the text when compared to a
group of texts.

We came to learn it by practice of searching the web. It is a mechanism not
dissimilar to thought process involved in retrieving particular information
online. And search engines have it built in their indexing algorithms as well.

There is a paper proposing attaching words generated by tf-idf to the
hyperlinks when referring websites 14(http://bscit.berkeley.edu/cgi-
bin/pl_dochome?query_src=&format=html&collection=Wilensky_papers&id=3&show_doc=yes).
This would enable finding the referred content even after the link is dead.
Hyperlinks in references in the paper use this feature and it can be easily
tested: 15(http://www.cs.berkeley.edu/~phelps/papers/dissertation-
abstract.html?lexical-
signature=notemarks+multivalent+semantically+franca+stylized).

There is another measure, cosine similarity, which takes tf-idf further and
can be applied for clustering texts according to similarities in their
specificity. This might be interesting as a feature for digital libraries, or
even a way of organising library bottom-up into novel categories, new
discourses could emerge. Or as an aid for researchers to sort through texts,
or even for editors as an aid in producing interesting anthologies.

## Final
remarks[[edit](/index.php?title=Talks/Poetics_of_Research&action=edit§ion=7
"Edit section: Final remarks")]

1

New disciplines emerge all the time - most recently, for example, cultural
techniques, software studies, or media archaeology. It takes years, even
decades, before they gain dedicated shelves in libraries or a category in
interlibrary digital repositories. Not that it matters that much. They are not
only sites of academic opportunities but, firstly, frameworks of new
perspectives of looking at the world, new domains of knowledge. From the
perspective of researcher the partaking in a discipline involves negotiating
its vocabulary, classifications, corpus, reference field, and specific
terms[subjects]. Creating new fields involves all that, and more. Even when
one goes against all disciplines.

2

Google can still surprise us.

3

Knowledge has been in the making for millenia. There have been (abstract)
mechanisms established that govern its conditions. We now possess specialized
corpora of texts which are interesting enough to serve as a ground to discuss
and experiment with dictionaries, classifications, indexes, and tools for
references retrieval. These all belong to the poetic devices of knowledge-
making.

4

Command-line example of tf-idf and concordance in 3 steps.

* 1\. Process the files text.1-5.txt and produce freq.1-5.txt with lists of (nonlemmatized) words (in respective texts), ordered by frequency:

> for i in {1..5}; do tr '[A-Z]' '[a-z]' < text.$i.txt | tr -c '[a-z]'
'[\012*]' | tr -d '[:punct:]' | sort | uniq -c | sort -k 1nr | sed '1,1d' >
temp.txt; max=$(awk -vvar=1 -F" " 'NR

1 {print $var}' temp.txt); awk
-vmaxx=$max -F' ' '{printf "%-7.7f %s\n", $1=0.5+($1/(maxx*2)), $2}' > freq.$i.txt; done && rm temp.txt

* 2\. Process the files freq.1-5.txt and produce tfidf.1-5.txt containing a list of words (out of 500 most frequent in respective lists), ordered by weight (specificity for each text):

> for j in {1..5}; do rm freq.$j.txt.temp; lines=$(wc -l freq.$j.txt) && for i
in {1..500}; do word=$(awk -vline="$i" -vfield=2 -F" " 'NR

line {print
$field}' freq.$j.txt); tf=$(awk -vline="$i" -vfield=1 -F" " 'NR

line {print
$field}' freq.$j.txt); count=$(egrep -lw $word freq.?.txt | wc -l); idf=$(echo
"1+l(5/$count)" | bc -l); tfidf=$(echo $tf*$idf | bc); echo $word $tfidf >>
freq.$j.txt.temp; done; sort -k 2nr < freq.$j.txt.temp > tfidf.$j.txt; done

* 3\. Process the files tfidf.1-5.txt and their source text, text.txt, and produce occ.txt with concordance of top 3 words from each of them:

> rm occ.txt && for j in {1..5}; do echo "$j" >> occ.txt; ptx -f -w 150
text.txt.$j > occ.$j.txt; for i in {1..3}; do word=$(awk -vline="$i" -vfield=1
-F" " 'NR

line {print $field}' tfidf.$j.txt); egrep -i
"[alpha:](/index.php?title=Alpha:&action=edit&redlink=1 "Alpha: \(page does
not exist\)") $word" occ.$j.txt >> occ.txt; done; done

Dušan Barok

_Written 23 October - 1 November 2014 in Bratislava and Stuttgart._


Mars & Medak
System of a Takedown
2019


System of a Takedown: Control and De-­commodification in the Circuits of Academic Publishing
Marcell Mars and Tomislav Medak

Since 2012 the Public Library/Memory of the World1 project has
been developing and publicly supporting scenarios for massive
disobedience against the current regulation of production and
circulation of knowledge and culture in the digital realm. While
the significance of that year may not be immediately apparent to
everyone, across the peripheries of an unevenly developed world
of higher education and research it produced a resonating void.
The takedown of the book-­sharing site Library.nu in early 2012
gave rise to an anxiety that the equalizing effect that its piracy
had created—­the fact that access to the most recent and relevant
scholarship was no longer a privilege of rich academic institutions
in a few countries of the world (or, for that matter, the exclusive
preserve of academia to begin with)—­would simply disappear into
thin air. While alternatives within these peripheries quickly filled
the gap, it was only through an unlikely set of circumstances that
they were able to do so, let alone continue to exist in light of the
legal persecution they now also face.

48

The starting point for the Public Library/Memory of the World
project was a simple consideration: the public library is the institutional form that societies have devised in order to make knowledge
and culture accessible to all their members regardless of social or
economic status. There’s a political consensus that this principle of
access is fundamental to the purpose of a modern society. Yet, as
digital networks have radically expanded the access to literature
and scientific research, public libraries were largely denied the
ability to extend to digital “objects” the kind of de-­commodified
access they provide in the world of print. For instance, libraries
frequently don’t have the right to purchase e-­books for lending and
preservation. If they do, they are limited by how many times—­
twenty-­six in the case of one publisher—­and under what conditions
they can lend them before not only the license but the “object”
itself is revoked. In the case of academic journals, it is even worse:
as they move to predominantly digital models of distribution,
libraries can provide access to and “preserve” them only for as
long as they pay extortionate prices for ongoing subscriptions. By
building tools for organizing and sharing electronic libraries, creating digitization workflows, and making books available online, the
Public Library/Memory of the World project is aimed at helping to
fill the space that remains denied to real-­world public libraries. It is
obviously not alone in this effort. There are many other platforms,
some more public, some more secretive, working to help people
share books. And the practice of sharing is massive.
—­https://www.memoryoftheworld.org

Capitalism and Schizophrenia
New media remediate old media. Media pay homage to their
(mediatic) predecessors, which themselves pay homage to their
own (mediatic) predecessors. Computer graphics remediate film,
which remediates photography, which remediates painting, and so
on (McLuhan 1965, 8; Bolter and Grusin 1999). Attempts to understand new media technologies always settle on a set of metaphors

(of the old and familiar), in order to approximate what is similar,
and yet at the same time name the new. Every such metaphor has
its semiotic distance, decay, or inverse-­square law that draws the
limit how far the metaphor can go in its explanation of the phenomenon to which it is applied. The intellectual work in the Age of
Mechanical Reproduction thus received an unfortunate metaphor:
intellectual property. A metaphor modeled on the scarce and
exclusive character of property over land. As the Age of Mechanical
Reproduction became more and more the Age of Discrete and
Digital Reproduction, another metaphor emerged, one that reveals
the quandary left after decades of decay resulting from the increasing distanciation of intellectual property from the intellectual work
it seeks to regulate, and that metaphor is: schizophrenia.
Technologies compete with each other—­the discrete and the
digital thus competes with the mechanical—­and the aftermath of
these clashes can be dramatic. People lose their jobs, companies
go bankrupt, disciplines lose their departments, and computer
users lose their old files. More often than not, clashes between
competing technologies create antagonisms between different
social groups. Their voices are (sometimes) heard, and society tries
to balance their interests.
If the institutional remedies cannot resolve the social antagonism,
the law is called on to mediate. Yet in the present, the legal system
only reproduces the schizoid impasse where the metaphor of property over land is applied to works of intellect that have in practical
terms become universally accessible in the digital world. Court
cases do not result in a restoration of balance but rather in the
confirmation of entrenched interests. It is, however, not necessary
that courts act in such a one-­sided manner. As Cornelia Vismann
(2011) reminds us in her analysis of the ancient roots of legal mediation, the juridical process has two facets: first, a theatrical aspect
that has common roots with the Greek dramatic theatre and its
social function as a translator of a matter of conflict into a case for
weighted juridical debate; second, an agonistic aspect not unlike a
sporting competition where a winner has to be decided, one that

49

50

leads to judgment and sanction. In the matter of copyright versus
access, however, the fact that courts cannot look past the metaphor of intellectual property, which reduces any understanding of
our contemporary technosocial condition to an analogy with the
scarcity-­based language of property over land, has meant that they
have failed to adjudicate a matter of conflict between the equalizing effects of universal access to knowledge and the guarantees of
rightful remuneration for intellectual labor into a meaningful social
resolution. Rather they have primarily reasserted the agonistic
aspect by supporting exclusively the commercial interests of large
copyright industries that structure and deepen that conflict at the
societal level.
This is not surprising. As many other elements of contemporary
law, the legal norms of copyright were articulated and codified
through the centuries-­long development of the capitalist state
and world-system. The legal system is, as Nicos Poulantzas (2008,
25–­26) suggests, genetically structured by capitalist development.
And yet at the same time it is semi-­autonomous; the development
of its norms and institutional aspects is largely endogenous and
partly responsive to the specific needs of other social subsystems.
Still, if the law and the courts are the codified and lived rationality
of a social formation, then the choice of intellectual property as a
metaphor in capitalist society comes as no surprise, as its principal
objective is to institute a formal political-­economic framework for
the commodification of intellectual labor that produces knowledge
and culture. There can be no balance, only subsumption and
accumulation. Capitalism and schizophrenia.
Schizophrenia abounds wherever the discrete and the digital
breaking barriers to access meets capitalism. One can only wonder
how the conflicting interests of different divisions get disputed
and negotiated in successful corporate giants like Sony Group
where Sony Pictures Entertainment,2 Sony Music Entertainment3
and Sony Computer Entertainment coexist under the same roof
with the Sony Electronics division, which invented the Walkman
back in 1979 and went on to manufacture devices and gadgets like

home (and professional) audio and video players/recorders (VHS,
Betamax, TV, HiFi, cassette, CD/DVD, mp3, mobile phones, etc.),
storage devices, personal computers, and game consoles. In the
famous 1984 Betamax case (“Sony Corp. of America v. Universal
City Studios, Inc.,” Wikipedia 2015), Universal Studios and the Walt
Disney Company sued Sony for aiding copyright infringement with
their Betamax video recorders. Sony won. The court decision in
favor of fair use rather than copyright infringement laid the legal
ground for home recording technology as the foundation of future
analog, and subsequently digital, content sharing.
Five years later, Sony bought its first major Hollywood studio:
Columbia Pictures. In 2004 Sony Music Entertainment merged with
Bertelsmann Music Group to create Sony BMG. However, things
changed as Sony became the content producer and we entered the
age of the discrete and the digital. Another five years later, in 2009,
Sony BMG sued Joel Tenenbaum for downloading and then sharing
thirty-­one songs. The jury awarded US$675,000 to the music
companies (US$22,000 per song). This is known as “the second
file-­sharing case.” “The first file-­sharing case” was 2007’s Capitol Records, Inc. v. Thomas-­Rasset, which concerned the downloading of
twenty-­four songs. In the second file-­sharing case, the jury awarded
music companies US$1,920,000 in statutory damages (US$80,000
per song). The defendant, Jammie Thomas, was a Native American
mother of four from Brainerd, Minnesota, who worked at the time
as a natural resources coordinator for the Mille Lacs Band of the
Native American Ojibwe people. The conflict between access and
copyright took a clear social relief.
Encouraged by the court decisions in the years that followed, the
movie and music industries have started to publicly claim staggering numbers in annual losses: US$58 billion and 370,000 lost jobs
in the United States alone. The purported losses in sales were,
however, at least seven times bigger than the actual losses and,
if the jobs figures had been true, after only one year there would
have been no one left working in the content industry (Reid 2012).
Capitalism and schizophrenia.

51

52

If there is a reason to make an exception from the landed logic of
property being imposed onto the world of the intellect, a reason
to which few would object, it would be for access for educational
purposes. Universities in particular give an institutional form to
the premise that equal access to knowledge is a prerequisite for
building a society where all people are equal.
In this noble endeavor to make universal access to knowledge
central to social development, some universities stand out more
than the others. Consider, for example, the Massachusetts Institute
of Technology (MIT). The Free Culture and Open Access movements
have never hidden their origins, inspiration, and model in the
success of the Free Software Movement, which was founded in
1984 by Richard Stallman while he was working at the MIT Artificial
Intelligence lab. It was at the MIT Museum that the “Hall of Hacks”
was set up to proudly display the roots of hacking culture. Hacking
culture at MIT takes many shapes and forms. MIT hackers famously
put a fire truck (2006) and a campus police car (1994) onto the
roof of the Great Dome of the campus’s Building 10; they landed
(and then exploded) a weather balloon onto the pitch of Harvard
Stadium during a Harvard–­Yale football game; turned the quote
that “getting an education from MIT is like taking a drink from a Fire
Hose” into a literal fire hydrant serving as a drinking fountain in
front of the largest lecture hall on campus; and many, many other
“hacks” (Peterson 2011).
The World Wide Web Consortium was founded at MIT in 1993.
Presently its mission states as its goal “to enable human communication, commerce, and opportunities to share knowledge,”
on the principles of “Web for All” and the corresponding, more
technologically focused “Web on Everything.” Similarly, MIT began
its OpenCourseWare project in 2002 in order “to publish all of
[MIT’s] course materials online and make them widely available to
everyone” (n.d.). The One Laptop Per Child project was created in
2005 in order to help children “learn, share, create, and collaborate” (2010). Recently the MIT Media Lab (2017) has even started its
own Disobedience Award, which “will go to a living person or group

engaged in what we believe is extraordinary disobedience for
the benefit of society . . . seeking both expected and unexpected
nominees.” When it comes to the governance of access to MIT’s
own resources, it is well known that anyone who is registered and
connected to the “open campus” wireless network, either by being
physically present or via VPN, can search JSTOR, Google Scholar,
and other databases in order to access otherwise paywalled journals from major publishers such as Reed Elsevier, Wiley-­Blackwell,
Springer, Taylor and Francis, or Sage.
The MIT Press has also published numerous books that we love
and without which we would have never developed the Public
Library/Memory of the World project to the stage where it is now.
For instance, only after reading Markus Krajewski’s Paper Machines: About Cards & Catalogs, 1548–­1929 (2011) and learning how
conceptually close librarians came to the universal Turing machine
with the invention of the index card catalog did we center the
Public Library/Memory of the World around the idea of the catalog.
Eric von Hippel’s Democratizing Innovation (2005) taught us how end
users could become empowered to innovate and accordingly we
have built our public library as a distributed network of amateur
librarians acting as peers sharing their catalogs and books. Sven
Spieker’s The Big Archive: Art from Bureaucracy (2008) showed us the
exciting hybrid meta-­space between psychoanalysis, media theory,
and conceptual art one could encounter by visiting the world of
catalogs and archives. Understanding capitalism and schizophrenia would have been hard without Semiotext(e)’s translations of
Deleuze and Guattari, and remaining on the utopian path would
have been impossible if not for our reading of Cybernetic Revolutionaries (Medina 2011), Imagine No Possessions (Kiaer 2005), or Art
Power (Groys 2008).

Our Road into Schizophrenia, Commodity
Paradox, Political Strategy
Our vision for the Public Library/Memory of the World resonated
with many people. After the project initially gained a large number

53

54

of users, and was presented in numerous prominent artistic
venues such as Museum Reina Sofía, Transmediale, Württembergischer Kunstverein, Calvert22, 98weeks, and many more, it was no
small honor when Eric Kluitenberg and David Garcia invited us to
write about the project for an anthology on tactical media that was
to be published by the MIT Press. Tactical media is exactly where
we would situate ourselves on the map. Building on Michel de
Certeau’s concept of tactics as agency of the weak operating in the
terrain of strategic power, the tactical media (Tactical Media Files
2017) emerged in the political and technological conjuncture of the
1990s. Falling into the “art-­into-­life” lineage of historic avant-­gardes,
Situationism, DIY culture, techno-­hippiedom, and media piracy, it
constituted a heterogeneous field of practices and a manifestly
international movement that combined experimental media and
political activism into interventions that contested the post–­Cold
War world of global capitalism and preemptive warfare on a hybrid
terrain of media, institutions, and mass movements. Practices of
tactical media ranged from ephemeral media pranks, hoaxes, and
hacktivism to reappropriations of media apparatuses, institutional
settings, and political venues. We see our work as following in
that lineage of recuperation of the means of communication from
their capture by personal and impersonal structures of political or
economic power.
Yet the contract for our contribution that the MIT Press sent us in
early 2015 was an instant reminder of the current state of affairs
in academic publishing: in return for our contribution and transfer
of our copyrights, we would receive no compensation: no right to
wage and no right to further distribute our work.
Only weeks later our work would land us fully into schizophrenia:
the Public Library/Memory of the World received two takedown
notices from the MIT Press for books that could be found in its
back then relatively small yet easily discoverable online collection
located at https://library.memoryoftheworld.org, including a notice
for one of the books that had served as an inspiration to us: Art
Power. First, no wage and, now, no access. A true paradox of the

present-­day system of knowledge production: products of our
labor are commodities, yet the labor-­power producing them is
denied the same status. While the project’s vision resonates with
many, including the MIT Press, it has to be shut down. Capitalism
and schizophrenia.4
Or, maybe, not. Maybe we don’t have to go down that impasse.
Starting from the two structural circumstances imposed on us by
the MIT Press—­the denial of wage and the denial of access—­we
can begin to analyze why copyright infringement is not merely, as
the industry and the courts would have it, a matter of illegality. But
rather a matter of legitimate action.
Over the past three decades a deep transformation, induced by
the factors of technological change and economic restructuring,
has been unfolding at different scales, changing the way works
of culture and knowledge are produced and distributed across
an unevenly developed world. As new technologies are adopted,
generalized, and adapted to the realities of the accumulation
process—­a process we could see unfolding with the commodification of the internet over the past fifteen years—­the core and
the periphery adopt different strategies of opposition to the
inequalities and exclusions these technologies start to reproduce.
The core, with its emancipatory and countercultural narratives,
pursues strategies that develop legal, economic, or technological
alternatives. However, these strategies frequently fail to secure
broader transformative effects as the competitive forces of the
market appropriate, marginalize, or make obsolete the alternatives
they advocate. Such seems to have been the destiny of much of the
free software, open access, and free culture alternatives that have
developed over this period.
In contrast, the periphery, in order to advance, relies on strategies
of “stealing” that bypass socioeconomic barriers by refusing to
submit to the harmonized regulation that sets the frame for global
economic exchange. The piracy of intellectual property or industrial
secrets thus creates a shadow system of exchange resisting the

55

56

asymmetries of development in the world economy. However, its
illegality serves as a pretext for the governments and companies of
the core to devise and impose further controls over the technosocial systems that facilitate these exchanges.
Both strategies develop specific politics—­a politics of reform, on
the one hand, and a politics of obfuscation and resistance, on the
other—­yet both are defensive politics that affirm the limitations
of what remains inside and what remains outside of the politically
legitimate.
The copyright industry giants of the past and the IT industry giants
of the present are thus currently sorting it out to whose greater
benefit will this new round of commodification work out. For those
who find themselves outside of the the camps of these two factions
of capital, there’s a window of opportunity, however, to reconceive
the mode of production of literature and science that has been
with us since the beginning of the print trade and the dawn of capitalism. It’s a matter of change, at the tail end of which ultimately
lies a dilemma: whether we’re going to live in a more equal or a
more unjust, a more commonised or a more commodified world.

Authorship, Law, and Legitimacy
Before we can talk of such structural transformation, the normative
question we expect to be asked is whether something that is considered a matter of law and juridical decision can be made a matter
of politics and political process. Let’s see.
Copyright has a fundamentally economic function—­to unambiguously establish individualized property in the products of creative
labor. A clear indication of this economic function is the substantive requirement of originality that the work is expected to have
in order to be copyrightable. Legal interpretations set a very low
standard on what counts as original, as their function is no more
than to demarcate one creative contribution from another. Once
a legal title is unambiguously assigned, there is a person holding

property with whose consent the contracting, commodification,
and marketing of the work can proceed.5 In that respect copyright
is not that different from the requirement of formal freedom that
is granted to a laborer to contract out their own labor-­power as a
commodity to capital, giving capital authorization to extract maximum productivity and appropriate the products of the laborer’s
labor.6 Copyright might be just a more efficient mechanism of
exploitation as it unfolds through selling of produced commodities
and not labor power. Art market obscures and mediates the
capital-­labor relation
When we talk today of illegal copying, we primarily mean an
infringement of the legal rights of authors and publishers. There’s an
immediate assumption that the infringing practice of illegal copying
and distribution falls under the domain of juridical sanction, that it is
a matter of law. Yet if we look to the history of copyright, the illegality
of copying was a political matter long before it became a legal one.
Publisher’s rights, author’s rights, and mechanisms of reputation—­
the three elements that are fundamental to the present-­day
copyright system—­all have their historic roots in the context of
absolutism and early capitalism in seventeenth-­and eighteenth-­
century Europe. Before publishers and authors were given a
temporary monopoly over the exploitation of their publications
instituted in the form of copyright, they were operating in a system
where they were forced to obtain a privilege to print books from
royal censors. The first printing privileges granted to publishers, in
early seventeenth-­century Great Britain,7 came with the responsibility of publishers to control what was being published and
disseminated in a growing body of printed matter that started to
reach the public in the aftermath of the invention of print and the
rise of the reading culture. The illegality in these early days of print
referred either to printing books without the permission of the
censor or printing books that were already published by another
printer in the territory where the censor held authority. The transition from the privilege tied to the publisher to the privilege tied to
the natural person of the author would unfold only later.

57

58

In the United Kingdom this transition occurred as the guild of
printers, Stationers’ Company, failed to secure the extension of its
printing monopoly and thus, in order to continue with its business,
decided to advocate the introduction of copyright for the authors
instead. This resulted in the passing of the Copyright Act of 1709,
also known as the Statute of Anne (Rose 2010). The censoring
authority and enterprising publishers now proceeded in lockstep to
isolate the author as the central figure in the regulation of literary
and scientific production. Not only did the author receive exclusive
rights to the work, the author was also made—­as Foucault has
famously analyzed (Foucault 1980, 124)—­the identifiable subject of
scrutiny, censorship, and political sanction by the absolutist state.
Although the Romantic author slowly took the center stage in
copyright regulations, economic compensation for the work would
long remain no more than honorary. Until well into the eighteenth
century, literary writing and creativity in general were regarded as
resulting from divine inspiration and not the individual genius of
the author. Writing was a work of honor and distinction, not something requiring an honest day’s pay.8 Money earned in the growing
printing industry mostly stayed in the pockets of publishers, while
the author received literally an honorarium, a flat sum that served
as a “token of esteem” (Woodmansee 1996, 42). It is only once
authors began to voice demands for securing their material and
political independence from patronage and authority that they also
started to make claims for rightful remuneration.
Thus, before it was made a matter of law, copyright was a matter of
politics and economy.

Copyright, Labor, and Economic Domination
The full-­blown affirmation of the Romantic author-­function marks
the historic moment where a compromise is established between
the right of publishers to the economic exploitation of works and
the right of authors to rightful compensation for those works. Economically, this redistribution from publishers to authors was made

possible by the expanding market for printed books in the eighteenth and nineteenth centuries, while politically this was catalyzed
by the growing desire for the autonomy of scientific and literary
production from the system of feudal patronage and censorship
in gradually liberalizing and modernizing capitalist societies. The
newfound autonomy of production was substantially coupled to
production specifically for the market. However, this irenic balance
could not last for very long. Once the production of culture and
science was subsumed under the exigencies of the generalized
market, it had to follow the laws of commodification and competition from which no form of commodity production can escape.
By the beginning of the twentieth century, copyright expanded to
a number of other forms of creativity, transcending its primarily
literary and scientific ambit and becoming part of the broader
set of intellectual property rights that are fundamental to the
functioning and positioning of capitalist enterprise. The corporatization of the production of culture and knowledge thus brought
about a decisive break from the Romantic model that singularized
authorship in the person of the author. The production of cultural
commodities nowadays involves a number of creative inputs from
both credited (but mostly unwaged) and uncredited (but mostly
waged) contributors. The “moral rights of the author,” a substantive
link between the work and the person of the author, are markedly
out of step with these realities, yet they still perform an important
function in the moral economy of reputation, which then serves as
the legitimation of copyright enforcement and monopoly. Moral
rights allow easy attribution; incentivize authors to subsidize
publishers by self-­financing their own work in the hope of topping
the sales charts, rankings, or indexes; and help markets develop
along winner-­takes-­all principles.
The level of concentration in industries primarily concerned with
various forms of intellectual property rights is staggering. The film
industry is a US$88 billion industry dominated by six major studios
(PwC 2015c). The recorded music industry is an almost US$20
billion industry dominated by only three major labels (PwC 2015b).

59

60

The publishing industry is a US$120 billion industry where the
leading ten companies earn in revenues more than the next forty
largest publishing groups (PwC 2015a; Wischenbart 2014).

The Oligopoly and Academic Publishing
Academic publishing in particular draws the state of play into stark
relief. It’s a US$10 billion industry dominated by five publishers and
financed up to 75 percent from library subscriptions. It’s notorious
for achieving extreme year-­on-­year profit margins—­in the case of
Reed Elsevier regularly over 30 percent, with Taylor and Francis,
Springer, Wiley-­Blackwell and Sage barely lagging behind (Larivière,
Haustein, and Mongeon 2015). Given that the work of contributing
authors is not paid but rather financed by their institutions (provided, that is, that they are employed at an institution) and that
these publications nowadays come mostly in the form of electronic
articles licensed under subscription for temporary use to libraries
and no longer sold as printed copies, the public interest could be
served at a much lower cost by leaving commercial closed-­access
publishers out of the equation entirely.
But that cannot be done, of course. The chief reason for this is that
the system of academic reputation and ranking based on publish-­
or-­perish principles is historically entangled with the business of
academic publishers. Anyone who doesn’t want to put their academic career at risk is advised to steer away from being perceived
as reneging on that not-­so-­tacit deal. While this is patently clear
to many in academia, opting for the alternative of open access
means not playing by the rules, and not playing by the rules can
have real-­life consequences, particularly for younger academics.
Early career scholars have to publish in prestigious journals if they
want to advance in the highly competitive and exclusive system of
academia (Kendzior 2012).
Copyright in academic publishing has thus become simply a mechanism of the direct transfer of economic power from producers to
publishers, giving publishers an instrument for maintaining their

stranglehold on the output of academia. But publishers also have
control over metrics and citation indexes, pandering to the authors
with better tools for maximizing their impact and self-­promotion.
Reputation and copyright are extortive instruments that publishers
can wield against authors and the public to prevent an alternative
from emerging.9
The state of the academic publishing business signals how the
“copyright industries” in general might continue to control the
field as their distribution model now transitions to streaming or
licensed-­access models. In the age of cloud computing, autonomous infrastructures run by communities of enthusiasts are
becoming increasingly a thing of the past. “Copyright industries,”
supported by the complicit legal system, now can pressure proxies
for these infrastructures, such as providers of server colocation,
virtual hosting, and domain-­name network services, to enforce
injunctions for them without ever getting involved in direct, costly
infringement litigation. Efficient shutdowns of precarious shadow
systems allow for a corporate market consolidation wherein the
majority of streaming infrastructures end up under the control of a
few corporations.

Illegal Yet Justified, Collective Civil
Disobedience, Politicizing the Legal
However, when companies do resort to litigation or get involved in
criminal proceedings, they can rest assured that the prosecution
and judicial system will uphold their interests over the right of
public to access culture and knowledge, even when the irrationality
of the copyright system lies in plain sight, as it does in the case of
academic publishing. Let’s look at two examples:
On January 6, 2011, Aaron Swartz, a prominent programmer
and hacktivist, was arrested by the MIT campus police and U.S.
Secret Service on charges of having downloaded a large number
of academic articles from the JSTOR repository. While JSTOR, with
whom Swartz reached a settlement and to whom he returned the

61

62

files, and, later, MIT, would eventually drop the charges, the federal
prosecution decided nonetheless to indict Swartz on thirteen
criminal counts, potentially leading to fifty years in prison and a
US$1 million fine. Under growing pressure by the prosecution
Swartz committed suicide on January 11, 2013.
Given his draconian treatment at the hands of the prosecution
and the absence of institutions of science and culture that would
stand up and justify his act on political grounds, much of Swartz’s
defense focused on trying to exculpate his acts, to make them less
infringing or less illegal than the charges brought against him had
claimed, a rational course of action in irrational circumstances.
However, this was unfortunately becoming an uphill battle as the
prosecution’s attention was accidentally drawn to a statement
written by Swartz in 2008 wherein he laid bare the dysfunctionality
of the academic publishing system. In his Guerrilla Open Access
Manifesto, he wrote: “The world’s entire scientific and cultural heritage, published over centuries in books and journals, is increasingly
being digitized and locked up by a handful of private corporations. . . . Forcing academics to pay money to read the work of their
colleagues? Scanning entire libraries but only allowing the folks at
Google to read them? Providing scientific articles to those at elite
universities in the First World, but not to children in the Global
South? It’s outrageous and unacceptable.” After a no-­nonsense
diagnosis followed an even more clear call to action: “We need
to download scientific journals and upload them to file sharing
networks. We need to fight for Guerilla Open Access” (Swartz 2008).
Where a system has failed to change unjust laws, Swartz felt, the
responsibility was on those who had access to make injustice a
thing of the past.
Whether Swartz’s intent actually was to release the JSTOR repository remains subject to speculation. The prosecution has never
proven that it was. In the context of the legal process, his call to
action was simply taken as a matter of law and not for what it
was—­a matter of politics. Yet, while his political action was pre-

empted, others have continued pursuing his vision by committing
small acts of illegality on a massive scale. In June 2015 Elsevier won
an injunction against Library Genesis, the largest illegal repository
of electronic books, journals, and articles on the Web, and its
subsidiary platform for accessing academic journals, Sci-­hub. A
voluntary and noncommercial project of anonymous scientists
mostly from Eastern Europe, Sci-­hub provides as of end of 2015
access to more than 41 million academic articles either stored
in its database or retrieved through bypassing the paywalls of
academic publishers. The only person explicitly named in Elsevier’s
lawsuit was Sci-­hub’s founder Alexandra Elbakyan, who minced no
words: “When I was working on my research project, I found out
that all research papers I needed for work were paywalled. I was
a student in Kazakhstan at the time and our university was not
subscribed to anything” (Ernesto 2015). Being a computer scientist,
she found the tools and services on the internet that allowed her to
bypass the paywalls. At first, she would make articles available on
internet forums where people would file requests for the articles
they needed, but eventually she automated the process, making
access available to everyone on the open web. “Thanks to Elsevier’s
lawsuit, I got past the point of no return. At this time I either have
to prove we have the full right to do this or risk being executed like
other ‘pirates’ . . . If Elsevier manages to shut down our projects or
force them into the darknet, that will demonstrate an important
idea: that the public does not have the right to knowledge. . . .
Everyone should have access to knowledge regardless of their
income or affiliation. And that’s absolutely legal. Also the idea
that knowledge can be a private property of some commercial
company sounds absolutely weird to me” (Ernesto 2015).
If the issue of infringement is to become political, a critical mass
of infringing activity has to be achieved, access technologically
organized, and civil disobedience collectively manifested. Only in
this way do the illegal acts stand a chance of being transformed
into the legitimate acts.

63

64

Where Law Was, there Politics Shall Be
And thus we have made a full round back to where we started. The
parallel development of liberalism, copyright, and capitalism has
resulted in a system demanding that the contemporary subject
act in accordance with two opposing tendencies: “more capitalist
than capitalist and more proletarian than proletariat” (Deleuze
and Guattari 1983, 34). Schizophrenia is, as Deleuze and Guattari
argue, a condition that simultaneously embodies two disjunctive
positions. Desire and blockage, flow and territory. Capitalism is
the constant decoding of social blockages and territorializations
aimed at liberating the production of desires and flows further
and further, only to oppose them at its extreme limit. It decodes
the old socius by means of private property and commodity
production, privatization and abstraction, the flow of wealth and
flows of workers (140). It allows contemporary subjects—­including
corporate entities such as the MIT Press or Sony—­to embrace their
contradictions and push them to their limits. But capturing them in
the orbit of the self-­expanding production of value, it stops them
at going beyond its own limit. It is this orbit that the law sanctions
in the present, recoding schizoid subjects into the inevitability of
capitalism. The result is the persistence of a capitalist reality antithetical to common interest—­commercial closed-­access academic
publishing—­and the persistence of a hyperproletariat—­an intellectual labor force that is too subsumed to organize and resist the
reality that thrives parasitically on its social function. It’s a schizoid
impasse sustained by a failed metaphor.
The revolutionary events of the Paris Commune of 1871, its mere
“existence” as Marx has called it,10 a brief moment of “communal
luxury” set in practice as Kristin Ross (2015) describes it, demanded
that, in spite of any circumstances and reservations, one takes a
side. And such is our present moment of truth.
Digital networks have expanded the potential for access and
created an opening for us to transform the production of knowledge and culture in the contemporary world. And yet they have
likewise facilitated the capacity of intellectual property industries

to optimize, to cut out the cost of printing and physical distribution.
Digitization is increasingly helping them to control access, expand
copyright, impose technological protection measures, consolidate
the means of distribution, and capture the academic valorization
process.
As the potential opening for universalizing access to culture and
knowledge created by digital networks is now closing, attempts at
private legal reform such as Creative Commons licenses have had
only a very limited effect. Attempts at institutional reform such as
Open Access publishing are struggling to go beyond a niche. Piracy
has mounted a truly disruptive opposition, but given the legal
repression it has met with, it can become an agent of change only if
it is embraced as a kind of mass civil disobedience. Where law was,
there politics shall be.
Many will object to our demand to replace the law with politicization. Transitioning from politics to law was a social achievement
as the despotism of political will was suppressed by legal norms
guaranteeing rights and liberties for authors; this much is true. But
in the face of the draconian, failed juridical rationality sustaining
the schizoid impasse imposed by economic despotism, these developments hold little justification. Thus we return once more to the
words of Aaron Swartz to whom we remain indebted for political
inspiration and resolve: “There is no justice in following unjust laws.
It’s time to come into the light and, in the grand tradition of civil
disobedience, declare our opposition to this private theft of public
culture. . . . With enough of us, around the world, we’ll not just send
a strong message opposing the privatization of knowledge—­we’ll
make it a thing of the past. Will you join us?” (Swartz 2008).

Notes
1

We initially named our project Public Library because we have developed it
as a technosocial project from a minimal definition that defines public library
as constituted by three elements: free access to books for every member of
a society, a library catalog, and a librarian (Mars, Zarroug and Medak, 2015).
However, this definition covers all public libraries and shadow libraries
complementing the work of public libraries in providing digital access. We have
thus decided to rename our project as Memory of the World, after our project’s

65

initial domain name. This is a phrase coined by Henri La Fontaine, whose men-

66

tion we found in Markus Krajewski’s Paper Machines (2011). It turned out that
UNESCO runs a project under the same name with the objective to preserve
valuable archives for the whole of humanity. We have appropriated that objective. Given that this change has happened since we drafted the initial version
of this text in 2015, we’ll call our project in this text with a double name Public
Library/Memory of the World.
2

Sony Pictures Entertainment became the owner of two (MGM, Columbia Pictures) out of eight Golden Age major movie studios (“Major Film Studio,” Wikipedia 2015).

3

In 2012 Sony Music Entertainment is one of the Big Three majors (“Record
Label,” Wikipedia 2015).

4

Since this anecdote was recounted by Marcell in his opening keynote in the
Terms of Media II conference at Brown University, we have received another
batch of takedown notices from the MIT Press. It seemed as no small irony,
because at the time the Terms of Media conference reader was rumored to be
distributed by the MIT Press.

5

“In law, authorship is a point of origination of a property right which, thereafter, like other property rights, will circulate in the market, ending up in the
control of the person who can exploit it most profitably. Since copyright serves
paradoxically to vest authors with property only to enable them to divest that
property, the author is a notion which needs only to be sustainable for an
instant” (Bently 1994).

6

For more on the formal freedom of the laborer to sell his labor-­power, see
chapter 6 of Marx’s Capital (1867).

7

For a more detailed account of the history of printing privilege in Great Britain,
but also the emergence of peer review out of the self-­censoring performed by
the Royal Academy and Académie de sciences in return for the printing privilege, see Biagioli 2002.

8

The transition of authorship from honorific to professional is traced in Woodmansee 1996.

9

Not all publishers are necessarily predatory. For instance, scholar-­led open-­
access publishers, such as those working under the banner of Radical Open
Access (http://radicaloa.disruptivemedia.org) have been experimenting with
alternatives to the dominant publishing models, workflows, and metrics, radicalizing the work of conventional open access, which has by now increasingly
become recuperated by big for-­profit publishers, who see in open access an
opportunity to assume the control over the economy of data in academia.
Some established academic publishers, too, have been open to experiments
that go beyond mere open access and are trying to redesign how academic
writing is produced, made accessible, and valorized. This essay has the good
fortune of appearing as a joint publication of two such publishers: Meson Press
and University of Minnesota Press.

10

“The great social measure of the Commune was its own working existence”
(Marx 1871).

References
Bently, Lionel. 1994. “Copyright and the Death of the Author in Literature and Law.”
The Modern Law Review 57, no. 6: 973–­86. Accessed January 2, 2018. doi:10.1111/
j.1468–­2230.1994.tb01989.x.
Biagioli, Mario. 2002. “From Book Censorship to Academic Peer Review.” Emergences:
Journal for the Study of Media & Composite Cultures 12, no. 1: 11–­45.
Bolter, Jay David, and Richard Grusin. 1999. Remediation: Understanding New Media.
Cambridge, Mass.: MIT Press.
Deleuze, Gilles, and Félix Guattari. 1983. Anti-­Oedipus: Capitalism and Schizophrenia.
Minneapolis: University of Minnesota Press.
Ernesto. 2015. “Sci-­Hub Tears Down Academia’s ‘Illegal’ Copyright Paywalls.” TorrentFreak, June 27. Accessed October 18, 2015. https://torrentfreak.com/sci-hub-tears
-down-academias-illegal-copyright-paywalls-150627/.
Foucault, Michel. 1980. “What Is an Author?” In Language, Counter-­Memory, Practice:
Selected Essays and Interviews, ed. Donald F. Bouchard, 113–­38. Ithaca, N.Y.: Cornell
University Press.
Groys, Boris. 2008. Art Power. Cambridge, Mass.: MIT Press.
Kendzior, Sarah. 2012. “Academic Paywalls Mean Publish and Perish.” Al Jazeera
English, October 2. Accessed October 18, 2015. http://www.aljazeera.com/indepth/
opinion/2012/10/20121017558785551.html.
Kiaer, Christina. 2005. Imagine No Possessions: The Socialist Objects of Russian Constructivism. Cambridge, Mass.: MIT Press.
Krajewski, Markus. 2011. Paper Machines: About Cards & Catalogs, 1548–­1929. Cambridge, Mass.: MIT Press.
Larivière, Vincent, Stefanie Haustein, and Philippe Mongeon. 2015. “The Oligopoly of
Academic Publishers in the Digital Era.” PLoS ONE 10, no. 6. Accessed January 2,
2018. doi:10.1371/journal.pone.0127502.
Mars, Marcell, Marar Zarroug, and Tomislav Medak. 2015. “Public Library (essay).” in
Public Library, ed. Marcell Mars and Tomislav Medak. Zagreb: Multimedia Institute
& What, how & for Whom/WHW.
Marx, Karl. 1867. Capital, Vol. 1. Available at: Marxists.org. Accessed April 9, 2017.
https://www.marxists.org/archive/marx/works/1867-c1/ch06.htm.
Marx, Karl. 1871. “The Civil War in France.” Available at: Marxists.org. Accessed April 9,
2017. https://www.marxists.org/archive/marx/works/1871/civil-war-france/.
McLuhan, Marshall. 1965. Understanding Media: The Extensions of Man. New York:
McGraw-­Hill.
Medina, Eden. 2011. Cybernetic Revolutionaries: Technology and Politics in Allende’s
Chile. Cambridge, Mass. MIT Press.
MIT Media Lab. 2017. “MIT Media Lab Disobedience Award.” Accessed 10 April 2017,
https://media.mit.edu/disobedience/.
MIT OpenCourseWare. n.d. “About OCW | MIT OpenCourseWare | Free Online
Course Materials.” Accessed October 28, 2015. http://ocw.mit.edu/about/.
One Laptop per Child. 2010. “One Laptop per Child (OLPC): Vision.” Accessed October
28, 2015. http://laptop.org/en/vision/.

67

68

Peterson, T. F., ed. 2011. Nightwork: A History of Hacks and Pranks at MIT. Cambridge,
Mass.: MIT Press.
Poulantzas, Nicos. 2008. The Poulantzas Reader: Marxism, Law, and the State. London:
Verso.
PwC. 2015a. “Book Publishing.” Accessed October 18, 2015. http://www.pwc.com/gx
/en/industries/entertainment-media/outlook/segment-insights/book-publishing
.html.
PwC. 2015b. “Filmed Entertainment.” Accessed October 18, 2015. http://www.pwc.com
/gx/en/industries/entertainment-media/outlook/segment-insights/filmed-enter
tainment.html.
PwC. 2015c. “Music: Growth Rates of Recorded and Live Music.” Accessed October 18,
2015. http://www.pwc.com/gx/en/global-entertainment-media-outlook/assets/
2015/music-key-insights-1-growth-rates-of-recorded-and-live-music.pdf.
Reid, Rob. 2012. “The Numbers behind the Copyright Math.” TED Blog, March 20.
Accessed October 28, 2015, http://blog.ted.com/the-numbers-behind-the
-copyright-math/.
Rose, Mark. 2010. “The Public Sphere and the Emergence of Copyright.” In Privilege
and Property, Essays on the History of Copyright, ed. Ronan Deazley, Martin Kretschmer, and Lionel Bently, 67–­88. Open Book Publishers.
Ross, Kristin. 2015. Communal Luxury: The Political Imaginary of the Paris Commune.
London: Verso.
Spieker, Sven. 2008. The Big Archive: Art from Bureaucracy. Cambridge, Mass.: MIT
Press.
Swartz, Aaron. 2008. “Guerilla Open Access Manifesto.” Internet Archive. Accessed
October 18, 2015. https://archive.org/stream/GuerillaOpenAccessManifesto/
Goamjuly2008_djvu.txt.
Tactical Media Files. 2017. “The Concept of Tactical Media.” Accessed May 4, 2017.
http://www.tacticalmediafiles.net/articles/44999.
Vismann, Cornelia. 2011. Medien der Rechtsprechung. Frankfurt a.M.: S. Fischer Verlag.
von Hippel, Eric. 2005. Democratizing Innovation. Cambridge, Mass.: MIT Press.
Wikipedia, the Free Encyclopedia. 2015a. “Major Film Studio.” Accessed January 2,
2018. https://en.wikipedia.org/w/index.php?title=Major_film_studio&oldid
=686867076.
Wikipedia, the Free Encyclopedia. 2015b. “Record Label.” Accessed January 2, 2018.
https://en.wikipedia.org/w/index.php?title=Record_label&oldid=685380090.
Wikipedia, the Free Encyclopedia. 2015c. “Sony Corp. of America v. Universal City
Studios, Inc.” Accessed January 2, 2018. https://en.wikipedia.org/w/index.php?
title=Sony_Corp._of_America_v._Universal_City_Studios,_Inc.&oldid=677390161.
Wischenbart, Rüdiger. 2015. “The Global Ranking of the Publishing Industry 2014.”
Wischenbart. Accessed October 18, 2015. http://www.wischenbart.com/upload/
Global-Ranking-of-the-Publishing-Industry_2014_Analysis.pdf.
Woodmansee, Martha. 1996. The Author, Art, and the Market: Rereading the History of
Aesthetics. New York: Columbia University Press.
World Wide Web Consortium. n.d.“W3C Mission.” Accessed October 28, 2015. http://
www.w3.org/Consortium/mission.


 

Display 200 300 400 500 600 700 800 900 1000 ALL characters around the word.