Murtaugh
A bag but is language nothing of words
2016


## A bag but is language nothing of words

### From Mondotheque


(language is nothing but a bag of words)

[Michael Murtaugh](/wiki/index.php?title=Michael_Murtaugh "Michael Murtaugh")

In text indexing and other machine reading applications the term "bag of
words" is frequently used to underscore how processing algorithms often
represent text using a data structure (word histograms or weighted vectors)
where the original order of the words in sentence form is stripped away. While
"bag of words" might well serve as a cautionary reminder to programmers of the
essential violence perpetrated on a text and a call to critically question the
efficacy of methods based on subsequent transformations, the expression's use
seems in practice more like a badge of pride or a schoolyard taunt that would
go: Hey language: you're nothin' but a big BAG-OF-WORDS.

## Bag of words

In information retrieval and other so-called _machine-reading_ applications
(such as text indexing for web search engines) the term "bag of words" is used
to underscore how in the course of processing a text the original order of the
words in sentence form is stripped away. The resulting representation is then
a collection of each unique word used in the text, typically weighted by the
number of times the word occurs.

Bags of words, also known as word histograms or weighted term vectors, are a
standard part of the data engineer's toolkit. But why such a drastic
transformation? The utility of "bag of words" is in how it makes text amenable
to code, first in that it's very straightforward to implement the translation
from a text document to a bag of words representation. More significantly,
this transformation then opens up a wide collection of tools and techniques
for further transformation and analysis purposes. For instance, a number of
libraries available in the booming field of "data sciences" work with "high
dimension" vectors; bag of words is a way to transform a written document into
a mathematical vector where each "dimension" corresponds to the (relative)
quantity of each unique word. While physically unimaginable and abstract
(imagine each of Shakespeare's works as points in a 14 million dimensional
space), from a formal mathematical perspective, it's quite a comfortable idea,
and many complementary techniques (such as principal component analysis) exist
to reduce the resulting complexity.
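
As a minimal sketch of the translation described above (not any particular library's implementation), the following Python turns a text into a bag of words and then into a vector of relative word frequencies. The tokenizing rule, the function names `bag_of_words` and `to_vector`, and the second sample sentence are assumptions made for illustration.

```python
from collections import Counter
import re


def bag_of_words(text):
    """Count each unique word, discarding word order (assumed tokenizer)."""
    return Counter(re.findall(r"[a-z']+", text.lower()))


def to_vector(bag, vocabulary):
    """One dimension per vocabulary word, holding its relative frequency."""
    total = sum(bag.values())
    return [bag[word] / total for word in vocabulary]


doc_a = "Language is nothing but a bag of words"
doc_b = "A bag of cotton, a bag of coffee"  # hypothetical second document

bag_a, bag_b = bag_of_words(doc_a), bag_of_words(doc_b)

# The shared vocabulary fixes the meaning of each dimension.
vocabulary = sorted(set(bag_a) | set(bag_b))
vec_a = to_vector(bag_a, vocabulary)
vec_b = to_vector(bag_b, vocabulary)
```

With these two short documents the space has only ten dimensions; the principle is the same when the vocabulary runs to millions of words.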

What's striking about a bag of words representation, given its centrality in so
many text retrieval applications, is its irreversibility. Producing the
original text from its bag of words representation would require in essence
the "brain" of a writer to recompose sentences,
working with the patience of a devoted cryptogram puzzler to draw from the
precise stock of available words. While "bag of words" might well serve as a
cautionary reminder to programmers of the essential violence perpetrated on a
text and a call to critically question the efficacy of methods based on
subsequent transformations, the expression's use seems in practice more like a
badge of pride or a schoolyard taunt that would go: Hey language: you're
nothing but a big BAG-OF-WORDS. Following this spirit of the term, "bag of
words" celebrates a perfunctory step of "breaking" a text into a purer form
amenable to computation, to stripping language of its silly redundant
repetitions and foolishly contrived stylistic phrasings to reveal a purer
inner essence.
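
The irreversibility can be illustrated with this essay's own title and subtitle, which contain the same words in a different order. The sketch below is only an illustration under simple assumptions (whitespace tokenizing, lowercasing); `bag_of_words` and `possible_orderings` are hypothetical helper names.

```python
from collections import Counter
from math import factorial


def bag_of_words(text):
    """Lowercase, split on whitespace, and count: order is thrown away."""
    return Counter(text.lower().split())


def possible_orderings(bag):
    """Number of distinct word sequences that share this bag."""
    n = factorial(sum(bag.values()))
    for count in bag.values():
        n //= factorial(count)
    return n


title = "A bag but is language nothing of words"
subtitle = "language is nothing but a bag of words"

# Two different sentences, one and the same bag.
same_bag = bag_of_words(title) == bag_of_words(subtitle)

# How many sequences a "cryptogram puzzler" would have to choose among.
orderings = possible_orderings(bag_of_words(title))
```

Here `same_bag` is `True` and `orderings` is 40320 (8 factorial): even for an eight-word sentence, the bag alone cannot say which arrangement was written.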

## Book of words

Lieber's Standard Telegraphic Code, first published in 1896 and republished in
various updated editions through the early 1900s, is an example of one of
several competing systems of telegraph code books. The idea was for both
senders and receivers of telegraph messages to use the books to translate
their messages into a sequence of code words which can then be sent for less
money as telegraph messages were paid by the word. In the front of the book, a
list of examples gives a sampling of how messages like: "Have bought for your
account 400 bales of cotton, March delivery, at 8.34" can be conveyed by a
telegram with the message "Ciotola, Delaboravi". In each case the reduction of
number of transmitted words is highlighted to underscore the efficacy of the
method. Like a dictionary or thesaurus, the book is primarily organized around
key words, such as _act_ , _advice_ , _affairs_ , _bags_ , _bail_ , and
_bales_ , under which exhaustive lists of useful phrases involving the
corresponding word are provided in the main pages of the volume. [1]
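
The economy of the example above can be put in numbers with a short sketch. Counting whitespace-separated tokens is an assumption made here for illustration; actual telegraph tariffs had their own rules for counting figures and long words.

```python
def word_count(message):
    """Count whitespace-separated tokens (an assumed, simplified tariff rule)."""
    return len(message.split())


plain = "Have bought for your account 400 bales of cotton, March delivery, at 8.34"
coded = "Ciotola, Delaboravi"

saved = word_count(plain) - word_count(coded)
```

Under this simplified count the plain message is 13 words and the coded one 2, a saving of 11 paid words.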

[![Liebers
P1016847.JPG](/wiki/images/4/41/Liebers_P1016847.JPG)](/wiki/index.php?title=File:Liebers_P1016847.JPG)

[![Liebers
P1016859.JPG](/wiki/images/3/35/Liebers_P1016859.JPG)](/wiki/index.php?title=File:Liebers_P1016859.JPG)

[![Liebers
P1016861.JPG](/wiki/images/3/34/Liebers_P1016861.JPG)](/wiki/index.php?title=File:Liebers_P1016861.JPG)

[![Liebers
P1016869.JPG](/wiki/images/f/fd/Liebers_P1016869.JPG)](/wiki/index.php?title=File:Liebers_P1016869.JPG)

> [...] my focus in this chapter is on the inscription technology that grew
parasitically alongside the monopolistic pricing strategies of telegraph
companies: telegraph code books. Constructed under the bywords “economy,”
“secrecy,” and “simplicity,” telegraph code books matched phrases and words
with code letters or numbers. The idea was to use a single code word instead
of an entire phrase, thus saving money by serving as an information
compression technology. Generally economy won out over secrecy, but in
specialized cases, secrecy was also important.[2]

In Katherine Hayles' chapter devoted to telegraph code books she observes how:

> The interaction between code and language shows a steady movement away from
a human-centric view of code toward a machine-centric view, thus anticipating
the development of full-fledged machine codes with the digital computer. [3]

[![Liebers
P1016851.JPG](/wiki/images/1/13/Liebers_P1016851.JPG)](/wiki/index.php?title=File:Liebers_P1016851.JPG)

Aspects of this transitional moment are apparent in a notice prominently
inserted in Lieber's code book:

> After July, 1904, all combinations of letters that do not exceed ten will
pass as one cipher word, provided that it is pronounceable, or that it is
taken from the following languages: English, French, German, Dutch, Spanish,
Portuguese or Latin -- International Telegraphic Conference, July 1903 [4]

Conforming to international conventions regulating telegraph communication at
that time, the stipulation that code words be actual words drawn from a
variety of European languages (many of Lieber's code words are indeed
arbitrary Dutch, German, and Spanish words) underscores this particular moment
of transition as reference to the human body in the form of "pronounceable"
speech from representative languages begins to yield to the inherent potential
for arbitrariness in digital representation.

What telegraph code books do is remind us of the relation of language in
general to economy. Whether these are economies of memory, attention, costs
paid to a telecommunications company, or computer processing time
or storage space, encoding language or knowledge in any form of writing is a
form of shorthand and always involves an interplay with what one expects to
perform or "get out" of the resulting encoding.

> Along with the invention of telegraphic codes comes a paradox that John
Guillory has noted: code can be used both to clarify and occlude. Among the
sedimented structures in the technological unconscious is the dream of a
universal language. Uniting the world in networks of communication that
flashed faster than ever before, telegraphy was particularly suited to the
idea that intercultural communication could become almost effortless. In this
utopian vision, the effects of continuous reciprocal causality expand to
global proportions capable of radically transforming the conditions of human
life. That these dreams were never realized seems, in retrospect, inevitable.
[5]

[![Liebers
P1016884.JPG](/wiki/images/9/9c/Liebers_P1016884.JPG)](/wiki/index.php?title=File:Liebers_P1016884.JPG)

[![Liebers
P1016852.JPG](/wiki/images/7/74/Liebers_P1016852.JPG)](/wiki/index.php?title=File:Liebers_P1016852.JPG)

[![Liebers
P1016880.JPG](/wiki/images/1/11/Liebers_P1016880.JPG)](/wiki/index.php?title=File:Liebers_P1016880.JPG)

Far from providing a universal system of encoding messages in the English
language, Lieber's code is quite clearly designed for the particular needs and
conditions of its use. In addition to the phrases ordered by keywords, the
book includes a number of tables of terms for specialized use. One table lists
a set of words used to describe all possible permutations of numeric grades of
coffee (Choliam = 3,4, Choliambos = 3,4,5, Choliba = 4,5, etc.); another table
lists pairs of code words to express the respective daily rise or fall of the
price of coffee at the port of Le Havre in increments of a quarter of a Franc
per 50 kilos ("Chirriado = prices have advanced 1 1/4 francs"). From an
archaeological perspective, Lieber's code book reveals a cross section of
the needs and desires of early 20th century business communication between the
United States and its trading partners.
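
The specialized tables behave like small lookup dictionaries. The sketch below uses only the entries quoted above; the function names and the shape of the data are assumptions for illustration, not a reconstruction of the book's full tables.

```python
# Entries quoted in the text above; the real tables are far larger.
GRADE_CODES = {
    "Choliam": (3, 4),
    "Choliambos": (3, 4, 5),
    "Choliba": (4, 5),
}
PRICE_CODES = {
    "Chirriado": "prices have advanced 1 1/4 francs",
}


def decode(code_word):
    """Expand a code word; unknown words pass through unchanged."""
    if code_word in GRADE_CODES:
        grades = ", ".join(str(g) for g in GRADE_CODES[code_word])
        return f"coffee grades {grades}"
    return PRICE_CODES.get(code_word, code_word)


def encode_grades(grades):
    """Find the single code word standing for a combination of grades."""
    for word, value in GRADE_CODES.items():
        if value == tuple(grades):
            return word
    return None
```

For example, `decode("Choliambos")` gives `"coffee grades 3, 4, 5"`, and `encode_grades([4, 5])` gives `"Choliba"`. The table only works for the combinations its compilers anticipated, which is what ties the code so closely to its conditions of use.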

The advertisements lining Lieber's code book further situate its use and
that of commercial telegraphy. Among the many advertisements for banking and
law services, office equipment, and alcohol are several ads for gunpowder and
explosives, drilling equipment and metallurgic services, all with specific
applications to mining. Extending telegraphy's formative role for ship-to-
shore and ship-to-ship communication for reasons of safety, commercial
telegraphy extended this network of communication to include those parties
coordinating the "raw materials" being mined, grown, or otherwise extracted
from overseas sources and shipped back for sale.

## "Raw data now!"

From [La ville intelligente - Ville de la connaissance](/wiki/index.php?title=La_ville_intelligente_-_Ville_de_la_connaissance "La ville intelligente - Ville de la connaissance"):

Étant donné que les nouvelles formes modernistes et l'utilisation de matériaux
propageaient l'abondance d'éléments décoratifs, Paul Otlet croyait en la
possibilité du langage comme modèle de « [données
brutes](/wiki/index.php?title=Bag_of_words "Bag of words") », le réduisant aux
informations essentielles et aux faits sans ambiguïté, tout en se débarrassant
de tous les éléments inefficaces et subjectifs.


From [The Smart City - City of Knowledge](/wiki/index.php?title=The_Smart_City_-_City_of_Knowledge "The Smart City - City of Knowledge"):

As new modernist forms and use of materials propagated the abundance of
decorative elements, Otlet believed in the possibility of language as a model
of '[raw data](/wiki/index.php?title=Bag_of_words "Bag of words")', reducing
it to essential information and unambiguous facts, while removing all
inefficient assets of ambiguity or subjectivity.


> Tim Berners-Lee: [...] Make a beautiful website, but first give us the
unadulterated data, we want the data. We want unadulterated data. OK, we have
to ask for raw data now. And I'm going to ask you to practice that, OK? Can
you say "raw"?

>

> Audience: Raw.

>

> Tim Berners-Lee: Can you say "data"?

>

> Audience: Data.

>

> TBL: Can you say "now"?

>

> Audience: Now!

>

> TBL: Alright, "raw data now"!

>

> [...]

>

> So, we're at the stage now where we have to do this -- the people who think
it's a great idea. And all the people -- and I think there's a lot of people
at TED who do things because -- even though there's not an immediate return on
the investment because it will only really pay off when everybody else has
done it -- they'll do it because they're the sort of person who just does
things which would be good if everybody else did them. OK, so it's called
linked data. I want you to make it. I want you to demand it. [6]

## Un/Structured

As graduate students at Stanford, Sergey Brin and Lawrence (Larry) Page had an
early interest in producing "structured data" from the "unstructured" web. [7]

> The World Wide Web provides a vast source of information of almost all
types, ranging from DNA databases to resumes to lists of favorite restaurants.
However, this information is often scattered among many web servers and hosts,
using many different formats. If these chunks of information could be
extracted from the World Wide Web and integrated into a structured form, they
would form an unprecedented source of information. It would include the
largest international directory of people, the largest and most diverse
databases of products, the greatest bibliography of academic works, and many
other useful resources. [...]

>

> **2.1 The Problem**
> Here we define our problem more formally:
> Let D be a large database of unstructured information such as the World
Wide Web [...] [8]

In a paper titled _Dynamic Data Mining_, Brin and Page situate their research
as a search for _rules_ (statistical correlations) between words used in web
pages. The "baskets" they mention stem from "market basket" techniques,
originally developed to find correlations between the items recorded in the
purchase receipts of supermarket customers. In their case, they deal with web
pages rather than shopping baskets, and words instead of purchases. In
transitioning to the much larger scale of the web, they describe the
usefulness of their research in terms of its computational economy: the
ability to tackle the scale of the web and still complete the task in a
reasonably short amount of time using contemporary computing power.
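
The "market basket" idea, with web pages as baskets and words as items, can be sketched as a naive pair count. This is an assumption-laden illustration and explicitly not Brin and Page's method: their paper is concerned with sampling precisely because exhaustive counting of this kind does not scale. The function name, the toy pages, and the threshold are hypothetical.

```python
from collections import Counter
from itertools import combinations


def frequent_pairs(baskets, min_support):
    """Count in how many baskets each pair of items co-occurs."""
    counts = Counter()
    for basket in baskets:
        # Each distinct pair counts at most once per basket.
        for pair in combinations(sorted(set(basket)), 2):
            counts[pair] += 1
    return {pair: n for pair, n in counts.items() if n >= min_support}


# Toy "web pages": each one is a basket of words.
pages = [
    "raw data now".split(),
    "linked data now".split(),
    "raw cotton bales".split(),
]

result = frequent_pairs(pages, min_support=2)
```

Here only the pair `("data", "now")` reaches the support threshold. The number of candidate pairs grows with the square of the vocabulary, and larger itemsets grow faster still, which is the computational economy the paper addresses.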

> A traditional algorithm could not compute the large itemsets in the lifetime
of the universe. [...] Yet many data sets are difficult to mine because they
have many frequently occurring items, complex relationships between the items,
and a large number of items per basket. In this paper we experiment with word
usage in documents on the World Wide Web (see Section 4.2 for details about
this data set). This data set is fundamentally different from a supermarket
data set. Each document has roughly 150 distinct words on average, as compared
to roughly 10 items for cash register transactions. We restrict ourselves to a
subset of about 24 million documents from the web. This set of documents
contains over 14 million distinct words, with tens of thousands of them
occurring above a reasonable support threshold. Very many sets of these words
are highly correlated and occur often. [9]

## Un/Ordered

In programming, I've encountered a recurring "problem" that's quite
symptomatic. It goes something like this: you (the programmer) have managed to
cobble together a lovely "content management system" (either from scratch, or
using any number of helpful frameworks) where your user can enter some "items"
into a database, for instance to store bookmarks. These items are then
automatically presented, ordered, in list form (say on a web page). The author:
It's great, except... could this bookmark come before that one? The problem
stems from the fact that the database ordering (a core functionality provided
by any database) somehow applies a sorting logic that's almost but not quite
right. A typical example is the sorting of names where details (where to place
a name that starts with a Norwegian "Ø" for instance) are language-specific,
and when a mixture of languages occurs, no single ordering is necessarily
"correct". The (often) exasperated programmer might hastily add an additional
database field so that each item can also have an "order" (perhaps in the form
of a date or some other kind of (alpha)numerical "sorting" value) to be used
to correctly order the resulting list. Now the author has a means, awkward and
indirect but workable, to control the order of the presented data on the start
page. But one might well ask, why not just edit the resulting listing as a
document? Not possible! Contemporary content management systems are based on a
data flow from a "pure" source of a database, through controlling code and
templates to produce a document as a result. The document isn't the data, it's
the end result of an irreversible process. This problem, in this and many
variants, is widespread and reveals an essential backwardness in a
particular "computer scientist" mindset about what constitutes "data",
and in particular its relationship to order, that turns what might be a
straightforward question of editing a document into an over-engineered
database.
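
A minimal sketch of the situation described above, with plain Python lists standing in for the database: the names, the `order` field, and its values are hypothetical. Python's default string sort compares Unicode code points, which is one of many possible "almost but not quite right" orderings.

```python
names = ["Zimmer", "Østby", "Olsen"]

# Default sort compares code points: "Ø" (U+00D8) lands after every
# unaccented letter, so after "Z". That matches the Norwegian alphabet,
# but not a reader who expects "Ø" to be filed near "O".
by_code_point = sorted(names)

# The workaround: an extra, hand-maintained "order" field per item.
bookmarks = [
    {"title": "Zimmer", "order": 3},
    {"title": "Østby", "order": 2},
    {"title": "Olsen", "order": 1},
]
by_hand = [b["title"] for b in sorted(bookmarks, key=lambda b: b["order"])]
```

`by_code_point` is `["Olsen", "Zimmer", "Østby"]`, while `by_hand` is `["Olsen", "Østby", "Zimmer"]`. The second list is whatever the author says it is, but only by way of a number stored next to each item rather than by editing the list itself.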

While I was recently working with Nikolaos Vogiatzis, whose research explores
playful and radically subjective alternatives to the list, Vogiatzis was struck
by how the earliest specifications of HTML (still valid today) have separate
elements (OL and UL) for "ordered" and "unordered" lists.

> The representation of the list is not defined here, but a bulleted list for
unordered lists, and a sequence of numbered paragraphs for an ordered list
would be quite appropriate. Other possibilities for interactive display
include embedded scrollable browse panels. [10]

Vogiatzis' surprise lay in the idea of a list ever being considered
"unordered" (or in opposition to the language used in the specification, for
order to ever be considered "insignificant"). Indeed in its suggested
representation, still followed by modern web browsers, the only difference
between the two visually is that UL items are preceded by a bullet symbol,
while OL items are numbered.

The idea of ordering runs deep in programming practice where essentially
different data structures are employed depending on whether order is to be
maintained. The indexes of a "hash" table, for instance (also known as an
associative array), are ordered in an unpredictable way governed by a
representation's particular implementation. This data structure, extremely
prevalent in contemporary programming practice, sacrifices order to offer other
kinds of efficiency (fast text-based retrieval for instance).
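
The split between order-keeping and order-discarding structures can be sketched in Python, with the caveat that details vary by language and version: Python's hash-based `set` gives no ordering guarantee, whereas its `dict` has preserved insertion order since Python 3.7. The variable names and values are illustrative only.

```python
words_as_list = ["raw", "data", "now"]
reordered_list = ["now", "raw", "data"]

# A list is a sequence: the same words in another order are a different list.
lists_equal = words_as_list == reordered_list

# A set is hash-based: membership matters, order does not.
sets_equal = set(words_as_list) == set(reordered_list)

# What the hash buys in exchange: direct retrieval by text key,
# without scanning through the items one by one.
index = {"raw": 1, "data": 2, "now": 3}
found = index["data"]
```

Here `lists_equal` is `False` and `sets_equal` is `True`; iterating over the set may yield the words in any order the implementation happens to produce.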

## Data mining

In announcing Google's impending data center in Mons, Belgian prime minister
Di Rupo invoked the link between the history of the mining industry in the
region and the present and future interest in "data mining" as practiced by IT
companies such as Google.

Whether speaking of bales of cotton, barrels of oil, or bags of words, what
links these subjects is the way in which the notion of "raw material" obscures
the labor and power structures employed to secure them. "Raw" is always
relative: "purity" depends on processes of "refinement" that typically carry
social/ecological impact.

Stripping language of order is an act of "disembodiment", detaching it from
the acts of writing and reading. The shift from (human) reading to machine
reading involves a shift of responsibility from the individual human body to
the obscured responsibilities and seemingly inevitable forces of the
"machine", be it the machine of a market or the machine of an algorithm.

From [X = Y](/wiki/index.php?title=X_%3D_Y "X = Y"):

Still, it is reassuring to know that the products hold traces of the work,
that even with the progressive removal of human signs in automated processes,
the workers' presence never disappears completely. This presence is proof of
the materiality of information production, and becomes a sign of the economies
and paradigms of efficiency and profitability that are involved.


The computer scientists' view of textual content as "unstructured", be it in a
webpage or the OCR scanned pages of a book, reflects a neglect of the
processes and labor of writing, editing, design, layout, typesetting, and
eventually publishing, collecting and cataloging [11].

"Unstructured", to the computer scientist, means non-conformant to particular
forms of machine reading. "Structuring" then is a social process by which
particular (additional) conventions are agreed upon and employed. Computer
scientists often view text through the eyes of their particular reading
algorithm, and in the process (voluntarily) blind themselves to the work
practices which have produced and maintain these "resources".

Berners-Lee, in exhorting his audience of web publishers not only to publish
online but to release "unadulterated" data, betrays a lack of imagination in
considering how language is itself structured and a blindness to the need for
more than additional technical standards to connect to existing publishing
practices.

Last Revision: 2*08*2016

1. ↑ Benjamin Franklin Lieber, Lieber's Standard Telegraphic Code, 1896, New York
2. ↑ Katherine Hayles, "Technogenesis in Action: Telegraph Code Books and the Place of the Human", How We Think: Digital Media and Contemporary Technogenesis, 2012
3. ↑ Hayles
4. ↑ Lieber's
5. ↑ Hayles
6. ↑ Tim Berners-Lee: The next web, TED Talk, February 2009
7. ↑ "Research on the Web seems to be fashionable these days and I guess I'm no exception." from Brin's [Stanford webpage](http://infolab.stanford.edu/~sergey/)
8. ↑ Extracting Patterns and Relations from the World Wide Web, Sergey Brin, Proceedings of the WebDB Workshop at EDBT 1998,
9. ↑ Dynamic Data Mining: Exploring Large Rule Spaces by Sampling; Sergey Brin and Lawrence Page, 1998; p. 2
10. ↑ Hypertext Markup Language (HTML): "Internet Draft", Tim Berners-Lee and Daniel Connolly, June 1993,
11. ↑

Retrieved from
[https://www.mondotheque.be/wiki/index.php?title=A_bag_but_is_language_nothing_of_words&oldid=8480](https://www.mondotheque.be/wiki/index.php?title=A_bag_but_is_language_nothing_of_words&oldid=8480)

Stankievech
Letter to the Superior Court of Quebec Regarding Arg.org
2016


Letter to the Superior Court of Quebec Regarding Arg.org

Charles Stankievech
19 January 2016

To the Superior Court of Quebec:
I am writing in support of the online community and library platform called “Arg.org” (also known under additional aliases and
urls including “aaaaarg.org,” “grr.aaaaarg.org,” and most recently
“grr.aaaaarg.fail”). It is my understanding that a copyright infringement lawsuit has been leveled against two individuals who
support this community logistically. This letter will address what
I believe to be the value of Arg.org to a variety of communities
and individuals; it is written to encompass my perspective on the
issue from three distinct positions: (1) As Director of the Visual
Studies Program, Faculty of Architecture, Landscape, and Design,
University of Toronto, where I am a professor and oversee three
degree streams for both graduate and undergraduate students;
(2) As the co-director of an independent publishing house based
in Berlin, Germany, and Toronto, Canada, which works with international institutions around the world; (3) As a scholar and writer
who has published in a variety of well-regarded international
journals and presses. While I outline my perspective in relation to
these professional positions below, please note that I would also
be willing to testify via video-conference to further articulate
my assessment of Arg.org’s contribution to a diverse international
community of artists, scholars, and independent researchers.

Essay continuing from page 49

“Warburgian tradition.”47 If we consider the Warburg Library
in its simultaneous role as a contained space and the reflection
of an idiosyncratic mental energy, General Stumm’s aforementioned feeling of “entering an enormous brain” seems an
especially concise description. Indeed, for Saxl the librarian,
“the books remain a body of living thought as Warburg had
planned,”48 showing “the limits and contents of his scholarly
worlds.”49 Developed as a research tool to solve a particular
intellectual problem—and comparable on a number of levels
to exhibition-led inquiry—Aby Warburg’s organically structured, themed library is a three-dimensional instance of a library that performatively articulates and potentiates itself,
which is not yet to say exhibits, as both spatial occupation and
conceptual arrangement, where the order of things emerges
experimentally, and in changing versions, from the collection
and its unusual cataloging.50

47 Saxl speaks of “many tentative and personal excrescences” (“The History of
Warburg’s Library,” 331). When Warburg fell ill in 1920 with a subsequent four-year absence, the library was continued by Saxl and Gertrud Bing, the new and
later closest assistant. Despite the many helpers, according to Saxl, Warburg always
remained the boss: “everything had the character of a private book collection, where
the master of the house had to see it in person that the bills were paid in time,
that the bookbinder chose the right material, or that neither he nor the carpenter
delivering a new shelf over-charged” (Ibid., 329).

48 Ibid., 331.

49 Ibid., 329.

50 A noteworthy aside: Gertrud Bing was in charge of keeping a meticulous index of
names and keywords; evoking the library catalog of Borges’s fiction, Warburg even
kept an “index of un-indexed books.” See Diers, “Porträt aus Büchern,” 21.


1. Arg.org supports a collective & semiprivate community of
academics & intellectuals.
As the director of a graduate-level research program at the University of Toronto, I have witnessed first-hand the evolution
of academic research. Arg.org has fostered a vibrant community
of thinkers, students, and writers, who can share their research
and create new opportunities for collaboration and learning
because of the knowledge infrastructure provided by the platform.
The accusation of copyright infringement leveled against the
community misses the point of the research platform altogether.
While there are texts made available for download at no expense
through the Arg.org website, it is essential to note that these texts
are not advertised, nor are they accessible to the general public.
Arg.org is a private community whose sharing platform can only
be accessed by invitation. Such modes of sharing have always
existed in academic communities; for example, when a group of
professors would share Xerox copies of articles they want to read
together as part of a collaborative research project. Likewise,
it would be hard to imagine a community of readers at any time
in history without the frequent lending and sharing of books.
From this perspective, Arg.org should be understood within a
twenty-first century digital ethos, where the sharing of intellectual
property and the generation of derivative IP occurs through collaborative platforms. On this point, I want to draw further attention
to two fundamental aspects of Arg.org.
a. One essential feature of the Arg.org platform is that it gives
invited users the ability to create reading lists from available texts—
what are called on the website “collections.” These collections
are made up of curated folders containing text files (usually in
Portable Document Format); such collections allow for new and
novel associations of texts, and the development of working
bibliographies that assist in research. Users can discover previously unfamiliar materials—including entire books and excerpted
chapters, essays, and articles—through these shared collections.
Based on the popularity of previous collections I have personally
assembled on the Arg.org platform, I have been invited to give

In the Memory Hall of Reproductions
Several photographs document how the Warburg Library was
also a backdrop for Warburg’s picture panels, the wood boards
lined with black fabric, which, not unlike contemporary mood
boards, held the visual compositions he would assemble and
re-assemble from around 2,000 photographs, postcards, and
printed reproductions cut out of books and newspapers.
Sometimes accompanied by written labels or short descriptions, the panels served as both public displays and research-in-process, and were themselves photographed with the aim
to eventually be disseminated as book pages in publications.
In the end, not every publishing venture was realized, and
most panels themselves were even lost along the way; in fact,
today, the panel photographs are the only visual remainder of
this type of research from the Warburg Institute. Probably the
most acclaimed of the panels are those which Warburg developed in close collaboration with his staff during the last years
of his life and from which he intended to create a sequential
picture atlas of human memory referred to as the Mnemosyne
Atlas. Again defying the classical boundaries of the disciplines, Warburg had appropriated visual material from the
archives of art history, natural philosophy, and science to
vividly evoke and articulate his thesis through the creation of
unprecedented associations. Drawing an interesting analogy,
the following statement from Warburg scholar Kurt Forster
underlines the importance of the panels for the creation of
meaning:
Warburg’s panels belong into the realm of the montage à la Schwitters or Lissitzky. Evidently, such a


guest lectures at various international venues; such invitations
demonstrate that this cognitive work is considered original
research and a valuable intellectual exercise worthy of further
discussion.
b. The texts uploaded to the Arg.org platform are typically documents scanned from the personal libraries of users who have
already purchased the material. As a result, many of the documents are combinations of the original published text and annotations or notes from the reader. Commentary is a practice that
has been occurring for centuries; in Medieval times, the technique
of adding commentary directly onto a published page for future
readers to read alongside the original writing was called “Glossing.”
Much of the philosophy, theology, and even scientific theories
were originally produced in the margins of other texts. For example, in her translation and publication of Charles Babbage’s lecture
on the theory of the first computer, Ada Lovelace had more notes
than the original lecture. Even though the text was subsequently
published as Babbage’s work, today modern scholarship acknowledges Lovelace as an important voice in the theorization of the
modern computer due to these vital marginal notes.
2. Arg.org supports small presses.
Since 2011, I have been the co-founder and co-director of
K. Verlag, an independent press based in Berlin, Germany, and
Toronto, Canada. The press publishes academic books on art
and culture, as well as specialty books on art exhibitions. I am
aware of the difficulties faced by small presses in terms of
profitability, especially given fears that the sharing of books online
could further hurt book sales; my experience, however, has been
in the opposite direction. At K. Verlag, we actually upload our new
publications directly to Arg.org because we know the platform
reaches an important community of readers and thinkers. While we are fully
conscious of the uniqueness of printed books and their importance, the digital circulation of ebooks and scanned physical books
presents a range of different possibilities for reaching our audiences
in a variety of ways. Some members of Arg.org may be too

comparison does not need to claim artistic qualities
for Warburg’s panels, nor does it deny them regarding
Schwitters’s or Lissitzky’s collages. It simply lifts the
role of graphic montage from the realm of the formal
into the realm of the construction of meaning.51
Interestingly, even if Forster makes a point not to categorize
Warburg’s practice as art, in twentieth-century art theory and
visual culture scholarship, his idiosyncratic technique has
evidently been mostly associated with art practice. In fact,
insofar as Warburg is acknowledged (together with Marcel
Duchamp and, perhaps, the less well-known André Malraux),
it is as one of the most important predecessors for artists
working with the archive.52 Forster articulates the traditional
assumption that only artists were “allowed” to establish idiosyncratic approaches and think with objects outside of the
box. However, within the relatively new discourse of the
“curatorial,” contra the role of the “curator,” the curatorial
delineates its territory as that which is no longer defined exclusively by what the curator does (i.e. responsibilities of classification and care) but rather as a particular agency in terms of
epistemologically and spatially working with existing materials and collections. Consequently, figures such as Warburg
51 Kurt Forster, quoted in Benjamin H.D. Buchloh, “Gerhard Richter’s Atlas: Das anomische Archiv,” in Paradigma Fotografie: Fotokritik am Ende des fotografischen Zeitalters,
ed. Herta Wolf (Frankfurt/M.: Suhrkamp Verlag, 2002), 407, with further references.
52 One such example is the Atlas begun by Gerhard Richter in 1962; another is
Thomas Hirschhorn’s large-format, mixed-media collage series MAPS. Entitled
Foucault-Map (2008), The Map of Friendship Between Art and Philosophy (2007),
and Hannah-Arendt-Map (2003), these works are partly made in collaboration
with the philosopher Marcus Steinweg. They bring a diverse array of archival and
personal documents or small objects into associative proximities and reflect the
complex impact philosophy has had on Hirschhorn’s art and thinking.


poor to afford to buy our books (e.g., students with increasing debt,
precarious artists, or scholars in countries lacking accessible
infrastructures for high-level academic research). We also realize
that Arg.org is a library-community built over years; the site
connects us to communities and individuals making original work
and we are excited if our books are shared by the writers, readers,
and artists who actively support the platform. Meanwhile, we
have also seen that readers frequently discover books from our
press through a collection of books on Arg.org, download the
book for free to browse it, and nevertheless go on to order a print
copy from our shop. Even when this is not the case, we believe
in the environmental benefit of Arg.org; printing a book uses
valuable resources and then requires additional shipping around
the world—these practices contradict our desire for the broadest
dissemination of knowledge through the most environmentally-conscious of means.
3. Arg.org supports both official institutional academics
& independent researchers.
As a professor at the University of Toronto, I have access to one
of the best library infrastructures in the world. In addition to
core services, this includes a large number of specialty libraries,
archives, and massive online resources for research. Such
an investment by the administration of the university is essential
to support the advanced research conducted in the numerous
graduate programs and by research chairs. However, there are
at least four ways in which the official, sanctioned access to these
library resources can at times fall short.
a. Physical limitations. While the library might have several copies
of a single book to accommodate demand, it is often the case
that these copies are simultaneously checked out and therefore
not available when needed for teaching or writing. Furthermore,
the contemporary academic is required to constantly travel for
conferences, lectures, and other research obligations, but travelling with a library is not possible. Frequently while I am working
abroad, I access Arg.org to find a book which I have previously

and Malraux, who thought apropos objects in space (even
when those objects are dematerialized as reproductions),
become productive forerunners across a range of fields: from
art, through cultural studies and art history, to the curatorial.
Essential to Warburg’s library and Mnemosyne Atlas, but
not yet articulated explicitly, is that the practice of constructing two-dimensional, heterogeneous image clusters shifts the
value between an original work of art and its mechanical
reproduction, anticipating Walter Benjamin’s essay written a
decade later.53 While a museum would normally exhibit an
original of Albrecht Dürer’s Melencolia I (1514) so it could be
contemplated aesthetically (admitting that even as an etching
it is ultimately a form of reproduction), when inserted as a
quotidian reprint into a Warburgian constellation and exhibited within a library, its “auratic singularity”54 is purposefully
challenged. Favored instead is the iconography of the image,
which is highlighted by way of its embeddedness within a
larger (visual-emotional-intellectual) economy of human consciousness.55 As it receives its impetus from the interstices
53 One of the points Benjamin makes in “The Artwork in the Age of Mechanical
Reproduction” is that reproducibility increases the “exhibition value” of a work of art,
meaning its relationship to being viewed is suddenly valued higher than its
relationship to tradition and ritual (“cult value”); a process which, as Benjamin writes,
nevertheless engenders a new “cult” of remembrance and melancholy (224–26).
54 Benjamin defines “aura” as the “here and now” of an object, that is, as its spatial,
temporal, and physical presence, and above all, its uniqueness—which in his
opinion is lost through reproduction. Ibid., 222.
55 It is worth noting that Warburg wrote his professorial dissertation on Albrecht
Dürer. Another central field of his study was astrology, which Warburg examined
from historical and philosophical perspectives. It is thus not surprising to find
out that Dürer’s Melencolia I (1514), addressing the relationship between the
human and the cosmos, was of the highest significance to Warburg as a recurring
theme. The etching is shown, for instance, as image 8 of Plate 58, “Kosmologie bei
Dürer” (Cosmology in Dürer); reproduced in Warnke, ed., Aby Moritz Warburg:
Der Bilderatlas Mnemosyne, Gesammelte Schriften, Vol. 1, 106–7. The connections


purchased, and which is on my bookshelf at home, but which
is not in my suitcase. Thus, the Arg.org platform acts as a patch
for times when access to physical books is limited—although
these books have been purchased (either by the library or the
reader herself) and the publisher is not being cheated of profit.
b. Lack of institutional affiliation. The course of one’s academic
career is rarely smooth and is increasingly precarious in today’s
shift to a greater base of contract sessional instructors. When
I have been in between institutions, I have lost access to the library
resources upon which my research and scholarship depended.
So, although academic publishing functions in accord with library
acquisitions, there are countless intellectuals—some of whom
are temporary hires or in between job appointments, others of whom
are looking for work—who thus do not have access to libraries.
In this position, I would resort to asking colleagues and friends
to share their access or help me by downloading articles through
their respective institutional portals. Arg.org helps to relieve
this precarity through a shared library which allows scholarship
to continue; Arg.org is thus best described as a community of
readers who share their research and legally-acquired resources
so that when someone is researching a specific topic, the adequate book/essay can be found to fulfill the academic argument.
c. Special circumstances of non-traditional education. Several
years ago, I co-founded the Yukon School of Visual Arts in
Dawson City as a joint venture between an Indigenous government and the State college. Because we were a tiny school,
we did not fit into the typical academic brackets regarding student
population, nor could we access the sliding scale economics
of academic publishers. Even the tiniest package for
a “small” academic institution would be thousands of times larger
than our population and budget. As a result, neither I
nor my students could access the essential academic resources
required for a post-secondary education. I attempted to solve this
problem by forging partnerships, pulling in favors, and accessing
resources through platforms like Arg.org. It is important to realize

among text and image, visual display and publishing, the
expansive space of the library and the dense volume of the
book, Aby Warburg’s wide-ranging work appears to be best
summarized by the title of one of the Mnemosyne plates:
“Book Browsing as a Reading of the Universe.”56

To the Paper Museum
Warburg had already died before Benjamin theorized the
impact of mechanical reproduction on art in 1935. But it is
Malraux who claims to have embarked on a lengthy, multipart project about similitudes in the artistic heritage of the
world in exactly the same year, and for whom, in opposition
to the architectonic space of the museum, photographic
reproduction, montage, and the book are the decisive filters
through which one sees the world. At the outset of his book
Le Musée imaginaire (first published in 1947),57 Malraux argues
that the secular modern museum has been crucial in reframing and transforming objects into art, both by displacing
them from their original sacred or ritual context and purpose,
and by bringing them into proximity and adjacency
with one another, thereby opening new possible readings

and analogies between Warburg’s image-based research and his theoretical ideas,
and von Trier’s Melancholia, are striking; see Anna-Sophie Springer’s visual essay
“Reading Rooms Reading Machines” on p. 91 of this book.
56 “Buchblättern als Lesen des Universums,” Plate 23a, reproduced in Warnke, Aby
Moritz Warburg: Der Bilderatlas Mnemosyne, Gesammelte Schriften, Vol. 1, 38–9.
57 The title of the English translation, The Museum Without Walls, by Stuart Gilbert
and Francis Price (London: Secker & Warburg, 1967), must be read in reference
to Erasmus’s envisioning of a “library without walls,” made possible through the
invention of the printing press, as Anthony Grafton mentions in his lecture, “The
Crisis of Reading,” The CUNY Graduate Center, New York, 10 November 2014.


that Arg.org was founded to meet these grassroots needs; the
platform supports a vast number of educational efforts, including
co-research projects, self-organized reading groups, and numerous other non-traditional workshops and initiatives.
d. My own writing on Arg.org. While using the platform, I have frequently come across my own essays and publications on the
site; although I often upload copies of my work to Arg.org myself,
these copies had been uploaded by other users. I was delighted
to see that other users found my publications to be of value and
were sharing my work through their curated “collections.” In some
cases, I held outright exclusive copyright on the text and I was
pleased it was being distributed. In other rare cases, I shared the
copyright or was forced to surrender my IP prior to publication;
I was still happy to see this type of document uploaded. I realize
it is not within my authority to grant copyright that is shared;
however, the power structure of contemporary publishing is often
abusive towards the writer. Massive, for-profit corporations have
dominated the publishing of academic texts and, as a result of
their power, have bullied young academics into signing away their
IP in exchange for publication. Even the librarians at Harvard
University—who spend over $3.75 million USD annually on journal subscriptions alone—believe that the economy of academic
publishing and bullying by a few giants has crossed a line, to the
point where they are boycotting certain publishers and encouraging faculty to publish instead in open access journals.
I want to conclude my letter of support by affirming that
Arg.org is at the cutting edge of academic research and knowledge
production. Sean Dockray, one of the developers of Arg.org,
is internationally recognized as a leading thinker regarding the
changing nature of research through digital platforms; he is regularly invited to academic conferences to discuss how the community on the Arg.org platform is experimenting with digital research.
Reading, publishing, researching, and writing are all changing
rapidly as networked digital culture influences professional and
academic life more and more frequently. Yet, our legal frameworks and business models are always slower than the practices

(“metamorphoses”) of individual objects—and, even more
critically, producing the general category of art itself. As
exceptions to this process, Malraux names those creations that
are so embedded in their original architecture that they defy
relocation in the museum (such as church windows, frescoes,
or monuments); this restriction of scale and transportation, in
fact, resulted in a consistent privileging of painting and sculpture within the museological apparatus.58
Long before networked societies, with instant Google
Image searches and prolific photo blogs, Malraux dedicated
himself to the difficulty of accessing works and oeuvres
distributed throughout an international topography of institutions. He located a revolutionary solution in the dematerialization and multiplication of visual art through photography
and print, and, above all, proclaimed that an imaginary museum
based on reproductions would enable the completion of a
meaningful collection of artworks initiated by the traditional
museum.59 Echoing Benjamin’s theory regarding the power of
the reproduction to change how art is perceived, Malraux
writes, “Reproduction is not the origin but a decisive means
for the process of intellectualization to which we subject art.
58 I thank the visual culture scholar Antonia von Schöning for pointing me to
Malraux after reading my previous considerations of the book-as-exhibition. Von
Schöning herself is author of the essay “Die universelle Verwandtschaft zwischen
den Bildern: André Malraux’ Musée Imaginaire als Familienalbum der Kunst,”
kunsttexte.de, April 2012, edoc.hu-berlin.de/kunsttexte/2012-1/von-schoening-antonia-5/PDF/von-schoening.pdf.
59 André Malraux, Psychologie der Kunst: Das imaginäre Museum (Baden-Baden:
Woldemar Klein Verlag, 1949), 9; see also Rosalind Krauss, “The Ministry of
Fate,” in A New History of French Literature, ed. Denis Hollier (Cambridge, MA
and London: Harvard University Press, 1989), 1000–6: “The photographic archive
itself, insofar as it is the locale of a potentially complete assemblage of world
artifacts, is a repository of knowledge in a way that no individual museum could
ever be” (1001).


of artists and technologists. Arg.org is a non-profit intellectual
venture and should therefore be considered as an artistic experiment, a pedagogical project, and an online community of co-researchers; it should not be subject to the same legal judgments
designed to thwart greedy profiteers and abusive practices.
There are certainly some documents to be found on Arg.org that
have been obtained by questionable or illegal means—every
Web 2.0 platform is bound to find such examples, from YouTube
to Facebook; however, such examples occur as a result of a small
number of participant users, not because of two dedicated individuals who logistically support the platform. A strength of Arg.org
and a source of its experimental vibrancy is its lack of policing,
which fosters a sense of freedom and anonymity which are both
vital elements for research within a democratic society and
the foundations of any library system. As a result of this freedom,
there are sometimes violations of copyright. However, since
Arg.org is a committed, non-profit community-library, such transgressions occur within a spirit of sharing and fair use that characterizes this intellectual community. This sharing is quite different
from the popular platform Academia.edu, which is searchable
by non-users and acquires value by monetizing its articles through
the sale of digital advertising space and a nontransparent investment exit strategy. Arg.org is the antithesis of such a model
and instead fosters a community of learning through its platform.
Please do not hesitate to contact me for further information,
or to testify as a witness.
Regards,
Charles Stankievech,
Director of Visual Studies Program, University of Toronto
Co-Director of K. Verlag, Berlin & Toronto

… Medieval works, as diverse as the tapestry, the glass window,
the miniature, the fresco, and the sculpture become united as
one family if reproduced together on one page.”60 In his search
for a common visual rhetoric, Malraux went further than
merely arranging creations from one epoch and cultural sphere
by attempting to collect and directly juxtapose artworks and
artifacts from very diverse and distant cultural, historical, and
geographic contexts.
His richly illustrated series of books thus functions as a
utopian archive of new temporalities of art liberated from
history and scale by de-contextualizing and re-situating the
works, or rather their reproduced images, in unorthodox combinations. Le Musée imaginaire was thus an experimental virtual
museum intended to both form a repository of knowledge and
provide a space of association and connection that could not
be sustained by any other existing place or institution. From an
art historical point of view—Malraux was not a trained scholar
and was readily criticized by academics—his theoretical
assumptions of “universal kinship” (von Schöning) and the
“anti-destiny” of art have been rejected. His material selection
process and visual appropriation and manipulation through
framing, lighting, and scale, have also been criticized for their
problematic and often controversial—one could say, colonizing—implications.61 Among the most recent critics is the art
historian Walter Grasskamp, who argues that Malraux moreover might well have plagiarized the image-based work of the
60 André Malraux, Das imaginäre Museum, 16.
61 See the two volumes of Georges Duthuit, Le Musée Inimaginable (Paris: J. Corti,
1956); Ernst Gombrich, “André Malraux and the Crisis of Expressionism,” The
Burlington Magazine 96 (1954): 374–78; Michel Merlot, “L’art selon André Malraux,
du Musée imaginaire à l’Inventaire general,” In Situ 1 (2001), www.insitu.revues.org/1053;
and von Schöning, “Die universelle Verwandtschaft zwischen den Bildern.”
