POKÉMANIAC SIPH wants to fight! @Siphonay

8 posts1 participant0 posts today

**Andreas Wagner** @anwagnerdreas@hcommons.social · 6d

Andreas Wagner @anwagnerdreas@hcommons.social

Hab ein bisschen darüber nachgedacht und hier meine Antwort: das geht aus grundsätzlichen Gründen nicht!

Die Chat-LLMs, die jede natürlichsprachliche Anweisung zu verstehen scheinen, machen nämlich genau das *nicht*: einer Anweisung folgen. Sie plappern nur vor sich hin und sehen halt zu, dass das Geplapper gut an das anschließt, was bisher so geplappert wurde.

Weil ihr Trainingsmaterial viel Dialog umfasst, sieht es dann so aus, als würden sie dialogische Anweisungen und Intentionen verstehen, eigene Intentionen als Reaktion entwickeln und dann artikulieren. Aber sie "vervollständigen" nur ein in der Form des Dialogs verfasstes Dokument.

Das "Eingehen" auf die Aufforderung (um nicht zu sagen: das Verständnis des Tasks) ist also Teil der textgenerativen Funktion und nicht vom Prozessieren, der Ausgabe und den Halluzinationen abtrennbar.

D.h. schlechte Nachricht: Ohne GenAI/autoregression muss man für jeden Task extra Finetunen!

Oder seht ihr das anders?

#LLM (post-) #DHd2025

**Henrik Schönemann** @lavaeolus@fedihum.org · Mar 7

Henrik Schönemann @lavaeolus@fedihum.org

Oh, #DHd2025
Ich wurde ja mehrfach nach meinen Hoodies und Aufklebern gefragt, die sind alle von @PGExplaining

Mar 7

**Asterisk** @jomla@fedihum.org · Mar 7 *

Asterisk @jomla@fedihum.org

"Wir müssten nicht sagen 'Gibt es Fragen', sondern 'Gibt es Diskussionsbeiträge?'" @Mareike2405 finde ich eigentlich grundsätzlich gut, weil es auch diese Hierarchie zwischen vermeintlichen Expert:in-Referent:innen und fragend-naivem Publikum herausfordert. #DHd2025

Mar 7 *

**Henrik Schönemann** @lavaeolus@fedihum.org · Mar 7

Henrik Schönemann @lavaeolus@fedihum.org

In Wien nächstes Jahr dann Queer-Meetup auf der #DHd2026

#DHd2025

Mar 7

**Christof Schöch** @christof@fedihum.org · Mar 7

Christof Schöch @christof@fedihum.org

@dingemansemark

I share this observation, but also experience it within myself as two opposing tendencies: fascinated by the apparent possibilities, but worried about a whole host of implications as well.

In addition, however, I am worried that external expecations to be "innovative" and "excellent" push more of us towards LLMs and GPTs than is healthy for a methodologically diverse field. Definitely some vibes of "big data" from 10-15 years ago going on...

#DHd2025 #LLMs #GPT

Mar 7

**Mark Dingemanse** @dingemansemark@scholar.social · Mar 7

Mark Dingemanse @dingemansemark@scholar.social

As a relative outsider I hope I can be permitted the observation that #dhd2025 at times seemed oddly split, with some workshops awash in unabashed (and unexamined) technosolutionism while others displayed more ethical & societal awareness & deeper theoretical grounding (e.g. #greeningDH, #datafeminism)

Eubanks 2011: "We must actively choose the kind of technosocial worlds we want to inhabit"

“We can create technologies that protect socially just values or
we can build technologies that permit those values to disappear.
We must actively choose the kind of technosocial worlds we
want to inhabit"

Mar 7

**Mark Dingemanse** @dingemansemark@scholar.social · Mar 6

Mark Dingemanse @dingemansemark@scholar.social

Eight books I built on in my lecture & can recommend for #dhd2025 — the lower row with Suchman, Illich, Eubanks and Franklin perhaps less familiar in some DH quarters but all the more relevant for how they study the intersections of culture, technology, science, and the social

Refs at https://markdingemanse.net/dhd2025

Eight covers of books:
Global Debates in the Digital Humanities (Fiormonte et al, eds)
Bloomsbury Handbook of the Digital Humanities (O'Sullivan, ed)
Digital Humanities Outside the Center (McGra et al, eds)
Doing Black Digital Humanities with radical intentionality (Knight Steele et al., eds)

Human-machine reconfigurations (Lucy Suchman)
Tools for Conviviality (Ivan Illich)
Digital Dead End (Virginia Eubanks)
The Real World of Technology (Ursula M. Franklin)

Mar 6

**@frueheneuzeit** @stefan_hessbrueggen@fedihum.org · Mar 6

@frueheneuzeit @stefan_hessbrueggen@fedihum.org

Fresh from the digital presses: https://doi.org/10.5281/zenodo.14967726 My data set with > 50000 catalogue records of early modern dissertations in French libraries. Background for my talk tomorrow at #dhd2025

ZenodoEMDFL: Early Modern Dissertations in French LibrariesThis dataset consists of two zip files, `emdfl_data.zip` and `emdfl_data_code.zip`. The `emdfl_data.zip` archive consists of seven CSV files which contain information -- with different levels of precision and completeness -- on between 17 942 and 55 178 dissertations published between 1564 and 1800 and held in French libraries. The datasets were derived from catalogue records in the general catalogue of the French national library (Bibliothèque Nationale de France, BNF) and the union catalogue of French university libraries (SUDOC). The `emdfl_data_code.zip` archive contains the underlying git repository, code in Python notebooks and downloaded files in order to document the research process. It is documented in a README file in the root directory of the repository. Here, we only describe the selection criteria and additional details for the `emdfl_data.zip`archive. In order to be included in the final dataset, catalogue records had to meet four criteria: We were able to find a determinate date of publication in the catalogue record. We could identify at least one reference to an author or contributor. We do not expect a unique identifier derived froman authority file. We could find a valid title. The dissertation was published after 1563. The last criterion is somewhat ad hoc, but is based on the diagnosis that for records containing an earlier date, the status of the underlying books as academic dissertations is somewhat dubious. All records meeting these four criteria are part of the 'bronze' dataset. It is coextensive with what we have called the ‘silver’ dataset for libraries. In other words, all records in the bronze dataset also contain valid identifiers for holding libraries. Both sets contain 55 178 records. The ‘silver’ dataset ‘Place’ documents in addition uniquely identifiable places of publication and contains 49 423 records. The silver dataset ‘Discipline’ contains additional information about the discipline to which a given dissertation belongs and contains 36 924 records. The silver dataset ‘Persons’ contains all records for which we could obtain at least one valid VIAF identifier for a person (an author, supervisor, or printer). It comprises 31 184 records. All records associated with one of these files have a unique ID, so that these datasets can be merged for analysis e. g. of the temporal and geographical distribution of dissertations, a topic that would require the intersection of the silver ‘Discipline’ and ‘Place’ datasets. The ‘gold’ dataset ‘Person Year Place’ includes at least one unique identifier for a person in the catalogue record, the year and the place of publication. It comprises 27 413 catalogue records. The ‘gold’ dataaset ‘All’ includes the same information as ‘Person Year Place’, but in addition also references the discipline of the dissertation. It contains 17 942 records.

Mar 6

**Andreas Wagner** @anwagnerdreas@hcommons.social · Mar 6 *

Andreas Wagner @anwagnerdreas@hcommons.social

Technische Frage: Ist es eigentlich möglich, ein autoencoding oder seq2seq Modell so zu trainieren, dass es - wie die bekannten Chat-Modelle - beliebige Anweisungen in natürlicher Sprache entgegennehmen und verarbeiten kann, oder ist dazu die generative Architektur unabdingbar?

Das ist ja vielleicht der größte Vorteil des Trainings, das diese Modelle erfahren haben.

#LLM #DHd2025

Mar 6 *

**Andreas Wagner** @anwagnerdreas@hcommons.social · Mar 6 *

Andreas Wagner @anwagnerdreas@hcommons.social

Eine grundlegende technische Differenz, die m.E. jede wissenschaftspolitische LLM Strategie berücksichten muss:

Generative (autoregressive) Modelle (die würden wir z.B. für Code Generation brauchen) sind etwas anderes als autoencoding Modelle (für z.B. Klassifikation) oder seq2seq Modelle (für z.B. (multimodale) Übersetzungen). Die autoencoders müssten im Vergleich zu GPT, Claude & Co. - bei gleicher Skalierungsstufe wohlgemerkt - Klassifikation und Informationsextraktion *viel besser* beherrschen, kein ausbeuterisches RLHF benötigen und nur wenig für Halluzinationen anfällig sein. Sie sind halt von den kommerziellen Anbietern nicht auf dieselbe Stufe hochskaliert worden wie die "Chat" Modelle.

Das müssten wir in der Wissenschaft vielleicht selber machen, aber das hätte ja auch Vorteile.

#DHd2025 #LLM

Mar 6 *

**Andreas Wagner** @anwagnerdreas@hcommons.social · Mar 6

Andreas Wagner @anwagnerdreas@hcommons.social

Ich hoffe, bei der #DHd2026 sind wir weiter und hören statt "Menschliche Nachkontrolle ist auch bei GPT-6u immer noch nötig" Einschätzungen und Erfahrungsberichte wie "Mit dem Wechsel von direkter Korrektur- und Tagging-Arbeit zu Nachkontrolle von LLM Output hat sich unser Aufwand und die erforderlichen Kompetenzen wie folgt verändert: ..."

#DHd2025

Mar 6

**Andreas Wagner** @anwagnerdreas@hcommons.social · Mar 6

Andreas Wagner @anwagnerdreas@hcommons.social

Interessant dazu dürfte Forschung wie die von Anthropic sein, die ich schon die ganze Zeit nicht schaffe endlich zu lesen: https://www.anthropic.com/research/mapping-mind-language-model

#DHd2025 #LLM #DigitalHumanities

4/4

www.anthropic.comMapping the Mind of a Large Language ModelWe have identified how millions of concepts are represented inside Claude Sonnet, one of our deployed large language models. This is the first ever detailed look inside a modern, production-grade large language model.

Mar 6

**Andreas Wagner** @anwagnerdreas@hcommons.social · Mar 6 *

Andreas Wagner @anwagnerdreas@hcommons.social

... und ich weiß nicht, ob es da reicht, behavioristisch die Regelmäßigkeit und Konformität im Verhalten zu beobachten. Wenn wir *Gewissheit* haben wollen, und/oder wenn wir *verstehen* wollen, was passiert, müssen wir die Begründungs- bzw. Kausalitätsverhältnisse anschauen. Da bin ich wohl eher Aristoteliker (Idealist?)...

#DHd2025 #LLM #DigitalHumanities

3/4

Mar 6 *

**Andreas Wagner** @anwagnerdreas@hcommons.social · Mar 6 *

Andreas Wagner @anwagnerdreas@hcommons.social

Wenn ich mal anthropomorphisieren darf: so ein Automat fragt sich nicht nur "Ich habe jetzt 'muss man sehen, dass' geschrieben, was könnte als nächstes kommen?", sondern er "berücksichtigt" dabei Faktoren wie "jetzt muss ein Autorenname kommen" oder "ich bin gerade in dem Teil des Textes, in dem ein Einwand dargelegt wird". Diese Prozesse/Berechnungen verstehen wir immer noch zu wenig.

#DHd2025 #LLM #DigitalHumanities

2/4

Mar 6 *

**Andreas Wagner** @anwagnerdreas@hcommons.social · Mar 6 *

Andreas Wagner @anwagnerdreas@hcommons.social

Die Frage, ob LLMs intelligent (oder nützlich oder zuverlässig oder verständig oder kompetente "Sprecher") sind oder nicht, bewegt sich für mich auf einer komisch allgemeinen Ebene.

Um zu verstehen, was sie tun, muss man m.E. nicht nur verstehen, dass sie probabilistische Next Token Predictors sind (die autoregressiven LLMs jedenfalls), sondern auch, dass (und wie und welche genau) sie interne Repräsentationen von abstrakten Einheiten wie grammatischen Wortklassen, rhetorischen Textstrukturen und semantischen Feldern haben - und wie diese Repräsentationen in die Next Token Wahrscheinlichkeiten hineinwirken.

#DHd2025 #LLM #DigitalHumanities

1/4

Mar 6 *