Contents of 

Glottometrics 15, 2007 (including abstracts)

 

Haitao Liu
Probability distribution of dependency distance           1-12
Abstract. This paper investigates probability distributions of dependency distances in six texts extracted from a Chinese dependency treebank. The fitting results reveal that the investigated distribution can be well captured by the right truncated Zeta distribution. In order to restrict the model only to natural language, two samples with randomly generated governors are investigated. One of them can be described e.g. by the Hyperpoisson distribution, the other satisfies the Zeta distribution. The paper also presents a study on sequential plot and mean dependency distance of six texts with three analyses (syntactic, and two random). Of these three analyses, syntactic analysis has a minimum (mean) dependency distance.
Oxana Kotsyuba
Russizismen im deutschen Wortschatz 13-23
Abstract. The history of the German language is a history depicting the influence of foreign languages on German, as has been portrayed in different publications on the influence of the English, French, and Italian languages. The influence of other modern languages, among them the Russian language, has not been analysed to a great extent. This paper, based on gained data, intends to determine whether the Piotrowski-Law applies to the process of word-borrowing from Russian into German.
Karl-Heinz Best

Zur Entwicklung des Wortschatzes der Elektrotechnik, Informationstechnik und Elektrophysik im Deutschen

24-27
Abstract. The purpose of this paper is to present some further evidence for the validity of the logistic law in the development of the dictionary. To this end we test some data on the increase of terms and signs in a technical language presented by Warner (2007).
Motohiro Ishida, Kazue Ishida
On distributions of sentence lengths in Japanese writing 28-44
Abstract. The lognormal distribution had long been thought to be the most appropriate probability distribution for Japanese sentence length distributions. Yet this view had been supported only by few researches with sparse sampling data and reasoning contradicting language reality. In order to show a more realistic approach, we analyzed a substantial number of samples. At first, 150 essays and short stories were drawn as a random sample, out of which any pieces of writing whose length was either less than 100 or more than 1000 sentences were excluded. As a result, 113 pieces remained as sample texts. We also paid attention to the kinds of sentences, separating those of dialogue from narrative ones. From each one of these 113 sample texts, three sentence length frequency distributions were acquired – the first one for a complete text, the second one for the collection of direct speech in the same text, and the third one for all the narrative parts excluding direct speech above. The results completely overturn the long-standing belief, proving that a lognormal distribution – which has been computed but will not be shown here – can never be well applied to Japanese sentence length distributions. Our new findings indicate that in place of this lognormal distribution, the Hyperpascal distribution  maintains an excellent goodness of fit.
Ján Mačutek,  Ioan-Iovitz Popescu,  Gabriel Altmann
Confidence intervals and tests for the h-point and related text characteristics 45-52
Abstract. Confidence intervals and tests for recently introduced text characteristics (the h-point and its relatives) are derived.
Reginald Smith
Investigation of the Zipf-plot of the extinct Meroitic language 53-61
Abstract: The ancient and extinct language Meroitic is investigated using Zipf’s Law. In particular, since Meroitic is still undeciphered, the Zipf law analysis allows us to assess the quality of current texts and possible avenues for future investigation using statistical techniques.
Reinhard Köhler, Reinhard Rapp
A psycholinguistic application of synergetic linguistics          62-70
Abstract: The paper presents a new attempt to analyse the relationship between word familiarity and word frequency within the framework of synergetic linguistics. Whereas in psychology it is customary to apply correlational analyses to such questions the current paper sets up a functional model and tests it on empirical data from two large corpora and a psycholinguistic database.
Ioan-Iovitz Popescu, Gabriel Altmann
Writer´s view of text generation 71-81
Abstract: Generally, a “writer’s view”, defined by the angle between the ends of the word rank-frequency distribution, as seen from the h-point, should be limited in the interval [π/2, π]. However, as shown in the present paper with 176 texts from 20 languages, actually the lower limit appears to be the golden number φ = 1.618... , rather than π/2 = 1.57...
Peter Grzybek

On the systematic and system-based study of grapheme frequencies: a re-analysis of German letter frequencies

82-91
Abstract: This study looks at the theoretical modeling of letter frequencies. Based on recent findings demonstrating the negative hypergeometric function to be an adequate model, a re-analysis of German data reported by Best (2005) is conducted, concentrating on a detailed examination of parameter behavior. It is shown that all parameters of this distribution behave regularly, if the analysis is based on the system’s inventory size, rather than on the class of items occurring in the given sample. Directions for future research are pointed out, particularly involving factors influencing parameter values.
History of Quantitative Linguistics  92-100
Karl-Heinz Best, Gabriel Altmann
XXX. Gustav Herdan (1897-1968) 92-96
Emmerich Kelih
XXXI. B.I. Jarcho as a pioneer of the exact study of literature 96-100

Glottometrics ist eine unregelmäßig erscheinende Zeitschrift für die quantitative Erforschung von Sprache und Text.

 

Glottometrics is a scientific journal for the quantitative research on language and text published at irregular intervals

Beiträge in Deutsch oder Englisch sollten an einen der Herausgeber in einem gängigen Textverarbeitssystem (vorrangig WORD) geschickt werden.

 

Contributions in English or German written with a common text processing system (preferably WORD) should be sent to one of the editors
Glottometrics kann aus dem Internet heruntergeladen, auf CD-ROM (PDF-Format) oder in Buchform bestellt werden. Glottometrics can be downloaded from the Internet, obtained on CD-ROM (in PDF) or in form of printed copies

Herausgeber/Editors:

G. Altmann 02351973070-0001@t-online.de
K.-H. Best kbest@gwdg.de
P. Grzybek peter.grzybek@uni-graz.at 
A. Hardie a.hardie@lancester.ac.uk 
L. Hrebicek hrebicek@orient.cas.cz
R. Köhler koehler@uni-trier.de
J. Macutek jmacutek@yahoo.com 
G. Wimmer wimmer@mat.savba.sk
A. Ziegler arneziegler@compuserve.de

Herunterladen/ Downloading: http://www.ram-verlag.de

ISSN 1617-8351                                                                                                   back