1. |
- Darányi, Sándor, et al.
(författare)
-
A Physical Metaphor to Study Semantic Drift
- 2016
-
Ingår i: Proceedings of SuCCESS-16, 1st International Workshop on Semantic Change & Evolving Semantics. - 9781450321389
-
Konferensbidrag (refereegranskat)abstract
- In accessibility tests for digital preservation, over time we experience drifts of localized and labelled content in statistical models of evolving semantics represented as a vector field. This articulates the need to detect, measure, interpret and model outcomes of knowledge dynamics. To this end we employ a high-performance machine learning algorithm for the training of extremely large emergent self-organizing maps for exploratory data analysis. The working hypothesis we present here is that the dynamics of semantic drifts can be modeled on a relaxed version of Newtonian mechanics called social mechanics. By using term distances as a measure of semantic relatedness vs. their PageRank values indicating social importance and applied as variable ‘term mass’, gravitation as a metaphor to express changes in the semantic content of a vector field lends a new perspective for experimentation. From ‘term gravitation’ over time, one can compute its generating potential whose fluctuations manifest modifications in pairwise term similarity vs. social importance, thereby updating Osgood’s semantic differential. The dataset examined is the public catalog metadata of Tate Galleries, London.
|
|
2. |
- Darányi, Sándor, 1951-, et al.
(författare)
-
A Potential Surface Underlying Meaning?
- 2015
-
Konferensbidrag (övrigt vetenskapligt/konstnärligt)abstract
- Machine learning algorithms utilizing gradient descent to identify concepts or more general learnables hint at a so-far ignored possibility, namely that local and global minima represent any vocabulary as a landscape against which evaluation of the results can take place. A simple example to illustrate this idea would be a potential surface underlying gravitation. However, to construct a gravitation-based representation of, e.g., word meaning, only the distance between localized items is a given in the vector space, whereas the equivalents of mass or charge are unknown in semantics. Clearly, the working hypothesis that physical fields could be a useful metaphor to study word and sentence meaning is an option but our current representations are incomplete in this respect.For a starter, consider that an RBF kernel has the capacity to generate a potential surface and hence create the impression of gravity, providing one with distance-based decay of interaction strength, plus a scalar scaling factor for the interaction, but of course no term masses. We are working on an experiment design to change that. Therefore, with certain mechanisms in neural networks that could host such quasi-physical fields, a novel approach to the modeling of mind content seems plausible, subject to scrutiny.Work in progress in another direction of the same idea indicates that by using certain algorithms, already emerged vs. still emerging content is clearly distinguishable, in line with Aristotle’s Metaphysics. The implications are that a model completed by “term mass” or “term charge” would enable the computation of the specific work equivalent of sentences or documents, and that via replacing semantics by other modalities, vector fields of more general symbolic content could exist as well. Also, the perceived hypersurface generated by the dynamics of language use may be a step toward more advanced models, for example addressing the Hamiltonian of expanding semantic systems, or the relationship between reaction paths in quantum chemistry vs. sentence construction by gradient descent.
|
|
3. |
- Darányi, Sándor, et al.
(författare)
-
Conceptual machinery of the mythopoetic mind : Attis, a case study
- 2015
-
Ingår i: Proceedings of QI-15, 9th International Quantum Interaction Symposium.
-
Konferensbidrag (refereegranskat)abstract
- In search for the right interpretation regarding a body of related content, we screened a small corpus of myths about Attis, a minor deity from the Hellenistic period in Asia Minor to identify the noncommutativity of key concepts used in storytelling. Looking at the protagonist's typical features, our experiment showed incompatibility with regard to his gender and downfall. A crosscheck for entanglement found no violation of a Bell inequality, its best approximation being on the border of the local polytope.
|
|
4. |
- Darányi, Sándor, et al.
(författare)
-
Connecting the Dots : Mass, Energy, Word Meaning, and Particle-Wave Duality
- 2012
-
Konferensbidrag (refereegranskat)abstract
- With insight from linguistics that degrees of text cohesion are similar to forces in physics, and the frequent use of the energy concept in text categorization by machine learning, we consider the applicability of particle-wave duality to semantic content inherent in index terms. Wave-like interpretations go back to the regional nature of such content, utilizing functions for its representation, whereas content as a particle can be conveniently modelled by position vectors. Interestingly, wave packets behave like particles, lending credibility to the duality hypothesis. We show in a classical mechanics framework how metaphorical term mass can be computed.
|
|
5. |
|
|
6. |
- Darányi, Sándor, et al.
(författare)
-
The gravity of meaning : Physics as a metaphor to model semantic changes
- 2012
-
Konferensbidrag (refereegranskat)abstract
- Based on a computed toy example, we offer evidence that by plugging in similarity of word meaning as a force plus a small modification of Newton’s 2nd law, one can acquire specific “mass” values for index terms in a Saltonesque dynamic library environment. The model can describe two types of change which affect the semantic composition of document collections: the expansion of a corpus due to its update, and fluctuations of the gravitational potential energy field generated by normative language use as an attractor juxtaposed with actual language use yielding time-dependent term frequencies. By the evolving semantic potential of a vocabulary and concatenating the respective term “mass” values, one can model sentences or longer strings of symbols as vector-valued functions. Since the line integral of such functions is used to express the work of a particle in a gravitational field, the work equivalent of strings can be calculated.
|
|
7. |
- Darányi, Sándor, et al.
(författare)
-
The Sphynx's new riddle : How to relate the canonical formula of myth to quantum interaction
- 2013
-
Konferensbidrag (refereegranskat)abstract
- We introduce Claude Lévi Strauss' canonical formula (CF), an attempt to rigorously formalise the general narrative structure of myth. This formula utilises the Klein group as its basis, but a recent work draws attention to its natural quaternion form, which opens up the possibility that it may require a quantum inspired interpretation. We present the CF in a form that can be understood by a non-anthropological audience, using the formalisation of a key myth (that of Adonis) to draw attention to its mathematical structure. The future potential formalisation of mythological structure within a quantum inspired framework is proposed and discussed, with a probabilistic interpretation further generalising the formula.
|
|
8. |
- Darányi, Sandor, et al.
(författare)
-
Toward a 5M Model of Digital Libraries
- 2010
-
Konferensbidrag (refereegranskat)abstract
- Whereas the DELOS DRM and the 5S model of digital libraries (DL) addresses the formal side of DL, we argue that a parallel 5M model is emerging as best practice worldwide, integrating multicultural, multilingual, multimodal digital objects with multivariate statistics-based document indexing, categorization and retrieval methods. The fifth M stands for the modeling the information searching behavior of users, and of collection development. We show how an extension of the 5S model to Hilbert space (a) points toward the integration of several Ms; (b) makes the tracking of evolving semantic content feasible, and (c) leads to a field interpretation of word and sentence semantics underlying language change. First experimental results from the Strathprints e-repository verify the mathematical foundations of the 5M model.
|
|
9. |
- Darányi, Sándor, et al.
(författare)
-
Toward Sequencing “Narrative DNA” : Tale Types, Motif Strings and Memetic Pathways
- 2012
-
Konferensbidrag (övrigt vetenskapligt/konstnärligt)abstract
- The Aarne-Thompson-Uther Tale Type Catalog (ATU) is a bibliographic tool which uses metadata from tale content, called motifs, to define tale types as canonical motif sequences. The motifs themselves are listed in another bibliographic tool, the Aarne-Thompson Motif Index (AaTh). Tale types in ATU are defined in an abstracted fashion and can be processed like a corpus. We analyzed 219 types with 1202 motifs from the “Tales of magic” (types 300-749) segment to exemplify that motif sequences show signs of recombination in the storytelling process. Compared to chromosome mutations in genetics, we offer examples for insertion/deletion, duplication and, possibly, transposition, whereas the sample was not sufficient to find inverted motif strings as well. These initial findings encourage efforts to sequence motif strings like DNA in genetics, attempting to find for instance the longest common motif subsequences in tales. Expressing the network of motif connections by graphs suggests that tale plots as consolidated pathways of content help one memorize culturally engraved messages. We anticipate a connection between such networks and addington’s epigenetic landscape.
|
|
10. |
- Gao, Shi Chao, et al.
(författare)
-
Data-driven estimation of blood pressure using photoplethysmographic signals
- 2016
-
Ingår i: Proceedings of EMBC-16, 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society. - : IEEE. - 9781457702204
-
Konferensbidrag (refereegranskat)abstract
- Noninvasive measurement of blood pressure by optical methods receives considerable interest, but the complexity of the measurement and the difficulty of adjusting parameters restrict applications. We develop a method for estimating the systolic and diastolic blood pressure using a single-point optical recording of a photoplethysmographic (PPG) signal. The estimation is data-driven, we use automated machine learning algorithms instead of mathematical models. Combining supervised learning with a discrete wavelet transform, the method is insensitive to minor irregularities in the PPG waveform, hence both pulse oximeters and smartphone cameras can record the signal. We evaluate the accuracy of the estimation on 78 samples from 65 subjects (40 male, 25 female, age 29±7) with no history of cardiovascular disease. The estimate for systolic blood pressure has a mean error 4.9±4.9 mm Hg, and 4.3±3.7 mm Hg for diastolic blood pressure when using the oximeter-obtained PPG. The same values are 5.1±4.3 mm Hg and 4.6±4.3 mm Hg when using the phone-obtained PPG, comparing with A&D UA-767PBT result as gold standard. The simplicity of the method encourages ambulatory measurement, and given the ease of sharing the measured data, we expect a shift to data-oriented approaches deriving insight from ubiquitous mobile devices that will yield more accurate machine learning models in monitoring blood pressure.
|
|