You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: _data/publications.yaml
+17-9Lines changed: 17 additions & 9 deletions
Original file line number
Diff line number
Diff line change
@@ -16,7 +16,7 @@ areas:
16
16
low-precision: "Low Precision"
17
17
sparsity: "Sparsity"
18
18
efficient-ml: "Efficient ML"
19
-
gnns: "Graph Neural Networks"
19
+
gnns: "Graph Learning"
20
20
physics: "Physics"
21
21
graphics: "Graphics"
22
22
@@ -34,6 +34,14 @@ papers:
34
34
2025:
35
35
conference:
36
36
37
+
- title: "The Role of Graph Topology in the Performance of Biomedical Knowledge Graph Completion Models"
38
+
url: https://www.arxiv.org/abs/2409.04103
39
+
date: 2025-10-07
40
+
area: [gnns]
41
+
authors: "Alberto Cattaneo, Stephen Bonner, Thomas Martynec, Edward Morrissey, Carlo Luschi, Ian P Barrett, Daniel Justus"
42
+
abstract: "Knowledge Graph Completion has been increasingly adopted as a useful method for several tasks in biomedical research, like drug repurposing or drug-target identification. To that end, a variety of datasets and Knowledge Graph Embedding models has been proposed over the years. However, little is known about the properties that render a dataset useful for a given task and, even though theoretical properties of Knowledge Graph Embedding models are well understood, their practical utility in this field remains controversial. We conduct a comprehensive investigation into the topological properties of publicly available biomedical Knowledge Graphs and establish links to the accuracy observed in real-world applications. By releasing all model predictions and a new suite of analysis tools we invite the community to build upon our work and continue improving the understanding of these crucial applications."
43
+
published: "Bioinformatics, Volume 41, Issue 10, October 2025"
44
+
37
45
- title: "On Stochastic Rounding with Few Random Bits"
38
46
url: https://arxiv.org/abs/2504.20634
39
47
date: 2025-05-07
@@ -52,6 +60,14 @@ papers:
52
60
53
61
workshop:
54
62
63
+
- title: "Ground-Truth Subgraphs for Better Training and Evaluation of Knowledge Graph Augmented LLMs"
64
+
url: https://arxiv.org/abs/2511.04473
65
+
date: 2025-11-06
66
+
area: [gnns]
67
+
authors: "Alberto Cattaneo, Carlo Luschi, Daniel Justus"
68
+
abstract: "Retrieval of information from graph-structured knowledge bases represents a promising direction for improving the factuality of LLMs. While various solutions have been proposed, a comparison of methods is difficult due to the lack of challenging QA datasets with ground-truth targets for graph retrieval. We present SynthKGQA, a framework for generating high-quality synthetic Knowledge Graph Question Answering datasets from any Knowledge Graph, providing the full set of ground-truth facts in the KG to reason over each question. We show how, in addition to enabling more informative benchmarking of KG retrievers, the data produced with SynthKGQA also allows us to train better models. We apply SynthKGQA to Wikidata to generate GTSQA, a new dataset designed to test zero-shot generalization abilities of KG retrievers with respect to unseen graph structures and relation types, and benchmark popular solutions for KG-augmented LLMs on it."
69
+
published: "arXiv Preprint"
70
+
55
71
- title: "Elucidating the Design Space of FP4 training"
56
72
url: https://arxiv.org/abs/2509.17791
57
73
date: 2025-09-22
@@ -105,14 +121,6 @@ papers:
105
121
abstract: "The nearest neighbour search problem underlies many important machine learning applications, including efficient long-context generation, retrieval-augmented generation, and knowledge graph completion. However, computing top-k exactly suffers from limited parallelism, making it inefficient for highly parallel machine learning accelerators. By relaxing the requirement that the top-k is exact, bucketed algorithms can dramatically increase parallelism by independently computing many smaller top-k operations. We explore the design choices for this class of algorithms using both theoretical analysis and empirical evaluation on downstream tasks. Our motivating examples are sparsity algorithms for language models, which often use top-k to select the most important parameters or activations. We also release a fast bucketed top-k implementation for PyTorch."
106
122
published: "NeurIPS'24 Workshop on Adaptive Foundation Models"
107
123
108
-
- title: "The Role of Graph Topology in the Performance of Biomedical Knowledge Graph Completion Models"
109
-
url: https://www.arxiv.org/abs/2409.04103
110
-
date: 2024-09-06
111
-
area: [gnns]
112
-
authors: "Alberto Cattaneo, Stephen Bonner, Thomas Martynec, Carlo Luschi, Ian P Barrett, Daniel Justus"
113
-
abstract: "Knowledge Graph Completion has been increasingly adopted as a useful method for several tasks in biomedical research, like drug repurposing or drug-target identification. To that end, a variety of datasets and Knowledge Graph Embedding models has been proposed over the years. However, little is known about the properties that render a dataset useful for a given task and, even though theoretical properties of Knowledge Graph Embedding models are well understood, their practical utility in this field remains controversial. We conduct a comprehensive investigation into the topological properties of publicly available biomedical Knowledge Graphs and establish links to the accuracy observed in real-world applications. By releasing all model predictions and a new suite of analysis tools we invite the community to build upon our work and continue improving the understanding of these crucial applications."
114
-
published: "ICML'24 Workshop on Machine Learning for Life and Material Science: From Theory to Industry applications"
115
-
116
124
- title: "Scalify: scale propagation for efficient low-precision LLM training"
0 commit comments