Christoph W. Kessler, Linköping University, Sweden

LIST OF PUBLICATIONS



Books and Book Chapters

[B1]
Jörg Keller, Christoph W. Kessler, Jesper L. Träff:
Practical PRAM Programming
Textbook, 596 pages, Wiley Interscience, New York, 2001.

[B2]
Christoph W. Kessler:
Compiling for VLIW DSPs
Book chapter, 38 pages, in: S. Bhattacharyya, E. Deprettere, R. Leupers, and J. Takala, eds., Handbook of Signal Processing Systems, Springer, 2010.
Also in the Second edition, pp. 1177-1214, Springer, 2013. DOI 10.1007/978-1-4614-6859-2_36.
Revised version in Third edition, pp. 979-1020, Springer, 2019. DOI 10.1007/978-3-319-91734-4

[B3]
Christoph W. Kessler:
Programming the Cell Processor
Book chapter, pp. 155-198 in: A. Adl-Tabatabai, V. Pankratius, W. Tichy, eds., Fundamentals of Multicore Software Development, CRC Press / Taylor and Francis, December 2011.

[B4]
Christoph W. Kessler, Sergei Gorlatch, Johan Enmyren, Usman Dastgeer, Michel Steuwer, Philipp Kegel:
Skeleton Programming for Portable Many-Core Computing.
Book chapter (Ch. 6) in: S. Pllana and F. Xhafa, eds., Programming Multi-Core and Many-Core Computing Systems, Wiley Interscience, New York, pages 121-142, accepted 2011, published March 2017. Book-ISBN: 978-0-470-93690-0.

[B5]
Elisabeth Larsson, Afshin Zafari, Marco Righero, M. Alessandro Francavilla, Giorgio Giordanengo, Francesca Vipiana, Giuseppe Vecchi, Christoph W. Kessler, Corinne Ancourt, Clemens Grelck:
Parallelization of Hierarchical Matrix Algorithms for Electromagnetic Scattering Problems.
Book chapter in: J. Kolodziej, H. Gonzalez-Velez (Eds.): High-Performance Modelling and Simulation for Big Data Applications. Springer LNCS vol. 11400 pp. 36-68, Apr. 2019. DOI: 10.1007/978-3-030-16272-6. Book-ISBN: 978-3-030-16271-9.

Articles in Peer-Reviewed Journals

[J1]
Christoph W. Kessler, Thomas Rauber:
Generating Optimal Contiguous Evaluations for Expression DAGs
Computer Languages vol. 21(2), 1995. DOI: 10.1016/0096-0551(95)00005-O

[J2]
Christoph W. Kessler:
Pattern-Driven Automatic Parallelization
Scientific Programming, vol. 5(3), pp. 251-274, 1996.

[J3]
Christoph W. Kessler, Helmut Seidl.
The Fork95 Parallel Programming Language: Design, Implementation, Application
International Journal of Parallel Programming, vol. 25(1), pp. 17-50, Plenum Press, Feb. 1997. DOI: 10.1007/BF02700045.

[J4]
Christoph W. Kessler:
Scheduling Expression DAGs for Minimal Register Need
Computer Languages, vol. 24(1), pp. 33-53, Elsevier, 1998.

[J5]
Christoph W. Kessler, Jesper L. Träff:
Language and Library Support for Practical PRAM Programming
Parallel Computing, vol. 25: 105-135, Elsevier, 1999.

[J6]
Beniamino Di Martino, Christoph W. Kessler:
Two Program Comprehension Tools for Automatic Parallelization
IEEE Concurrency, IEEE CS Press, vol. 8(1), pages 37-47, Spring 2000.

[J7]
Christoph W. Kessler:
NestStep: Nested Parallelism and Virtual Shared Memory for the BSP Model
The Journal of Supercomputing, vol. 17:245-262, Kluwer Academic Publishers, 2000.

[J8]
Christoph W. Kessler:
Managing Distributed Shared Arrays in a Bulk-Synchronous Parallel Environment
Concurrency and Computation: Practice and Experience, vol. 16:133-153, Wiley, 2004. DOI: 10.1002/cpe.768

[J9]
Christoph W. Kessler and Andrzej Bednarski.
Optimal integrated code generation for VLIW architectures.
Concurrency and Computation: Practice and Experience, vol. 18: 1353-1390, Wiley, 2006. DOI: 10.1002/cpe.1012

[J10]
Christoph W. Kessler, Andrzej Bednarski, Mattias Eriksson.
Classification and generation of schedules for VLIW processors
Concurrency and Computation: Practice and Experience 19:2369-2389, Wiley, 2007.

[J11]
Christoph W. Kessler.
Programming Techniques for the Cell Processor.
it - Information Technology 53(2):66-74, ISSN 1611-2776, Oldenbourg Verlag, April 2011.

[J12]
Siegfried Benkner, Sabri Pllana, Jesper L. Träff, Philippas Tsigas, Uwe Dolinsky, Cedric Augonnet, Beverly Bachmayer, Christoph W. Kessler, David Moloney, Vitaly Osipov:
PEPPHER: Efficient and Productive Usage of Hybrid Computing Systems.
IEEE Micro 31(5), September/October 2011.

[J13]
Christoph W. Kessler, Welf Löwe:
Optimized composition of performance-aware parallel components.
Concurrency and Computation: Practice and Experience 24(5): 481-498, April 2012.
(Published online in Wiley Online Library, DOI: 10.1002/cpe.1844, Sep. 2011.)

[J14]
Mattias Eriksson and Christoph W. Kessler.
Integrated Code Generation for Loops.
ACM Transactions on Embedded Computing Systems, vol. 11S issue 1, June 2012, article No. 19, ACM.
Authors' final manuscript (PDF), accepted 2010.

[J15]
Jörg Keller, Christoph Kessler, Rikard Hulten:
Optimized on-chip pipelining for memory-intensive computations on multi-core processors with explicit memory hierarchy.
Journal of Universal Computer Science 18(14):1987-2023, Oct. 2012. DOI: 10.3217/jucs-018-14-1987

[J16]
Amin Shafiee Sarvestani, Erik Hansson, Christoph Kessler:
Extensible Recognition of Algorithmic Patterns in DSP Programs for Automatic Parallelization.
International Journal of Parallel Programming, Nov. 2012 (online) / Dec. 2013, Volume 41, Issue 6, pp 806-824 (print). DOI: 10.1007/s10766-012-0229-2

[J17]
Usman Dastgeer, Lu Li, Christoph Kessler:
The PEPPHER Composition Tool: Performance-Aware Composition for GPU-based Systems.
Computing, vol. 96 no. 12 (2014), pages 1195-1211, ISSN 0010-485X (print) / 1436-5057 (online), Nov. 2013 (DOI 10.1007/s00607-013-0371-8). Springer.

[J18]
Martti Forsell, Erik Hansson, Christoph Kessler, Jari-Matti Mäkelä, Ville Leppänen:
NUMA Computing with Hardware and Software Co-Support on Configurable Emulated Shared Memory Architectures.
International Journal of Networking and Computing 4(1): 189-206, 2014. ISSN 2185-2839 (print), ISSN 2185-2847 (online).

[J19]
Usman Dastgeer, Christoph Kessler:
Performance-aware Composition Framework for GPU-based Systems.
The Journal of Supercomputing 71(12):4646-4662, Dec. 2015 (online: Jan. 2014), Springer. DOI 10.1007/s11227-014-1105-1

[J20]
Nicolas Melot, Christoph Kessler, Jörg Keller, Patrick Eitschberger:
Fast Crown Scheduling Heuristics for Energy-Efficient Mapping and Scaling of Moldable Streaming Tasks on Many-Core Systems.
ACM Transactions on Architecture and Code Optimization (TACO), Vol. 11 Issue 4, Article No. 62, Jan. 2015, ACM. DOI: 10.1145/2687653.
Invited presentation at HiPEAC'15 conference, Amsterdam, Jan. 2015.

[J21]
Usman Dastgeer, Christoph Kessler:
Smart containers and skeleton programming for GPU-based systems.
International Journal of Parallel Programming 44(3):506-530, June 2016 (online: March 2015), Springer. DOI: 10.1007/s10766-015-0357-6.

[J22]
Lu Li, Usman Dastgeer, Christoph Kessler:
Pruning strategies in adaptive off-line tuning for optimized composition of components on heterogeneous systems.
Parallel Computing 51:37-45, Jan. 2016 (online: Oct. 2015), Elsevier. DOI: 10.1016/j.parco.2015.09.003

[J23]
Lu Li, Christoph Kessler:
MeterPU: A Generic Measurement Abstraction API Enabling Energy-tuned Skeleton Backend Selection.
Journal of Supercomputing, Volume 74, Issue 11, pp. 5643-5658, Nov. 2018 (online: June 2016), Springer. DOI: 10.1007/s11227-016-1792-x

[J24]
August Ernstsson, Lu Li, Christoph Kessler:
SkePU 2: Flexible and type-safe skeleton programming for heterogeneous parallel systems.
International Journal of Parallel Programming 46(1):62, Jan. 2018 (online: Jan. 2017), Springer. DOI: 10.1007/s10766-017-0490-5

[J25]
August Ernstsson, Christoph Kessler:
Extending smart containers for data locality-aware skeleton programming.
Concurrency and Computation: Practice and Experience 31(5), March 2019 (online: Oct. 2018), Wiley. DOI: 10.1002/cpe.5003

[J26]
Tomas Öhberg, August Ernstsson, Christoph Kessler:
Hybrid CPU-GPU execution support in the skeleton programming framework SkePU.
The Journal of Supercomputing 76(7):5038-5056, July 2020 (online: March 2019), Springer. DOI: 10.1007/s11227-019-02824-7

[J27]
Ludovic Henrio, Christoph Kessler, Lu Li:
Leveraging Access Mode Declarations in a Model for Memory Consistency in Heterogeneous Systems.
Journal of Logical and Algebraic Methods in Programming, vol. 110, Jan. 2020 (online: Oct. 2019), Elsevier. DOI: 10.1016/j.jlamp.2019.100498

[J28]
Vasco Amaral, Beatriz Norberto, Miguel Goulao, Marco Aldinucci, Siegfried Benkner, Andrea Bracciali, Paulo Carreira, Edgars Celms, Luis Correia, Clemens Grelck, Helen Karatza, Christoph Kessler, Peter Kilpatrick, Hugo Martiniano, Ilias Mavridis, Sabri Pllana, Ana Respicio, Jose Simao, Luis Veiga, Ari Visa:
Programming Languages for Data-Intensive HPC Applications: a Systematic Mapping Study.
Parallel Computing 91(102584), March 2020, Elsevier. DOI: 10.1016/j.parco.2019.102584

[J29]
Christoph Kessler, Sebastian Litzinger, Jörg Keller:
Static Scheduling of Moldable Streaming Tasks with Task Fusion for Parallel Systems with DVFS.
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (TCAD) 39(11), pages 4166-4178, Nov. 2020. DOI: 10.1109/TCAD.2020.3013054

[J30]
Christoph Kessler, Sebastian Litzinger, Jörg Keller:
Crown-Scheduling of Sets of Parallelizable Tasks for Robustness and Energy-elasticity on Many-core Systems with Discrete Dynamic Voltage and Frequency Scaling
Journal of Systems Architecture, vol. 115, number 101999, Elsevier, May 2021. DOI: https://doi.org/10.1016/j.sysarc.2021.101999 (Open Access)

[J31]
August Ernstsson, Johan Ahlqvist, Stavroula Zouzoula, Christoph Kessler:
SkePU 3: Portable High-Level Programming of Heterogeneous Systems and HPC Clusters
International Journal of Parallel Programming 49(6), Dec. 2021, Springer (online: May 2021), open access.

[J32]
Lazaros Papadopoulos, Dimitrios Soudris, Christoph Kessler, August Ernstsson, Johan Ahlqvist, Nikos Vasilas, Athanasios I. Papadopoulos, Panos Seferlis, Charles Prouveur, Matthieu Haefele, Samuel Thibault, Athanasios Salamanis, Theodoros Ioakimidis, Dionysios Kehagias:
EXA2PRO: A Framework for High Development Productivity on Heterogeneous Computing Systems
IEEE Transactions on Parallel and Distributed Systems 33(4):792-804, April 2022 (online: Aug. 2021). Open Access. DOI: 10.1109/TPDS.2021.3104257.

[J33]
August Ernstsson, Nicolas Vandenbergen, Jörg Keller, Christoph Kessler:
A Deterministic Portable Parallel Pseudo-Random Number Generator for Pattern-Based Programming of Heterogeneous Parallel Systems.
International Journal of Parallel Programming 50, pages 319-340, Aug. 2022 (online: March 2022). Open Access. DOI: 10.1007/s10766-022-00726-5

[J34]
Jörg Keller, Sebastian Litzinger, Christoph Kessler:
Integrating Energy-Optimizing Scheduling of Moldable Streaming Tasks with Design Space Exploration for Multiple Core Types on Configurable Platforms.
Journal of Signal Processing Systems 94, pages 849-864, Sep. 2022 (online: June 2022). Open Access. DOI: 10.1007/s11265-022-01787-y

[J35]
August Ernstsson, Dalvan Griebler, Christoph Kessler:
Assessing Application Efficiency and Performance Portability in Single-Source Programming for Heterogeneous Parallel Systems.
International Journal of Parallel Programming 51(1), Jan. 2023 (online: Dec. 2022). Springer, Open Access. DOI: 10.1007/s10766-022-00746-1

[J36]
Sebastian Litzinger, Jörg Keller, Christoph Kessler:
Packing Multiple Types of Cores for Energy-Optimized Heterogeneous Hardware-Software Co-Design of Moldable Streaming Computations.
IEEE Access vol. 11, pp. 19301-19311, March 2023

[J37]
Björn Birath, August Ernstsson, John Tinnerholm, Christoph Kessler:
High-Level Programming of FPGA-Accelerated Systems with Parallel Patterns.
International Journal of Parallel Programming, May 2024. Springer, Open Access. DOI: 10.1007/s10766-024-00770-3


Papers in International Peer-Reviewed Conferences

[C1]
Christoph W. Kessler, Wolfgang J. Paul, Thomas Rauber:
A randomized heuristic approach to register allocation
Proc. of 3rd Int. Symposium on Progr. Lang. Implementation and Logic Programming (PLILP'91), Passau, Germany, Aug. 1991, Springer LNCS vol. 528.

[C2]
Christoph W. Kessler, Wolfgang J. Paul, Thomas Rauber:
Scheduling Vector Straight Line Code on Vector Processors
in: R. Giegerich and S. Graham [Eds.]: Code Generation - Concepts, Techniques, Tools. Springer Workshops in Computer Science, Springer, 1992.

[C3]
Christoph W. Kessler, Thomas Rauber:
Efficient Register Allocation for Large Basic Blocks
Proc. of 5th Int. Symposium on Progr. Lang. Implementation and Logic Programming (PLILP'93), Springer LNCS 714, 1993.

[C4]
Christoph W. Kessler and Wolfgang J. Paul:
Automatic Parallelization by Pattern Matching
Proc. of 2nd Int. ACPC Conference, Gmunden, Austria, Oct. 1993, Springer LNCS Vol. 734.

[C5]
Christoph W. Kessler:
Exact Array Data Flow Analysis and Pattern Recognition in Dense Matrix Computations
in: K.M. Decker and R. Rehmann [Eds.]: Proc. of IFIP WG10.3 Working Conference on Programming Environments for Massively Parallel Systems, Verlag Birkhäuser, April 1994.

[C6]
Christoph W. Kessler:
Pattern-Driven Automatic Program Transformation and Parallelization
Proc. of 3rd Euromicro Workshop on Parallel and Distributed Processing, San Remo, Italy, Jan. 1995, pp. 76-83. IEEE CS Press.

[C7]
Christoph W. Kessler, Thomas Rauber:
Optimal Contiguous Expression DAG Evaluations
Proc. of Int. Conf. on Fundamentals of Computation Theory (FCT'95), Dresden, Germany, Aug. 1995, Springer LNCS.

[C8]
Christoph W. Kessler, Helmut Seidl.
Integrating Synchronous and Asynchronous Paradigms: the Fork95 Parallel Programming Language
Proceedings of MPPM-95 Conference on Massively Parallel Programming Models, Berlin, Oct. 9-12, 1995, IEEE CS press.
See also: Technical Report Nr. 95-05 of University of Trier, Department for Mathematics and Computer Science, May 1995.

[C9]
Beniamino Di Martino and Christoph W. Kessler:
Program Comprehension Engines for Automatic Parallelization: A Comparative Study
Proc. of 1st Int. Workshop on Software Engineering for Parallel and Distributed Systems, Chapman&Hall, March 25-26, 1996, Berlin, Germany.

[C10]
Christoph W. Kessler and Jesper L. Träff.
A Library of Basic PRAM Algorithms in FORK (Research Summary)
Proceedings of SPAA'96 ACM Symposium on Parallel Algorithms and Architectures, Padua, Italy, June 24-26, 1996. ACM Press.

[C11]
Christoph W. Kessler.
Parallel Fourier-Motzkin Elimination
Proc. of Euro-Par'96, Lyon, France, Aug. 1996. Springer LNCS 1124, pp. 66-71.
Author version (PDF). - For a more detailed version see the technical report [M7].

[C12]
Christoph W. Kessler:
Scheduling Expression DAGs for Minimal Register Need
Proc. of 8th Int. Symposium on Programming Languages: Implementations, Logics, and Programs (PLILP'96), Springer LNCS 1140, pp. 228-242, Aachen, Sept. 27-29, 1996.
Appeared also as Technical Report 96-12, Univ. Trier, FB IV Mathematik/Informatik, 1996.

[C13]
Christoph W. Kessler, Jesper L. Träff.
Language and Library Support for Practical PRAM Programming
Proceedings of PDP'97 Fifth Euromicro Workshop on Parallel and Distributed Processing, London, UK, Jan. 22-24, 1997, pp. 216-221. IEEE CS Press.

[C14]
Christoph W. Kessler, Helmut Seidl.
Language Support for Synchronous Parallel Critical Sections
Proc. of APDC-97 Int. Conf. on Advances in Parallel and Distributed Computing, Shanghai, March 19-21, 1997, pp. 92-99. IEEE CS Press

[C15]
Christoph W. Kessler:
Applicability of Program Comprehension to Sparse Matrix Computations
Proc. of Euro-Par'97, Passau, Aug. 26-28, 1997, Springer LNCS 1300

[C16]
Arno Formella, Thomas Grün, Christoph W. Kessler:
The SB-PRAM: Concept, Design and Construction.
Proceedings of MPPM-97 3rd Int. Conference on Massively Parallel Programming Models, London, Nov. 1997, pp. 163-172, IEEE Computer Society Press, 1997. DOI: 10.1109/MPPM.1997.715972

[C16]
Christoph W. Kessler, Helmut Seidl.
ForkLight: A Control-Synchronous Parallel Programming Language.
Proc. HPCN'99, Amsterdam, April 12-14, 1999, Springer LNCS vol. 1593 pp. 525-534.

[C17]
Christoph W. Kessler, Craig H. Smith:
The SPARAMAT Approach to Automatic Comprehension of Sparse Matrix Computations.
Proc. Int. Workshop on Program Comprehension (IWPC'99), Pittsburgh, May 5-7, 1999, pp. 200-207, IEEE CS Press. DOI: http://dx.doi.org/10.1109/WPC.1999.777759.
Long version: Technical report [M11]

[C18]
Christoph W. Kessler:
NestStep: Nested Parallelism and Virtual Shared Memory for the BSP Model
Proc. PDPTA'99 Int. Conf. on Parallel and Distributed Processing Techniques and Applications, Las Vegas, June 28 - July 1, 1999. Vol. II, pp. 613-619. CSREA Press.

[C19]
Christoph W. Kessler, Andrzej Bednarski:
A Dynamic Programming Approach to Optimal Integrated Code Generation
Proc. ACM SIGPLAN Workshop on Languages, Compilers, and Tools for Embedded Systems (LCTES'2001), June 22-23, 2001, Snowbird, Utah, USA.

[C20]
Christoph W. Kessler, Andrzej Bednarski:
Optimal Integrated Code Generation for Clustered VLIW Architectures
Proc. ACM SIGPLAN conference on Languages, Compilers, and Tools for Embedded Systems (LCTES-SCOPES'2002), June 19-21, 2002, Berlin, Germany.

[C21]
Mikhail Chalabine, Christoph W. Kessler, Staffan Wiklund:
Optimising Intensive Interprocess Communication in a Parallelised Telecommunication Traffic Simulator
Proc. HPC-2003 High-Performance Computing (track of the Advanced Simulation Technology Conference), Orlando, Florida, USA, April 2003.

[C22]
Christoph W. Kessler:
A practical access to the theory of parallel algorithms
Proc. ACM SIGCSE'04 Symposium on Computer Science Education, Norfolk, Virginia, USA, March 2004.

[C23]
Andrzej Bednarski and Christoph Kessler:
Exploiting Symmetries for Optimal Integrated Code Generation
Proc. Int. Conf. on Embedded Systems and Applications (ESA'04), June 21-24, 2004, Las Vegas, Nevada, USA.

[C24]
Håkan Mattsson and Christoph Kessler:
Towards a Bulk-Synchronous Distributed Shared Memory Programming Environment for Grids
In: J. Dongarra, K. Madsen and J. Wasniewski, eds., Proc. PARA'04 Workshop on State-of-the-art in Scientific Computing, Lyngby, Denmark, June 2004, Springer LNCS 3732, pp. 519-526, 2006.

[C25]
Mikhail Chalabine and Christoph Kessler:
Parallelisation of Sequential Programs by Invasive Composition and Aspect Weaving
Proc. APPT'05 Sixth International Workshop on Advanced Parallel Processing Technologies (APPT 2005), Hong Kong, China, 27-28 Oct. 2005. Springer LNCS 3756, pp. 131-140, 2005.

[C26]
Mikhail Chalabine and Christoph Kessler:
Crosscutting Concerns in Parallelization by Invasive Software Composition and Aspect Weaving
Proc. Hawaii Int. Conference on System Sciences (HICSS-39), Kauai, Hawaii, USA, Jan. 2006.

[C27]
Mattias Eriksson, Christoph Kessler, and Mikhail Chalabine:
Load Balancing of Irregular Parallel Divide-and-Conquer Algorithms in Group-SPMD Programming Environments
Proc. 8th Workshop on Parallel Systems and Algorithms (PASA 2006), Frankfurt am Main, Germany, March 2006.
GI Lecture Notes in Informatics (LNI), vol. P-81, pp. 313-322, 2006.

[C28]
Andrzej Bednarski, Christoph Kessler:
Optimal Integrated VLIW Code Generation with Integer Linear Programming.
Proc. Euro-Par 2006 conference, Springer LNCS 4128, pp. 461-472, 2006.

[C29]
Mikhail Chalabine, Christoph Kessler, Peter Bunus:
Automatic Round-trip Software Engineering in Aspect Weaving Systems.
Proc. 21st IEEE/ACM International Conference on Automated Software Engineering (ASE2006), Tokyo, Japan, Sept. 2006, pp. 305-308.

[C30]
Christoph Kessler, Peter Fritzson, Mattias Eriksson.
NestStepModelica - Mathematical Modeling and Bulk-Synchronous Parallel Simulation.
In: Bo Kågström, Erik Elmroth, Jack Dongarra and Jerzy Wasniewski (eds.): PARA'06 State-of-the-Art in Scientific and Parallel Computing, Umeå, Sweden, June 2006. Springer LNCS vol. 4699, pp. 1006-1015, 2007. DOI: 10.1007/978-3-540-75755-9_118

[C31]
Mikhail Chalabine, Christoph Kessler.
A Formal Framework for Automated Round-trip Software Engineering in Static Aspect Weaving and Transformations.
Proc. ACM SIGSOFT/IEEE 29th Int. Conference on Software Engineering (ICSE-2007), Minneapolis, USA, May 2007.

[C32]
Mikhail Chalabine, Christoph Kessler.
A Survey of Reasoning in Parallelization.
Proc. 8th ACIS Int. Conf. on Software Engineering, Artificial Intelligence, Networking, and Parallel/Distributed Computing (SNPD 2007), Qingdao, China, July 2007. IEEE.

[C33]
Christoph Kessler, Welf Löwe.
A Framework for Performance-Aware Composition of Explicitly Parallel Components.
Proc. ParCo-2007 conference, Jülich/Aachen, Germany, Sept. 2007.
In C. Bischof et al. (eds.): Parallel Computing: Architectures, Algorithms and Applications, Advances in Parallel Computing Series, Volume 15, IOS Press, pp. 227-234, Feb. 2008.
Also published in: NIC Series Volume 38, December 2007, ISBN 978-3-9810843-4-4.

[C34]
Jörg Keller, Christoph Kessler, Kalle König, Wolfgang Heenes.
Hybrid Parallel Sort on the Cell Processor.
Proc. 9th Workshop on Parallel Systems and Algorithms (PASA-2008), Dresden, Germany, Feb. 2008. GI Lecture Notes in Informatics (LNI), vol. 124, pp. 107-112, 2008. Reprinted in PARS-Mitteilungen 25, pp. 91-96, GI/ITG PARS, ISSN 0177-0454, Dec. 2008.

[C35]
Jesper Andersson, Morgan Ericsson, Christoph Kessler, Welf Löwe.
Profile-Guided Composition
Proc. 7th Int. Symposium on Software Composition (SC 2008) at ETAPS, Budapest, Hungary, March 2008. Springer LNCS 4954: 157-164.

[C36]
Mattias Eriksson, Oskar Skoog, Christoph Kessler.
Optimal vs. Heuristic Integrated Code Generation for Clustered VLIW Architectures.
Proc. 11th ACM SIGBED Int. Workshop on Software and Compilers for Embedded Systems (SCOPES 2008), München, Germany, March 2008, pp. 11-20. ACM.

[C37]
Markus Ålind, Mattias Eriksson, Christoph Kessler.
BlockLib: A Skeleton Library for Cell Broadband Engine.
Proc. Int. Workshop on Multicore Software Engineering (IWMSE-2008) at ICSE-2008, Leipzig, Germany, May 2008. ACM.

[C38]
Jörg Keller, Christoph Kessler.
Optimized Pipelined Parallel Merge Sort on the Cell BE.
2nd Int. Workshop on Highly Parallel Processing on a Chip (HPPC-2008) at Euro-Par 2008, Las Palmas de Gran Canaria, Spain, Aug. 2008. Pre-Proceedings, pp. 8-17, 2008. Final version in E. Luque et al. (Eds.): Euro-Par 2008 Workshops, Springer LNCS 5415: 131-140, 2009.

[C39]
Andreas Leha, Mikhail Chalabine, Christoph Kessler.
Parallelizing Scientific Code with Invasive Interactive Parallelization - A Case Study with Reuseware.
Proc. Int. Workshop on Component-Based High Performance Computing (CBHPC-2008), Oct. 2008.

[C40]
Morgan Ericsson, Welf Löwe, Christoph Kessler, Jesper Andersson.
Composition and Optimization.
Proc. Int. Workshop on Component-Based High Performance Computing (CBHPC-2008), Oct. 2008.

[C41]
Mattias Eriksson, Christoph Kessler:
Integrated Modulo Scheduling for Clustered VLIW Architectures.
Proc. HiPEAC-2009 High-Performance and Embedded Architecture and Compilers, Paphos, Cyprus, Jan. 2009. Springer LNCS 5409, pp. 65-79.

[C42]
Rikard Hulten, Christoph Kessler, Jörg Keller:
Optimized On-Chip Pipelined Mergesort on the Cell/B.E.
Proc. EuroPar-2010 conference, Part II, Springer LNCS 6272, pp. 187-198, August 2010.
PDF (authors' final version); the original publication is available at link.springer.com.

[C43]
Johan Enmyren, Christoph Kessler:
SkePU: A Multi-Backend Skeleton Programming Library for Multi-GPU Systems.
Proc. 4th Int. Workshop on High-Level Parallel Programming and Applications (HLPP-2010), Baltimore, USA, Sep. 2010, pages 5-14. ACM. DOI: 10.1145/1863482.1863487.

[C44]
Usman Dastgeer, Johan Enmyren, Christoph Kessler:
Auto-tuning SkePU: A Multi-Backend Skeleton Programming Framework for Multi-GPU Systems.
Proc. IWMSE-2011, Hawaii, USA, May 2011, pp. 25-32, ACM.
A previous version was also presented at:
Proc. Fourth Workshop on Programmability Issues for Multi-Core Computers (MULTIPROG-2011), January 23, 2011, in conjunction with HiPEAC-2011 conference, Heraklion, Greece.

[C45]
Mattias Eriksson, Christoph Kessler:
Integrated Offset Assignment.
Proc. 9th Workshop on Optimizations for DSP and Embedded Systems (ODES-9), co-located with CGO-2011, Chamonix, France, April 2011.

[C46]
Erik Hansson, Joar Sohl, Christoph Kessler, Dake Liu:
Case Study of Efficient Parallel Memory Access Programming for the Embedded Heterogeneous Multicore DSP Architecture ePUMA.
Proc. Int. Workshop on Multi-Core Computing Systems (MuCoCoS-2011), June 2011, Seoul, Korea. IEEE CS Press.

[C47]
Antonina Danylenko, Welf Löwe, Christoph Kessler:
Comparing Machine Learning Approaches for Context-Aware Composition.
Proc. 10th Int. Conf. on Software Composition, June/July 2011, Zürich, Switzerland. Springer LNCS vol. 6708, 2011. DOI: 10.1007/978-3-642-22045-6_2.

[C48]
Kenan Avdic, Nicolas Melot, Jörg Keller, Christoph Kessler:
Parallel sorting on Intel Single-Chip Cloud computer.
2nd Int. Workshop on Applications for Multi- and Manycore Processors (A4MMC-2), co-located with ISCA-2011, June 2011, San Jose, CA, USA. ACM.

[C49]
Nicolas Melot, Kenan Avdic, Jörg Keller, Christoph Kessler:
Investigation of main memory bandwidth on Intel Single-Chip Cloud Computer.
Proc. 3rd Many-core Applications Research Community Symposium (MARC3), Ettlingen, Germany, July 5-6, 2011. KIT Scientific Publishing.

[C50]
Usman Dastgeer, Christoph Kessler, Samuel Thibault:
Flexible runtime support for efficient skeleton programming on hybrid systems.
Proc. ParCo-2011 Int. Conference on Parallel Computing, Ghent, Belgium, Sep. 2011.
In: Advances in Parallel Computing vol. 22, DOI 10.3233/978-1-61499-041-3-159, pages 159-166, IOS press.

[C51]
Sabri Pllana, Siegfried Benkner, Jesper Larsson Träff, Philippas Tsigas, Andrew Richards, Raymond Namyst, Beverly Bachmayer, Christoph Kessler, David Moloney, Peter Sanders:
The PEPPHER Approach to Programmability and Performance Portability for Heterogeneous many-core Architectures.
Proc. ParCo-2011 Int. Conference on Parallel Computing, Ghent, Belgium, Sep. 2011.

[C52]
Jörg Keller, Mudassar Majeed, Christoph Kessler:
Balancing CPU Load for Irregular MPI Applications.
Proc. ParCo-2011 Int. Conference on Parallel Computing, Ghent, Belgium, Sep. 2011.

[C53]
Usman Dastgeer and Christoph Kessler:
A performance-portable generic component for 2D convolution computations on GPU-based systems.
Proc. Fifth Workshop on Programmability Issues for Multi-Core Computers (MULTIPROG-2012) Workshop at HiPEAC-2012, Paris, Jan. 2012.

[C54]
Akhtar Ali, Usman Dastgeer and Christoph Kessler.
OpenCL on shared memory multicore CPUs.
Proc. Fifth Workshop on Programmability Issues for Multi-Core Computers (MULTIPROG-2012) at HiPEAC-2012, Paris, Jan. 2012.

[C55]
Christoph Kessler, Erik Hansson:
Flexible Scheduling and Thread Allocation for Synchronous Parallel Tasks.
Proc. PASA-2012, München, Germany, Feb. 2012. In ARCS-2012 workshops, GI Lecture Notes in Informatics vol. P-200, pp. 507-514.

[C56]
Christoph Kessler, Usman Dastgeer, Samuel Thibault, Raymond Namyst, Andrew Richards, Uwe Dolinsky, Siegfried Benkner, Jesper Larsson Träff and Sabri Pllana:
Programmability and Performance Portability Aspects of Heterogeneous Multi-/Manycore Systems.
Proc. DATE-2012 conference on Design, Automation and Test in Europe, Dresden, March 2012. IEEE CS Press, pp. 1403-1408.

[C57]
Jürgen Brenner, Christoph Kessler, and Jörg Keller.
Executing PRAM Programs on GPUs.
Proc. International Conference on Computational Science, ICCS 2012, PAPP Workshop, Omaha, Nebraska (USA),
in Procedia CS 9: 1799-1806, Elsevier, June 2012. PDF (Author version)

[C58]
Nicolas Melot, Christoph Kessler, Kenan Avdic, Patrick Cichowski, Jörg Keller:
Engineering parallel sorting for the Intel SCC.
Proc. International Conference on Computational Science, ICCS 2012, WEPA Workshop, Omaha, Nebraska (USA),
in Procedia CS 9: 1890-1899, Elsevier, June 2012. PDF (Author version)

[C59]
Jari-Matti Mäkelä, Erik Hansson, Daniel Åkesson, Martti Forsell, Christoph Kessler, Ville Leppänen:
Design of the Language Replica for Hybrid PRAM-NUMA Many-Core Architectures.
Proc. ISPA 2012 4th IEEE International Workshop on Multicore and Multithreaded Architectures and Algorithms, July 2012.

[C60]
Martin Kessler, Erik Hansson, Daniel Åkesson, Christoph Kessler:
Exploiting Instruction Level Parallelism for REPLICA - A Configurable VLIW Architecture With Chained Functional Units.
Proc. 18th Int. Conf. on Parallel and Distributed Processing Techniques and Applications (PDPTA'12), Las Vegas, USA, July 2012. Pages 275-281, CSREA Press, ISBN 1-60132-228-3.

[C61]
Lu Li, Usman Dastgeer, Christoph Kessler:
Adaptive off-line tuning for optimized composition of components for heterogeneous many-core systems.
Seventh International Workshop on Automatic Performance Tuning (iWAPT-2012), 17 July 2012, Kobe, Japan.
In: Proc. VECPAR-2012 Conference, Kobe, Japan, July 2012. Springer LNCS 7851, pp. 329-345, 2013.
Author version (PDF), 2012

[C62]
Patrick Cichowski, Jörg Keller and Christoph Kessler:
Modelling Power Consumption of the Intel SCC.
Proc. 6th MARC Symposium at ONERA'2012, Toulouse, France, 19-20 July 2012. Published by HAL, France.

[C63]
Usman Dastgeer, Lu Li, Christoph Kessler:
The PEPPHER Composition Tool: Performance-Aware Dynamic Composition of Applications for GPU-based Systems.
In: Proc. 2012 Int. Workshop on Multi-Core Computing Systems (MuCoCoS 2012), Nov. 16, 2012, Salt Lake City, Utah, USA, in conjunction with Supercomputing Conference (SC12). ISBN 978-1-4673-6218-4, DOI: 10.1109/SC.Companion.2012.97, IEEE.

[C64]
Patrick Cichowski, Jörg Keller, Christoph Kessler:
Energy-efficient Mapping of Task Collections onto Manycore Processors.
In: E. Ayguade et al. (eds.), Proc. Sixth Workshop on Programmability Issues for Multi-Core Computers (MULTIPROG'13) at HiPEAC'13, Berlin, Germany, Jan. 2013.

[C65]
Martti Forsell, Erik Hansson, Christoph Kessler, Jari-Matti Mäkelä, Ville Leppänen:
Hardware and Software Support for NUMA Computing on Configurable Emulated Shared Memory Architectures.
15th Workshop on Advances on Parallel and Distributed Computational Models (APDCM 2013), May 2013, in: IPDPS-2013 Workshop proceedings, ISBN 978-0-7695-4979-8, pages 640-648, IEEE, 2013.

[C66]
Mudassar Majeed, Usman Dastgeer, Christoph Kessler:
Cluster-SkePU: A Multi-Backend Skeleton Programming Library for GPU Clusters
Proc. Int. Conf. on Parallel and Distr. Processing Techniques and Applications (PDPTA-2013), Las Vegas, USA, July 2013.

[C67]
Usman Dastgeer, Lu Li, Christoph Kessler:
Adaptive implementation selection in the SkePU skeleton programming library.
Proc. of the 2013 Biennial Conference on Advanced Parallel Processing Technology (APPT-2013), Stockholm, Sweden, Aug. 2013. Springer LNCS 8299, pp. 170-183, 2013.

[C68]
Usman Dastgeer, Christoph Kessler:
A Framework for Performance-aware Composition of Applications for GPU-based Systems.
Proc. Sixth International Workshop on Parallel Programming Models and Systems Software for High-End Computing (P2S2), 2013, in conjunction with 42nd Int. Conf. on Parallel Processing (ICPP-2013), Lyon, France, October 2013. DOI 10.1109/ICPP.2013.83, IEEE.

[C69]
Christoph Kessler, Nicolas Melot, Patrick Eitschberger, and Jörg Keller:
Crown Scheduling: Energy-Efficient Resource Allocation, Mapping and Discrete Frequency Scaling for Collections of Malleable Streaming Tasks.
Proc. of 23rd Int. Workshop on Power and Timing Modeling, Optimization and Simulation (PATMOS-2013), Karlsruhe, Germany, Sept. 9-11, 2013.
Nominated for Best-Paper Award (top-4).

[C70]
Usman Dastgeer and Christoph Kessler:
Conditional component composition for GPU-based systems.
Proc. Seventh Workshop on Programmability Issues for Multi-Core Computers (MULTIPROG-2014) at HiPEAC-2014, Vienna, Austria, Jan. 2014.

[C71]
Erik Hansson, Erik Alnervik, Christoph Kessler, and Martti Forsell:
A Quantitative Comparison of PRAM based Emulated Shared Memory Architectures to Current Multicore CPUs and GPUs.
PASA-2014 11th Workshop on Parallel Systems and Algorithms, Lübeck, Germany, Feb. 2014.
In Proc. Architecture of Computer Systems (ARCS'14), VDE-Verlag, ISBN 978-3-8007-3579-2, 2014.
Appeared also in PARS-Mitteilungen 31:27-33, Sep. 2014, ISSN 0177-0454.

[C72]
Usman Dastgeer, Christoph Kessler:
Smart containers and skeleton programming for GPU-based systems.
In C. Grelck (ed.): Proc. of the 7th Int. Symposium on High-level Parallel Programming and Applications (HLPP'14), Amsterdam, July 2014.

[C73]
Lu Li, Usman Dastgeer, Christoph Kessler:
Pruning strategies in adaptive off-line tuning for optimized composition of components on heterogeneous systems.
Proc. Seventh International Workshop on Parallel Programming Models and Systems Software for High-End Computing (P2S2) at ICPP, Minneapolis, USA, Sep. 2014, pp. 255-264, IEEE.

[C74]
Erik Hansson, Christoph Kessler:
Optimized selection of runtime mode for the reconfigurable PRAM-NUMA architecture REPLICA using machine-learning.
7th International Workshop on Multi-/Many-Core Computing Systems (MuCoCoS-2014), in conjunction with the Euro-Par 2014 Conference, 26 August, 2014, Porto, Portugal. Euro-Par Workshops Proceedings, Springer LNCS vol. 8806, pp. 133-145, 2014.

[C75]
Erik Hansson, Christoph Kessler:
Global optimization of execution mode selection for the reconfigurable PRAM-NUMA multicore architecture REPLICA.
Proc. PDAA workshop at CANDAR'14 symposium, Shizuoka, Japan, Dec. 2014. IEEE. DOI: 10.1109/CANDAR.2014.72

[C76]
Lu Li, Christoph Kessler:
Validating Energy Compositionality of GPU Computations.
Proc. HIPEAC Workshop on Energy Efficiency with Heterogeneous Computing (EEHCO-2015), Amsterdam, Jan. 2015.

[C77]
Lu Li, Christoph Kessler:
MeterPU: A Generic Measurement Abstraction API Enabling Energy-tuned Skeleton Backend Selection.
International Workshop on Reengineering for Parallelism in Heterogeneous Parallel Platforms (REPARA-2015). In: Proc. of ISPA-2015 conference, Helsinki, Finland, Aug. 2015, pp. 154-159, IEEE. DOI: 10.1109/Trustcom.2015.625

[C78]
Nicolas Melot, Johan Janzen, Christoph Kessler:
Mimer and Schedeval: Comparison Tools for Static Schedulers and Streaming Applications on Concrete Manycore Architectures.
Eighth International Workshop on Parallel Programming Models and Systems Software for High-End Computing (P2S2) at ICPP, 2015. IEEE. DOI: 10.1109/ICPPW.2015.24

[C79]
Christoph Kessler, Lu Li, Aras Atalar, Alin Dobre:
XPDL: Extensible Platform Description Language to Support Energy Modeling and Optimization.
Proc. 44th International Conference on Parallel Processing Workshops, ICPP-EMS Embedded Multicore Systems, in conjunction with ICPP-2015, Beijing, 1-4 Sep. 2015, pp. 51-60, IEEE. DOI 10.1109/ICPPW.2015.17

[C80]
Erik Hansson, Christoph Kessler:
Optimized variant-selection code generation for loops on heterogeneous multicore systems.
ParCo-2015 conference, Edinburgh, UK, 1-4 Sep. 2015.
Published in: Gerhard R. Joubert, Hugh Leather, Mark Parsons, Frans Peters, Mark Sawyer (eds.): Advances in Parallel Computing, Volume 27: Parallel Computing: On the Road to Exascale, IOS Press, April 2016, pages 103-112. DOI 10.3233/978-1-61499-621-7-103.

[C81]
Oskar Sjöström, Soon-Heum Ko, Usman Dastgeer, Lu Li, Christoph Kessler:
Portable Parallelization of the EDGE CFD Application for GPU-based Systems using the SkePU Skeleton Programming Library.
ParCo-2015 conference, Edinburgh, UK, 1-4 Sep. 2015.
Published in: Gerhard R. Joubert, Hugh Leather, Mark Parsons, Frans Peters, Mark Sawyer (eds.): Advances in Parallel Computing, Volume 27: Parallel Computing: On the Road to Exascale, IOS Press, April 2016, pages 135-144. DOI 10.3233/978-1-61499-621-7-135.

[C82]
Nicolas Melot, Christoph Kessler, Jörg Keller:
Improving Energy-Efficiency of Static Schedules by Core Consolidation and Switching Off Unused Cores.
ParCo-2015 conference, Edinburgh, UK, 1-4 Sep. 2015.
Published in: Gerhard R. Joubert, Hugh Leather, Mark Parsons, Frans Peters, Mark Sawyer (eds.): Advances in Parallel Computing, Volume 27: Parallel Computing: On the Road to Exascale, IOS Press, April 2016, pages 285-294. DOI 10.3233/978-1-61499-621-7-285.
PDF (Author version)

    [C83]
    Sebastian Thorarensen, Rosandra Cuello, Christoph Kessler, Lu Li, Brendan Barry:
    Efficient Execution of SkePU Skeleton Programs on the Low-power Multicore Processor Myriad2.
    Proc. Euromicro PDP-2016 Int. Conf. on Parallel, Distributed, and Network-Based Processing, Heraklion, Greece, Feb. 2016, pages 398-402, IEEE. DOI: 10.1109/PDP.2016.123

    [C84]
    Lu Li, Christoph Kessler:
    VectorPU: A Generic and Efficient Data-container and Component Model for Transparent Data Transfer on GPU-based Heterogeneous Systems.
    Proc. 8th Workshop on Parallel Programming and Run-Time Management Techniques for Many-core Architectures and 6th Workshop on Design Tools and Architectures for Multicore Embedded Computing Platforms (PARMA-DITAM'17), Stockholm, Jan. 2017, ACM. DOI: 10.1145/3029580.3029582
    Presentation slides (PDF)

    [C85]
    Manfred Torggler, Christoph Kessler, Jörg Keller:
    Asymmetric Crown Scheduling.
    Proc. Euromicro PDP-2017 Int. Conf. on Parallel, Distributed, and Network-Based Processing, St. Petersburg, Mar. 2017, IEEE. DOI: 10.1109/PDP.2017.22

    [C86]
    Suejb Memeti, Lu Li, Sabri Pllana, Joanna Kolodziej, Christoph Kessler:
    Benchmarking OpenCL, OpenACC, OpenMP, and CUDA: Programming Productivity, Performance, and Energy Consumption.
    Proceedings of the 2017 Workshop on Adaptive Resource Management and Scheduling for Cloud Computing (ARMS-CC'17), Washington, DC, USA, pp. 1-6, July 2017, ACM. DOI: 10.1145/3110355.3110356

    [C87]
    Lu Li, Christoph Kessler:
    Lazy Allocation and Transfer Fusion Optimization for GPU-based Heterogeneous Systems.
    Proc. Euromicro PDP-2018 Int. Conf. on Parallel, Distributed, and Network-Based Processing, Cambridge, UK, Mar. 2018, pp. 311-315, IEEE. DOI: 10.1109/PDP2018.2018.00054

    [C88]
    Ludovic Henrio, Christoph Kessler, Lu Li:
    Ensuring Memory Consistency in Heterogeneous Systems Based on Access Mode Declarations.
    Proc. 2018 International Symposium on Formal Approaches to Parallel and Distributed Computing (4PAD), part of the 2018 International Conference on High Performance Computing & Simulation (HPCS 2018), Orleans, France, July 16-20, 2018. IEEE, pp. 716-723. DOI: 10.1109/HPCS.2018.00117
    Preprint (PDF)

    [C89]
    Jörg Keller, Christoph Kessler:
    Dealing with Hardware Faults in Energy-Efficient Static Schedules of Multi-Variant Programs on Heterogeneous Platforms.
    15th GI/ITG Workshop on Dependability and Fault Tolerance (VERFE'19), Copenhagen, Denmark, 20-21 May 2019. In: C. Trinitis, T. Pionteck (eds.), ARCS 2019 Workshop Proceedings, ISBN 978-3-8007-4957-7 (E-Book: ISBN 978-3-8007-4958-4), VDE/IEEE, 2019.

    [C90]
    Nicolas Melot, Christoph Kessler, Jörg Keller, Patrick Eitschberger:
    Co-optimizing Core Allocation, Mapping and DVFS in Streaming Programs with Moldable Tasks for Energy Efficient Execution on Manycore Architectures.
    19th International Conference on Application of Concurrency to System Design (ACSD-2019), Aachen, Germany, June 23-28, 2019. IEEE. DOI: 10.1109/ACSD.2019.00011

    [C91]
    Christoph Kessler:
    Global Optimization of Operand Transfer Fusion in Heterogeneous Computing.
    Proc. 22nd International Workshop on Software and Compilers for Embedded Systems (SCOPES-2019), St. Goar, Germany, May 2019. ACM. DOI: 10.1145/3323439.3323981
    Prototype implementation source code

    [C92]
    Sebastian Litzinger, Jörg Keller, Christoph Kessler:
    Scheduling Moldable Parallel Streaming Tasks on Heterogeneous Platforms with Frequency Scaling.
    Proc. 27th European Signal Processing Conference (EUSIPCO 2019), A Coruna, Spain, Sep. 2019, IEEE. DOI: 10.23919/EUSIPCO.2019.8903180
    Slides (PDF) (as presented at MCC'19, Karlskrona, Nov. 2019)

    [C93]
    Christoph Kessler, Sebastian Litzinger, Jörg Keller:
    Adaptive Crown Scheduling for Streaming Tasks on Many-Core Systems with Discrete DVFS.
    Euro-Par 2019 Workshop Third International Workshop on Autonomic Solutions for Parallel and Distributed Data Stream Processing (Auto-DaSP 2019), Göttingen, Germany, Aug. 2019.
    In: Euro-Par 2019: Parallel Processing Workshops. Euro-Par 2019. Springer Lecture Notes in Computer Science, vol. 11997, pp. 17-29, May 2020. DOI: 10.1007/978-3-030-48340-1_2

    [C94]
    August Ernstsson, Christoph Kessler:
    Multi-variant User Functions for Platform-aware Skeleton Programming.
    Proc. of ParCo-2019 conference, Prague, Sep. 2019, in: I. Foster et al. (Eds.), Parallel Computing: Technology Trends, series: Advances in Parallel Computing, vol. 36, IOS press, March 2020, pages 475-484. DOI: 10.3233/APC200074.
    PDF (open access)

    [C95]
    Christoph Kessler, Sebastian Litzinger, Jörg Keller:
    Robustness and Energy-elasticity of Crown Schedules for Sets of Parallelizable Tasks on Many-core Systems with DVFS.
    Proc. 28th Euromicro International Conference on Parallel, Distributed and Network-based Processing (PDP'20), Västerås, Sweden, March 2020, IEEE. DOI: 10.1109/PDP50117.2020.00027

    [C96]
    Sebastian Litzinger, Jörg Keller, Christoph Kessler:
    Maximizing Profit in Energy-Efficient Moldable Task Execution with Deadline.
    Proc. 28th Euromicro International Conference on Parallel, Distributed and Network-based Processing (PDP'20), Västerås, Sweden, March 2020, IEEE. DOI: 10.1109/PDP50117.2020.00029

    [C97]
    Nicolas Melot, Christoph Kessler, Jörg Keller:
    Voltage Island-Aware Energy-Efficient Scheduling of Parallel Streaming Tasks on Many-Core CPUs.
    Proc. 28th Euromicro International Conference on Parallel, Distributed and Network-based Processing (PDP'20), Västerås, Sweden, March 2020, IEEE. DOI: 10.1109/PDP50117.2020.00030

    [C98]
    Jörg Keller, Sebastian Litzinger, Christoph Kessler:
    Combining Design Space Exploration with Task Scheduling of Moldable Streaming Tasks on Reconfigurable Platforms.
    Proc. International Symposium on Reconfigurable Computing, Rennes, France, June 2021, Springer LNCS.

    [C99]
    Christoph Kessler, Jörg Keller, Sebastian Litzinger:
    Temperature-Aware Energy-Optimal Scheduling of Moldable Streaming Tasks onto 2D-Mesh-Based Many-Core CPUs with DVFS
    Proc. 24th Workshop on Job Scheduling Strategies for Parallel Processing (JSSPP 2021) in conjunction with IPDPS 2021, Portland, Oregon USA, 21 May 2021, Springer LNCS 12985, Oct. 2021. DOI: 10.1007/978-3-030-88224-2_9

    [C100]
    Gabriella Andrade, Dalvan Griebler, Rodrigo Santos, Christoph Kessler, August Ernstsson, Luiz G. Fernandes:
    Analyzing Programming Effort Model Accuracy of High-Level Parallel Programs for Stream Processing.
    Proc. 48th Euromicro Conference on Software Engineering and Advanced Applications (SEAA), 2022, pp. 229-232, IEEE. DOI 10.1109/SEAA56994.2022.00043.

    [C101]
    Michail Boulasikis, Christoph Kessler, Flavius Gruian, Jörg Keller, Sebastian Litzinger:
    Packet-Type Aware Scheduling of Moldable Streaming Tasks on Multicore Systems with DVFS.
    Proc. of ACM Symposium on Applied Computing (SAC'24), Avila, Spain, April 2024, pp. 449-451.

    [C102]
    Sajad Khosravi, Christoph Kessler, Sebastian Litzinger, Jörg Keller:
    Energy-Efficient Scheduling of Moldable Streaming Computations for the Edge-Cloud Continuum.
    Proc. 9th Int. Conference on Fog and Mobile Edge Computing (FMEC 2024), Sep. 2024. IEEE. DOI: 10.1109/FMEC62297.2024.10710310



    Papers in Peer-Reviewed National Workshops

    [W1]
    Daniel Johansson, Mattias Eriksson, Christoph Kessler.
    Bulk-synchronous parallel computing on the CELL processor.
    PARS'07: 21. PARS - Workshop, Hamburg, Germany, May 31-Jun 1, 2007. GI/ITG-Fachgruppe Parallel-Algorithmen, -Rechnerstrukturen und -Systemsoftware (PARS). In: PARS-Mitteilungen 24: 90-99, ISSN 0177-0454, Dec. 2007.

    [W2]
    Bert Wesarg, Holger Blaar, Jörg Keller, Christoph Kessler:
    Emulating a PRAM on a Parallel Computer.
    PARS'07: 21. PARS - Workshop, Hamburg, Germany, May 31-Jun 1, 2007. GI/ITG-Fachgruppe Parallel-Algorithmen, -Rechnerstrukturen und -Systemsoftware (PARS). In: PARS-Mitteilungen 24: 77-89, ISSN 0177-0454, Dec. 2007.

    [W3]
    Christoph W. Kessler, Jörg Keller
    Optimized On-Chip Pipelining of Memory-Intensive Computations on the Cell BE.
    ACM Computer Architecture News 36(5): 36-45, Dec. 2008.
    Special issue on MCC-2008 First Swedish Workshop on Multicore Computing, Ronneby, Sweden, Nov. 2008.
    Presentation slides (PDF)

    [W4]
    Håkan Lundvall, Kristian Stavåker, Peter Fritzson, Christoph Kessler.
    Automatic Parallelization of Simulation Code for Equation-based Models with Software Pipelining and Measurements on Three Platforms.
    ACM Computer Architecture News 36(5): 46-55, Dec. 2008.
    Special issue on MCC-2008 First Swedish Workshop on Multicore Computing, Ronneby, Sweden, Nov. 2008.

    [W5]
    Jörg Keller, Christoph Kessler, Bert Wesarg.
    Efficient Simulation of Fork Programs on Multicore Machines.
    PARS'09: 22nd PARS-Workshop, Parsberg, Germany, Jun 4-5, 2009. GI/ITG-Fachgruppe Parallel-Algorithmen, -Rechnerstrukturen und -Systemsoftware (PARS). In: PARS-Mitteilungen 26: 84-90, ISSN 0177-0454, Dec. 2009.

    [W6]
    Christoph W. Kessler, Wladimir Schamai, Peter Fritzson:
    Platform-independent modeling of explicitly parallel programs.
    PARS'10: 23rd PARS-Workshop on Parallel Systems and Algorithms, Hannover, Germany, Feb. 2010. In: M. Beigl and F. Cazorla-Almeida (Eds.): ARCS'10 Workshop Proceedings, ISBN 978-3-8007-3222-7, VDE-Verlag Berlin/Offenbach, Germany, pp. 83-93.
    Also in: PARS-Mitteilungen 27: 73-83, ISSN 0177-0454, Sep. 2010.

    [W7]
    Johan Enmyren, Usman Dastgeer, Christoph Kessler:
    Towards a Tunable Multi-Backend Skeleton Programming Framework for Multi-GPU Systems.
    Proc. MCC-2010 Third Swedish Workshop on Multicore Computing, Gothenburg, Sweden, Nov. 2010.

    [W8]
    Erik Hansson, Joar Sohl, Christoph Kessler, Dake Liu.
    Case Study of Efficient Parallel Memory Access Programming for an Embedded Heterogeneous Multicore DSP Architecture.
    Proc. MCC-2010 Third Swedish Workshop on Multicore Computing, Gothenburg, Sweden, Nov. 2010.

    [W9]
    Christoph Kessler and Welf Löwe:
    Context-Aware Composition of Parallel Components.
    STEW-2011 workshop, Ericsson, Stockholm, Sweden, Nov. 23-24, 2011.

    [W10]
    Usman Dastgeer and Christoph Kessler:
    A performance-portable generic component for 2D convolution computations on GPU-based systems.
    Proc. MCC-2011, Linköping, Sweden, Nov. 23-25, 2011, pp. 39-44.

    [W11]
    Akhtar Ali, Usman Dastgeer and Christoph Kessler.
    OpenCL on shared memory multicore CPUs.
    Proc. MCC-2011, Linköping, Sweden, Nov. 23-25, 2011, pp. 65-70.

    [W12]
    Kenan Avdic, Nicolas Melot, Christoph Kessler and Jörg Keller.
    Pipelined parallel sorting on the Intel SCC.
    Proc. MCC-2011, Linköping, Sweden, Nov. 23-25, 2011, pp. 96-101.

    [W13]
    Mudassar Majeed, Usman Dastgeer, Christoph Kessler:
    Structured Development of Scalable Scientific Applications for GPU Clusters.
    In M. Brorsson and A. Hemani (eds.), Proc. MCC'12 Fifth Swedish Workshop on Multicore Computing, Nov. 2012, Stockholm, pp. 57-60, ISSN 1653-7238.

    [W14]
    Lu Li, Usman Dastgeer, Christoph Kessler:
    Pruning strategies in adaptive off-line tuning for optimized composition of components on heterogeneous systems.
    In M. Brorsson and A. Hemani (eds.), Proc. MCC'12 Fifth Swedish Workshop on Multicore Computing, Nov. 2012, Stockholm, pp. 52-55, ISSN 1653-7238.

    [W15]
    Amin Shafiee Sarvestani, Erik Hansson, Christoph Kessler:
    Towards Domain Specific Automatic Parallelization.
    In M. Brorsson and A. Hemani (eds.), Proc. MCC'12 Fifth Swedish Workshop on Multicore Computing, Nov. 2012, Stockholm, pp. 70-73, ISSN 1653-7238.

    [W16]
    Patrick Cichowski, Jörg Keller, Christoph Kessler:
    Energy-efficient Mapping of Task Collections onto Manycore Processors.
    In M. Brorsson and A. Hemani (eds.), Proc. MCC'12 Fifth Swedish Workshop on Multicore Computing, Nov. 2012, Stockholm, pp. 6-9, ISSN 1653-7238.

    [W17]
    Christoph Kessler, Patrick Eitschberger, Jörg Keller:
    Energy-Efficient Static Scheduling of Streaming Task Collections with Malleable Tasks.
    Proc. 25th PARS Workshop, April 2013, Erlangen, Germany.
    In PARS-Mitteilungen (ISSN 0177-0454) vol. 30, pp. 37-46, Sep. 2013.

    [W18]
    Patrick Eitschberger, Jörg Keller, Frank Thiele, Christoph Kessler:
    Exploring the Placement of Memory Controllers in Manycore Processors: Case Study on Intel SCC
    Proc. MCC'13 Sixth Swedish Workshop on Multicore Computing, Nov. 2013, Halmstad, Sweden.

    [W19]
    Nicolas Melot, Christoph Kessler, Jörg Keller:
    Energy-Efficient Mapping of Streaming Tasks for Crown Scheduling on Many-Core Systems.
    Proc. MCC'13 Sixth Swedish Workshop on Multicore Computing, Nov. 2013, Halmstad, Sweden.

    [W20]
    Usman Dastgeer, Christoph Kessler:
    Conditional component composition for GPU-based systems.
    Proc. MCC'13 Sixth Swedish Workshop on Multicore Computing, Nov. 2013, Halmstad, Sweden.

    [W21]
    Erik Hansson, Erik Alnervik, Christoph Kessler, Martti Forsell:
    A Quantitative Comparison of Emulated Shared Memory Architectures to Current Multicore CPUs and GPUs.
    Proc. MCC'13 Sixth Swedish Workshop on Multicore Computing, Nov. 2013, Halmstad, Sweden.

    [W22]
    Lu Li, Christoph Kessler:
    Validating Energy Compositionality of GPU Computations
    Proc. MCC'14 Seventh Swedish Workshop on Multicore Computing, Nov. 2014, Lund, Sweden.


    Invited Papers

    [I1]
    Christoph W. Kessler
    Pattern Recognition Enables Automatic Parallelization of Numerical Codes
    in: H.J. Sips [Ed.]: Proc. of CPC'93 4th Int. Workshop on Compilers for Parallel Computers, Delft University of Technology, the Netherlands, Dec. 13-16, 1993, pp. 385-397.

    [I2]
    Christoph W. Kessler
    The PARAMAT Project: Current Status and Plans for the Future
    Proc. of AP'95 2nd Workshop on Automatic Data Layout and Performance Prediction, CRPC-TR95548, Rice University, Houston, Apr. 1995.

    [I3]
    Christoph W. Kessler, Helmut Seidl.
    Fork95 Language and Compiler for the SB-PRAM
    Proceedings of CPC'95 5th Int. Workshop on Compilers for Parallel Computers, Malaga, June 28-30, 1995.

    [I4]
    Christoph W. Kessler, Helmut Seidl:
    Language and Compiler Support for Synchronous Parallel Critical Sections
    Proc. of CPC'96 6th Int. Workshop on Compilers for Parallel Computers, Aachen, Dec. 1996.
    See also: Technical Report Nr. 95-23 of University of Trier, Department for Mathematics and Computer Science, Nov. 1995.

    [I5]
    Christoph W. Kessler:
    On the Applicability of Program Comprehension Techniques to the Automatic Parallelization of Sparse Matrix Computations
    Proc. of AP'97 3rd Workshop on Automatic Data Layout and Performance Prediction, Research report of Departament d'Arquitectura de Computadors, Universitat Politècnica de Catalunya, Barcelona, Spain, Jan. 1997.

    [I7]
    Christoph W. Kessler:
    Applicability of Automatic Program Comprehension to Sparse Matrix Computations
    Proc. of CPC'98 7th Int. Workshop on Compilers for Parallel Computers, Linköping (Sweden), pp. 218-230, July 1998.

    [I8]
    Christoph W. Kessler
    NestStep: Nested Parallelism and Virtual Shared Memory for the BSP Model (see [C18])
    Proc. of CPC'00 8th Int. Workshop on Compilers for Parallel Computers, Aussois (France), pp. 13-19, Jan. 2000.

    [I9]
    Christoph W. Kessler
    Managing irregular remote accesses to distributed shared arrays in a bulk-synchronous parallel programming environment
    Proc. of CPC'01 9th Int. Workshop on Compilers for Parallel Computers, Edinburgh (Scotland), pp. 195-204, June 2001.

    [I10]
    Andrzej Bednarski, Christoph W. Kessler
    Optimal integrated code generation for VLIW architectures
    Proc. of CPC'03 10th Int. Workshop on Compilers for Parallel Computers, Amsterdam, The Netherlands, pp. 71-80, January 2003.

    [I11]
    Andrzej Bednarski, Christoph W. Kessler
    Energy-Optimal Integrated VLIW Code Generation.
    Proc. of CPC'04 11th Int. Workshop on Compilers for Parallel Computers, Seeon, Germany, pp. 227-238, July 2004.

    [I12]
    Christoph W. Kessler, Andrzej Bednarski
    Classification and generation of schedules for VLIW processors.
    Proc. of CPC'06 12th Int. Workshop on Compilers for Parallel Computers, A Coruna, Spain, Jan. 2006, pp. 60-72.

    [I13]
    Andrzej Bednarski, Christoph W. Kessler
    Integer Linear Programming versus Dynamic Programming for Optimal Integrated VLIW Code Generation.
    Proc. of CPC'06 12th Int. Workshop on Compilers for Parallel Computers, A Coruna, Spain, Jan. 2006, pp. 73-85.

    [I14]
    Christoph W. Kessler, Jörg Keller.
    Models for Parallel Computing: Review and Perspectives.
    Invited survey article, in PARS-Mitteilungen 24: 13-29, ISSN 0177-0454, Dec. 2007. Gesellschaft für Informatik e.V., Germany.

    [I15]
    Christoph Kessler, Jörg Keller:
    Optimized Mapping of Pipelined Task Graphs on the Cell BE.
    Proc. 14th Int. Workshop on Compilers for Parallel Computers (CPC-2009), Zürich, Switzerland, Jan. 2009.

    [I16]
    Christoph Kessler, Welf Löwe:
    Optimized Composition of Performance-Aware Parallel Components.
    Proc. 15th Int. Workshop on Compilers for Parallel Computers (CPC-2010), Vienna, Austria, July 2010.

    [I17]
    Usman Dastgeer, Lu Li, Christoph Kessler:
    Performance-Aware Dynamic Composition of Applications for Heterogeneous Multicore Systems with the PEPPHER Composition Tool.
    Proc. 16th Int. Workshop on Compilers for Parallel Computers (CPC-2012), Padova, Italy, Jan. 2012.

    [I18]
    Usman Dastgeer, Christoph Kessler:
    Towards global composition of performance-aware components for GPU-based systems.
    Proc. 17th Int. Workshop on Compilers for Parallel Computers (CPC-2013), Lyon, France, July 2013.

    [I19]
    Christoph Kessler, Usman Dastgeer, Lu Li:
    Optimized Composition: Generating Efficient Code for Heterogeneous Systems from Multi-Variant Components, Skeletons and Containers.
    In: F. Hannig and J. Teich (eds.), Proc. First Workshop on Resource awareness and adaptivity in multi-core computing (Racing 2014), May 2014, Paderborn, Germany, pp. 43-48.

    [I20]
    Yosandra Sandoval, Dennis Hoppe, Dmitry Khabi, Michael Gienger, Christoph Kessler, Lu Li, Usman Dastgeer, Phuong Ha, Ibrahim Umar, Vi Tran, Anders Gidenstam, Philippas Tsigas, Paul Renaud-Goud, Ivan Walulya:
    EXCESS: Execution Models for Energy-Efficient Computing Systems.
    Proceedings of the Energy Efficient Distributed and Parallel Computing Workshop (fEEDBACk), held in conjunction with PODC-2015, Donostia-San Sebastian, Spain, July 20th, 2015.

    [I21]
    Christoph Kessler, Lu Li, Erik Hansson, Nicolas Melot, August Ernstsson:
    Programming Frameworks for Optimized Software Composition for Parallel Systems.
    Proc. Second EXCESS Workshop, Chalmers, Gothenburg, Sweden, Aug. 2016.

    [I22]
    Dimitrios Soudris, Lazaros Papadopoulos, Christoph Kessler, Dionysios Kehagias, Athanasios Papadopoulos, Panos Seferlis, Alexander Chatzigeorgiou, Apostolos Ampatzoglou, Samuel Thibault, Raymond Namyst, Dirk Pleiter, Georgi Gaydadjiev, Tobias Becker, Matthieu Haefele:
    The EXA2PRO programming environment: architecture and applications.
    Proc. SAMOS-XVIII, 18th Int. Conf. on Embedded Computer Systems: Architectures, Modeling, and Simulation, Pythagorion, Greece, July 2018, pages 202-209, ACM, 2019. DOI: 10.1145/3229631.3239369

    [I23]
    Sotirios Panagiotou, August Ernstsson, Johan Ahlqvist, Lazaros Papadopoulos, Christoph Kessler, Dimitrios Soudris:
    Portable exploitation of parallel and heterogeneous HPC architectures in neural simulation using SkePU.
    Proc. SCOPES-2020, St. Goar, Germany, May 2020. ACM. DOI: 10.1145/3378678.3391889


    Theses

    [D1]
    Christoph W. Kessler:
    Code-Optimierung quasiskalarer vektorieller Grundblöcke für Vektorrechner
    Diplomarbeit (M.S. thesis, 65 pages, reprint available on request), Universität des Saarlandes, Saarbrücken, Germany, 1990.

    [D2]
    Christoph W. Kessler:
    Automatische Parallelisierung Numerischer Programme durch Mustererkennung
    Dissertation (PhD thesis, 200 pages) Universität des Saarlandes, Saarbrücken, Germany, 1994.

    [D3]
    Christoph W. Kessler:
    Parallelism and Compilers
    Habilitation thesis, 336 pages, FB IV, Universität Trier, Germany, Dec. 2000.


    Edited Books and Proceedings

    [E1]
    Christoph W. Kessler (ed.):
    Automatic Parallelization - New Approaches to Code Generation, Data Distribution, and Performance Prediction
    Book (221 pages, softcover), Vieweg Advanced Studies in Computer Science, Verlag Vieweg, Wiesbaden 1994, ISBN 3-528-05401-8.
    Based on the Proc. of AP'93 First Int. Workshop on Automatic Parallelization, Automatic Data Distribution and Automatic Parallel Performance Prediction, held March 1-3, 1993, in Saarbrücken, Germany.

    [E2]
    Erik Altman, James Dehnert, Christoph W. Kessler, Jens Knoop (eds.):
    Dagstuhl Seminar Proceedings 05101 -- Scheduling for Parallel Architectures: Theory, Applications, Challenges, March 2005. Abstracts Collection and Executive Summary. Internationales Begegnungs- und Forschungszentrum (IBFI), Schloss Dagstuhl, Germany, 2005.

    [E3]
    Christoph W. Kessler, Welf Löwe, David Padua, Markus Püschel (eds.):
    Dagstuhl Seminar Proceedings 10191 -- Program Composition and Optimization: Autotuning, Scheduling, Metaprogramming and Beyond, May 2010. Abstracts Collection, Executive Summary, and Introduction Paper [M18]. Dagstuhl Research Online Publication Server (DROPS), http://drops.dagstuhl.de, Leibniz-Zentrum für Informatik, Schloss Dagstuhl, Germany, June 2010.

    [E4]
    Christoph W. Kessler (ed.):
    Proceedings of the Fourth Swedish Workshop on Multicore Computing (MCC-2011), Linköping, Sweden, Nov. 23-25, 2011.

    [E5]
    Christoph W. Kessler and Sabri Pllana (eds.):
    Proceedings of the 2013 IEEE 6th International Workshop on Multi-/Many-core Computing Systems (MuCoCoS-2013), September 7, 2013, Edinburgh, Scotland, UK, in conjunction with the 22nd International Conference on Parallel Architectures and Compilation Techniques (PACT-2013).
    Published in IEEE Xplore, ISBN 978-1-4700-1010-6, IEEE, 2013.
    Editor's note see [M21].

    [E6]
    Christoph W. Kessler (ed.):
    Guest Editor's Note: High-Level Parallel Programming 2019
    International Journal of Parallel Programming, 48(4), Special issue, Aug. 2020, Springer. DOI: 10.1007/s10766-020-00672-0.


    Publications Related to Computer Science Education

    [P1]
    C. Kessler, S. Nadjm-Tehrani:
    Mid-term Course Evaluations with Muddy Cards
    ACM SIGCSE ITiCSE'02 7th Int. Conf. on Information Technology in Computer Science Education, Aarhus (Denmark), June 2002.
    Also published in: ACM SIGCSE Bulletin 34(3), September 2002.

    An extended version appeared in:
    H. Hult [ed.], 5:e universitetspedagogiska konferensen vid Linköpings universitet, 15 Nov. 2001, CUL rapport No. 3, pp. 153-161, 2001.

    [P2]
    N. Dale, J. Bishop, D. Barnes, C. Kessler.
    A dialog between authors and teachers.
    In Proc. ACM SIGCSE ITiCSE'02 7th Annual Conf. on Information Technology in Computer Science Education, Aarhus, Denmark, June 2002. Pages 44-45.
    Also published in: ACM SIGCSE Bulletin 34(3), September 2002.

    [P3]
    Christoph W. Kessler:
    A practical access to the theory of parallel algorithms (see also [C22])
    Proc. ACM SIGCSE'04 Symposium on Computer Science Education, Norfolk, Virginia, USA, March 2004.

    [P4]
    C. Kessler:
    StASy: Datorstödd administration för stora studierektorsområden. [In Swedish]
    In: Centrum för Undervisning och Lärande, CUL-report no. 10: Nya villkor för lärande och undervisning. 9:e Universitetspedagogiska konferensen vid Linköpings universitet, 17 oktober 2005. Linköping University Electronic Press, March 2006, pp. 103-113.

    [P5]
    C. Kessler:
    Teaching parallel programming early.
    In: Proceedings of Workshop on Developing Computer Science Education -- How Can It Be Done?, March 10, 2006, Linköpings universitet, March 2006.


    Other Publications

    [M1]
    Christoph W. Kessler.
    Knowledge-Based Automatic Parallelization by Pattern Recognition
    in: Christoph W. Kessler [Ed.]: Automatic Parallelization - New Approaches to Code Generation, Data Distribution, and Performance Prediction, Verlag Vieweg, Wiesbaden, 1994.

    [M2]
    Christoph W. Kessler [Editor]
    Proceedings of AP'93 First Int. Workshop on Automatic Parallelization, Automatic Data Distribution and Automatic Parallel Performance Prediction, held March 1-3, 1993, in Saarbrücken, Germany.

    [M3]
    Christoph W. Kessler
    Pattern-Driven Automatic Parallelization, Data Distribution, and Performance Prediction
    in: Poster contributions at CONPAR'94, Technical report No. 94-48 of RISC Linz, Austria, Sept. 1994. pp. 17-20

    [M4]
    Christoph W. Kessler, Helmut Seidl.
    Making FORK Practical
    Technical report no. 01/95, SFB 124, Univ. Saarbrücken, 1995.

    [M5]
    Christoph W. Kessler.
    Automatische Parallelisierung
    Vorlesungsskript (Course script), University of Trier, Department for Mathematics and Computer Science, SS 1995.

    [M6]
    Christoph W. Kessler
    Scheduling Expression DAGs for Minimal Register Need
    Technical Report 96-12, Univ. Trier, FB IV Mathematik/Informatik.

    [M7]
    Christoph W. Kessler.
    Parallel Fourier-Motzkin Elimination
    Manuscript, University of Trier, Department for Mathematics and Computer Science, Feb. 1997.
    See also the conference paper version, [C11].

    [M8]
    Christoph W. Kessler.
    Practical PRAM Programming in Fork95 - A Tutorial
    Technical Report No. 97-12, University of Trier, Department for Mathematics and Computer Science, 62 pages, May 1997.
    Available as postscript (1322 KB) and gzip'ed postscript (270 KB).

    [M9]
    Beniamino DiMartino, Christoph W. Kessler.
    Two Program Comprehension Tools for Automatic Parallelization: A Comparative Study
    Technical Report No. 97-23, University of Trier, Department for Mathematics and Computer Science, 23 pages, Nov. 1997.
    Available as postscript (1442 KB) and gzip'ed postscript (189 KB).

    [M10]
    Christoph W. Kessler, Helmut Seidl.
    ForkLight: A Control-Synchronous Parallel Programming Language.
    Technical Report No. 98-13, University of Trier, Department for Mathematics and Computer Science, 19 pages, Sep. 1998.
    Available as abstract and postscript (746 KB)

    [M11]
    Christoph W. Kessler, Helmut Seidl, Craig H. Smith.
    The SPARAMAT Approach to Automatic Comprehension of Sparse Matrix Computations.
    Technical Report No. 99-10, University of Trier, Department for Mathematics and Computer Science, 21 pages, March 1999. (BibTeX)
    Short version published at IWPC'99 [C17].

    [M12]
    P. Kelly, S. Gorlatch, C. Kessler, D. Quinlan.
    Topic 10: Parallel Programming: Models, Methods and Programming Languages
    In Proc. Euro-Par 2004 10th International Euro-Par Conference, Pisa, 2004, p. 614. Springer LNCS 3149, 2004. DOI: 10.1007/978-3-540-27866-5_80.

    [M13]
    Christoph W. Kessler, Welf Löwe.
    A Framework for Performance-Aware Composition of Explicitly Parallel Components (1-page summary).
    In: G.R. Joubert, C. Bischof, F. Peters, T. Lippert, M. Bücker, P. Gibbon, B. Mohr (Eds.), Book of Abstracts, ParCo-2007 Conference, NIC Series vol. 37, John von Neumann Inst. of Computing (NIC), Jülich, Germany, Sep. 2007, p. 57.

    [M14]
    Karl-Filip Faxen, ed., and C. Bengtsson, M. Brorsson, H. Grahn, E. Hagersten, B. Jonsson, C. Kessler, B. Lisper, P. Stenström, B. Svensson.
    Multicore computing--the state of the art.
    Report, The Swedish Multicore Initiative, 2008

    [M15]
    Beniamino Di Martino, Christoph W. Kessler, Yi Pan, Thomas Rauber, Gudula Rünger, Laurence T. Yang:
    Message from the PDSEC-09 workshop chairs.
    Proc. Int. Workshop on Parallel and Distributed Scientific and Engineering Computing (PDSEC-09), 29 May 2009, Rome, Italy, pp. 1-2, in conjunction with IEEE Int. Symposium on Parallel & Distributed Processing (IPDPS), 2009.

    [M16]
    C. Kessler
    Multicore - Möjliga Scenarion för Framtiden [In Swedish]
    OnTime (No. 3/2009), Combitech AB, Sweden, Dec. 2009

    [M17]
    C. Kessler, T. Rauber, Y. Robert, V. Scarano:
    Topic 12: Theory and Algorithms for Parallel Computation.
    Proc. EuroPar-2010 Conference.

    [M18]
    C. Kessler, W. Löwe, D. Padua, M. Püschel:
    Program Composition and Optimization: An Introduction.
    Proc. Dagstuhl Seminar 10191 Program Composition and Optimization, Dagstuhl Research Online Publication Server, LZI Dagstuhl, Germany, June 2010.

    [M19]
    S. Benkner, S. Pllana, J.L. Träff, P. Tsigas, A. Richards, G. Russell, S. Thibault, C. Augonnet, R. Namyst, H. Cornelius, C. Kessler, D. Moloney, P. Sanders:
    PEPPHER: PErformance Portability and Programmability for HEterogeneous many-core aRchitectures.
    Technical Report TR-10-4, Department of Scientific Computing, Faculty of Computer Science, University of Vienna, Austria, September 2010.

    [M20]
    Christoph Kessler, Usman Dastgeer, Mudassar Majeed, Nathalie Furmento, Samuel Thibault, Raymond Namyst, Siegfried Benkner, Sabri Pllana, Jesper Larsson Träff, Martin Wimmer:
    Leveraging PEPPHER Technology for Performance Portable Supercomputing.
    Poster and short paper at Supercomputing'12, Salt Lake City, Utah, USA, Nov. 2012.
    Short paper (DOI: 10.1109/SC.Companion.2012.212) and poster (DOI: 10.1109/SC.Companion.2012.213) in SC'12 Companion, ISBN 978-1-4673-6218-4, IEEE.

    [M21]
    Christoph Kessler, Sabri Pllana:
    Message from the MuCoCoS-2013 Workshop Chairs.
    Proceedings of IEEE 6th International Workshop on Multi-/Many-core Computing Systems (MuCoCoS-2013), September 7, 2013, Edinburgh, Scotland, UK, IEEE Xplore, ISBN 978-1-4799-1010-6, IEEE, 2013.

    [M22]
    N. Melot, C. Kessler, J. Keller, P. Eitschberger:
    Fast Crown Scheduling Heuristics for Energy-Efficient Mapping and Scaling of Moldable Streaming Tasks on Many-Core Systems - extended abstract.
    Summary of [J20], Proc. 18th Int. Workshop on Software and Compilers for Embedded Systems (SCOPES-2015), ACM, June 2015, pp. 105-108. DOI: 10.1145/2764967.2764975

    [M23]
    Christoph Kessler, Lu Li, Aras Atalar, Alin Dobre.
    An extensible platform description language supporting retargetable toolchains and adaptive execution [Extended Abstract]
    Summary of [C79], Proc. 19th Int. Workshop on Software and Compilers for Embedded Systems (SCOPES-2016), ACM, May 2016. DOI: 10.1145/2906363.2906366.


    Software

    [S1]
    Christoph Kessler:
    Fork: language, compiler and toolchain for the SB-PRAM parallel computer.
    Complete programming environment with compiler, libraries, and a graphical trace file visualizer. Necessary system tools and a software simulator for the SB-PRAM, written by the SB-PRAM team in Saarbrücken, are included for completeness. A comprehensive description can be found in [B1].
    First public release (open source) 1997. The system is nowadays mainly used for teaching parallel algorithms, see also [C22].

    [S2]
    Christoph Kessler:
    StASy: Studierektorns Administrations-System.
    Computer-based administration tool for the stressed studierektor (manager of undergraduate education) responsible for many courses and teaching staff.
    StASy has been in use at the Systems and Software division of the computer science department of Linköping University for daily administrative work since January 2002.
    Public release (open source) Nov. 2004.

    [S3]
    Andrzej Bednarski, Christoph W. Kessler.
    OPTIMIST: Retargetable integrated code generator for VLIW, DSP and embedded processors.
    First public release (open source) in June 2005.

    [S4]
    Johan Enmyren, Usman Dastgeer, August Ernstsson, Johan Ahlqvist, Christoph Kessler et al.:
    SkePU: Skeleton Programming Framework for Multicore CPU and Multi-GPU Systems
    First public release (open source) in Sept. 2010.
    SkePU3 has been hosted on GitHub since May 2020.

    [S5]
    Amin Shafiee Sarvestani, Erik Hansson, Christoph Kessler:
    PRT: Pattern Recognition Tool
    Generic tool for automated recognition of computational patterns in legacy C programs, e.g. for automatic parallelization.
    First public release (open source) in Aug. 2012.

    [S6]
    Lu Li, Christoph Kessler:
    MeterPU: Generic Measurement Abstraction Library for Heterogeneous Multicore Systems
    A C++-based software multimeter for convenient and portable measurement of time, energy, etc. on heterogeneous multi-/manycore systems such as GPU-based systems, with platform-specific plug-ins.
    First public release (open source) in Aug. 2015.

    [S7]
    Nicolas Melot, Christoph Kessler:
    Drake: Generic Framework for Portable Specification of Streaming Applications and Deployment for On-Chip-Pipelined Execution on Manycore Systems
    Drake is a programming framework and toolchain, based on plain C, for the portable design and platform-specific optimized deployment of streaming applications on many-core processors.
    First public release (open source) in Nov. 2015.


    Copyright notice: This page contains links to copyrighted material; copyright is held by the author or by the publisher. In the latter case, we either give a link to the publisher's catalog or provide an extended abstract or the final manuscript, which usually differs from the printed article in minor details. This material is made accessible here for the purpose of fast dissemination of work to interested colleagues, and is hence intended for personal use only. For reproduction and redistribution, the copyright rules of the respective publisher apply.

    This page by Christoph W. Kessler

    Last updated: 2024-10-16