I provide a personal account of the history and main design decisions in the development of the course 15-122 Principles of Imperative Computation. The first pilot was taught in Fall 2010, and it is now a central part in the freshmen curriculum for Computer Science and Electrical...
There is a raging argument in the world over how Internet access should be priced and controlled. The players have often used emotional words to describe the issues. Rarely does debate focus on understanding the implications of proposed directions on future technologies and...
Mobile manipulation robots offer the potential to vastly improve the quality of live for persons with severe motor disabilities, by acting as surrogates though which the person can influence and interact with their physical environment. However, fully autono mous robots are not...
Our world is full of networks. The linking relationships might be quite abstract, such as friendship or metabolic processes or even more concrete, like roads or railways, but are still hard to overlook. One way to deal with such a network, is to mathematically model it as a graph...
Gordon Bell's data dump is more than just a glorified photo album. By using e-memory as a surrogate for meat-based memory, he argues, we free our minds to engage in more creativity, learning, and innovation.
...
In this work we look at left-vs.-right polarization in web search queries issued and hashtags being used. We look at queries issued to Yahoo! in the US, and assign a political leaning in relation to the results being clicked. E.g. a query returning predominantly The Huffington...
MapReduce is now a pervasive analytics engine on the cloud. Hadoop is an open source implementation of MapReduce and is currently enjoying wide popularity. Hadoop offers a high-dimensional space of configuration parameters that makes it difficult on practitioners to set for...
This talk presents several examples of how Information Technology can help the 5 billion people at the bottom of the pyramid who do not now have routine access to devices and networks. For many of them, access to Word, Excel, and PPT or Programming are not that important. However...
Statistical Machine Translation (SMT) is the current mainstream approach to automatic translation between human languages. SMT relies on the existence of parallel bilingual corpora that are used to learn translation models automatically. Languages with different degrees of...
This talk is a gentle introduction to state-space search and model checking with LMNtal and LaViT, an IDE for LMNtal powered by visualizers. LMNtal is a language model based on hierarchical graph rewriting that uses point-to-point links to represent connectivity and membranes to...
All areas of science are being confronted with increasingly large datasets, and often their usual approaches to data analysis are struggling to cope. While advances in computational tools are addressing this problem, in some cases these are still not as refined as desired and...
Despite the increasing importance of data quality and the rich theoretical and practical contributions in all aspects of data cleaning, there is no single end-to-end o -the-shelf solution to (semi-)automate the detection and the repairing of violations w.r.t. a set of...
This presentation reports on the experience and lessons learned from introducing a constructivist Inquiry Based Learning (IBL) in advanced computing courses. Research-based projects are one of the IBL practices that are particularly suitable for higher education. Research-based...
While many organizations continue to focus on enabling sustainable development, very few organizations have studied the role of technology in this process. TechBridgeWorld at Carnegie Mellon University is spearheading the innovation and implementation of technological solutions...
Minicomplexity is the computational complexity theory that we can build by analogy to standard Computational Complexity Theory, when our computational model and resource of interest are not the Turing machine and its running time, but instead the two-way finite automaton (2FA) and...
As our data sets grow in size, the need for techniques for processing such large data under limited resources becomes more critical. One model for processing large data sequences is that of streaming computation: the input is read sequentially, i.e., "streamed", in one long pass,...
High throughput sequencing technologies have made it possible to sequence the whole genome of an individual human donor for a few thousand dollars. The world-wide capacity for genome sequence production has grown at an unprecedented rate, making the initiation of large scale...
Goggles, wearable cameras are poised to enter our social spaces in a big way. In this talk, I will investigate what wearable cameras can tell us both about the person wearing the cameras and the people they interact with. In the first part, I will present a method to reconstruct...
Message Ferrying (MF) is a technique used to deliver data in wireless and mobile networks that are either sparse or intermittently connected. The scheme utilizes a set of mobile nodes called message ferries that take responsibility for carrying messages within the network. I will...
Our societies depend more and more on IT-infrastructure. Algorithms are the brains of IT-systems. We are mostly interested in algorithms for difficult combinatorial and geometric problems: what is the fastest tour from A to B? How to optimally assign jobs to machines? how can a...
Physarum is a slime mold. It was observed over the past 10 years that the mold is able to solve shortest path problems and to construct good Steiner networks. In a nutshell, the shortest path experiment is as follows: A maze is built and the mold is made to cover the entire maze....
Learning based uniquely on input will not respond adequately to future challenges for individuals, society or the economy. The trend is to rely, increasingly, on the identification of learning outcomes and competences. In this event, we will present current international projects...
Typical location determination systems, for example the satellite-based Global Positioning System (GPS), require the presence of a physical device that is attached to the object that is being tracked. In addition, they usually require the tracked device to participate actively in...
The Global Technology Outlook (GTO) is IBM Research’s vision of the future for information technology (IT) and its impact on industries that use IT. This annual exercise highlights emerging software, hardware, and services technology trends that are expected to significantly...
Data-Intensive Scalable Computing (DISC) systems provide computing resources that enable the analysis of massive data sets. They have found widespread use in Internet companies, but they also have the potential to greatly advance areas such as astronomy and biology. Programs for...
Web search engines have become fixtures in our society, but few people realize that they are actually publicly accessible supercomputing systems, where a single query can unleash the power of several hundred processors operating on a data set of over 200 terabytes. With Internet...