- Lecture 1: Introduction to Information-theoretic Lower Bounds via Puzzles
- Lecture 2: Importance of Problem Structures
- Lecture 3: Statistical Decision Theory: Model Distance and Equivalence
- Lecture 4: Local Asymptotic Normality and Asymptotic Theorems
- Lecture 5: Hypothesis Testing and Le Cam’s Two-point Method
- Lecture 6: Generalized Two-Point Method: Point vs. Mixture
- Lecture 7: Mixture vs. Mixture and Moment Matching
- Lecture 8: Multiple Hypothesis Testing: Tree, Fano and Assoaud
- Lecture 9: Multiple Hypothesis Testing: More Examples
- Online Lecture: Information-theoretic Lower Bounds
In 2019, from June to August, 40 high school students attended the STEM to SHTEM (Science, Humanities, Technology, Engineering and Mathematics) summer program hosted by Prof. Tsachy Weissman and the Stanford Compression Forum. During this summer program, the high schoolers pursued fun research projects in various domains under the supervision of 18 mentors, where the entire collection of the high schoolers’ reports can be found below.
- Artificial Olfaction Measurement using VOC, Gas, and Temperature Sensors
- Building a Human-Centric Lossy Compressor for Facial Images
- Comparing the compressibility of different genomes and the correlation between genome length and compression ratio
- Determining sufficient audio file compression and reconstruction standards for different music genres
- Human-Based Image Compression; Using a Deterministic Computer Algorithm to Reconstruct Pre-Segmented Images
- Implicit Communication in Collaborative Object Transport
- Journal for High Schoolers in 2019
- Nanopore DNA Sequencing
- Olfactometer Podcast
- Olfactory Communications In a Virtual Environment
- Probabilistic Models of Human Decision-Making With Time Pressure and Partial Information Using Game Interfaces, Reward Functions, and Markov Decision Processes
- The Average Sensitivity of Boolean Functions on Hypercubes
Anushka Sheth, Bikrant Das Sharma, Harvey Jarin, Ian Zhang, Jonathon Sneh, Leena Elzeiny, Prayusha Parikh and Vyomika Gupta
Abstract:Currently, VR systems immerse an individual visually and auditorily, but the lack of olfactory stimulation is part of the void that separates virtual reality from real life experiences. The prospect of adding scent as a third dimension to virtual reality prompted our research of the effects of adding an olfactory feature to virtual interactions, as well as its effect on the judgement of pleasantness and personality of others met in the virtual environment. At the start of the experiment, all participants take a personality quiz. The quiz predicts how much each participant enjoys a certain smell. Each individual then participates in a conversation in virtual reality with a stranger. Some participants have scent dispersed during their conversation, and some participants do not. The stranger the participant meets has scripted responses. Immediately after, each participant answers questions based on their conversation. The questionnaire quantitatively scores the (1) pleasantness to examine if scents impact the perception of someone and (2) the perceived personality of the stranger, in order to determine whether olfactory communication aids in an accurate first impression. This research is aimed at investigating the implications of integrating olfactory communication in virtual interactions and assessing its effects on the judgement of the other person’s personality and pleasantness.
Olfactory communication is deeply instinctual. A scent is first processed by the olfactory bulb which directly activates the amygdala, the control center for emotions in the brain. The association between smells and emotions can run so deep that, according to three clinical studies by two psychiatric professors, patients with post-traumatic stress disorder reported feeling guilt and nausea upon smelling an environment associated with the traumatic incident . Currently, technology is slowly shifting to incorporate olfaction machinery into spaces, to bring a new era of immersive reality. In order to simulate the physical world, one must mimic all dimensions of it, but little research has been conducted to understand the human response to syntents.
The purpose of our research is to determine the effect of aroma in a person’s judgement of the pleasantness and personality of a person who they meet. A personality quiz is given to the participant to predict their enjoyment of a certain smell. The participant interacts with another person. During the virtual reality conversation between the participant and the actor, the scent is dispersed for some subjects without their knowledge.
Virtual reality (VR) is a highly realistic, immersive, and simulated experience incorporating sophisticated visual and audio elements. Virtual reality is primarily used for educational and entertainment purposes, as it simulates a realistic interaction. The two participants interact in virtual reality, through the platform AltspaceVR, a social virtual world where users create avatars and explore chat rooms. AltspaceVR serves to supplement social interactions, and connects users worldwide. While AltspaceVR and other such virtual communication platforms attempt to mimic and replicate real life encounters, many subtleties humans have in their physical interactions are lost. Most first impressions, as well as the formation of memories, require tangible visual and olfactory cues that cannot be conveyed currently through virtual means. First impressions are formed in the amygdala and posterior cingulate cortex; parts of the brain that process fear and assign value to objects. The factors involved in the assignment of value and trustworthiness are largely derived from visual signals, primarily body language. People can evaluate faces on trustworthiness, status, and attractiveness following even just a brief glance (e.g., 33 ms) and extra time (100 or 500 ms) only led to a small improvement in the correspondence of these time-constrained evaluations with an independent set of time-unconstrained judgments. Perceptions of personality are also based in visual cues, including clothing, posture, and eye contact. Many such cues, however, are also conveyed through olfaction.
Yet, such a strong correlation has not always been addressed. The potential of olfactory communication has been tabled since the ancient Greek philosophers: Plato and Aristotle described scent as the most animalistic of the five senses . Later philosophers such as Immanuel Kant agreed because smell did not contribute to the music and art during the Enlightenment. On the other hand, some non-Western cultures glorified their sense of smell . For example, the Bororo people in Brazil consider body odor to be the life force of an individual and the smell of breath to be linked to the soul.
3. Methods and Materials
We received a scent machine from OWidgets, a device from the SCHI lab at the University of Sussex. AltspaceVR was used for the virtual reality world and experience. The VR headsets used were an Oculus Rift and an HTC Vive. Scents were received from the Institute of Art and Olfaction in Los Angeles, California. An Arduino board was used to control the dispersion of the OWidgets machine. The board was connected to a laptop which controlled the dispersion of scent.
The experience begins in a waiting room, where participants are ushered in and out. Each participant takes the scent personality quiz anonymously and the database stores the person’s scent preferences scored by the quiz. From there, participants are told the narrative for the experiment: they have left their lives and are in a state which determines whether they will return to their old life or move on to the afterlife. In the room, the participant enters the virtual reality world. There is another person in the Virtual Reality world, and the two users converse. After the experience, they take an exit survey, judging each others personality and pleasantness.
An important factor in conducting the experiment relies on the participants perceived reality as a blinded experiment. The experiment is disguised as a study on the effect of virtual reality on first impression. Each participant meets an “actor”. An actor is a person who has memorized his/her responses to the questions and gives the memorized response when the participant asks them a question.
After their interaction, the participant steps out of the VR headset and take an exit survey on the interaction they just had. The exit survey asks the participant to judge the actor’s personality and pleasantness.
AltspaceVR is a virtual reality platform that is used as a meeting space for both private and public events. In AltspaceVR, worlds can be built from scratch, which allows for greater customization and control over certain variables (avatars, music, boundaries, setting). In addition to its customization features, Altspace also supports world-building through Unity and in-game app development through its MDK feature. This, in combination with its other features, makes AltspaceVR ideal for our experimental procedure.
For our world, we wanted to create a neutral, non-religious interpretation of the Afterlife. In order to ensure that the world would not elicit any negative emotions, we also wanted to make the world as serene and calm as possible. The world we determined that would be best included the following features: soothing music, a night time and mountain background, a reflective plane, and a limiting boundary in the form of rocks. To make the experience more interactive, we also implemented a Tic-Tac-Toe game to go along with the questions.
3.2 Personal Quiz
In order to determine the scent that corresponded to a participant’s unique personality, we created our own version of a personality quiz. Based on the research we conducted on various personality quizzes (Myers-Briggs, Five Factor Model, Eysenck’s PEN Model), we determined that a quiz following the general structure and trait association as the FFM would best suit our purpose. We devised 12 questions, 11 four response multiple choice and 1 two response question, based around the five main personality traits in the FFM: Openness to experience, Conscientiousness, Extraversion, Agreeableness, and Neuroticism (OCEAN). Since there are four main fragrance families–Floral, Oriental, Woody, Fresh–and five trait groups, we decided to combine openness and extraversion due to their similarities . To connect fragrance families and the personality traits, we found correlations between one’s personality and preferred perfume. The fragrance-trait relations we used were Floral-Neuroticism, Oriental-Conscientiousness, Woody-Agreeableness, Fresh-Openness and Extraversion .
Early versions of our quiz were inaccurate, as a majority of people tended to overwhelmingly favour one response over the other answer choices for several questions. This led to a majority of people ending up with a high score in the Fresh fragrance family as opposed to the other three fragrance families. After going through several iterations of the quiz while modifying and removing certain biased questions, we were left with 12 questions with even distributions in terms of answer responses. For scoring, we assigned each response three points (with the exception of #11 which was given four points for more weight). The points corresponded to different scents, as our research proved that those who chose a specific answer choice were more likely to favor a specific fragrance. Each answer choice that an individual made added points to one or more scents. In total, there were 37 possible points to be earned in each of the four scents, meaning that there was an equal opportunity to score any of the four fragrance families. After all 12 questions were answered, the total number of points scored in each fragrance family was summed up to create the participant’s scent array. The scent array ranked the four scents, in order to anticipate which one they would be partial to. Next, we had to evaluate whether an individual’s personality – as judged by our quiz – actually had an effect on the kind of smells they preferred. To test the accuracy of our personality quiz, we went to various locations on campus and surveyed people randomly. Participants were first given the quiz in order for us to predict the scent that they should prefer based on their personality type. Then, they were given four strips of paper, each sprayed with a different fragrance oil and were asked to choose which one they prefered. Out of the 24 people we surveyed, 16 of them chose one of their top two scoring scents as their favourite, proving that our quiz results could accurately predict which aromas an individual would prefer a majority of the time.
3.3 OWidgets Scent Machine
To disperse the participants’ unique perfume throughout the room, we looked into various methods of scent delivery systems such as essential oil diffusers, heat diffusers, and nebulizer oil diffusers. While each of these have proved to be effective at dispersing smells throughout a given environment, the fragrance oils they give off tend to linger in the room for hours. Over time, the different smells would blend together to form olfactory white – an unidentifiable scent that would distort our data. We wanted to avoid the Smell-o-vision disaster of the mid-twentieth century, in which users were overwhelmed by the multiple scents coming at them. Furthermore, we had to ensure that the smell was not too strong. Since each VR experience would only last for four to five minutes, we needed a way to easily get rid of the smell after each encounter. To do this, we partnered with OWidgets, a company that designs ways to incorporate olfaction into various experiences. This company was founded by Emanuela Maggioni, an experimental psychologist whose interest in the human olfactory system led her to develop a machine that could be used for further experimentation with smells. The scent machine that she and her team created allows three different scents to be sprayed from the nozzles. Through experimentation, we confirmed that perfumes sprayed from the machine only noticeably lingered in the environment for 1-3 seconds after the machine was turned off. We found that the ideal distance between the user and the OWidgets Scent machine was five feet. Moreover, we discovered that the best way to spray the fragrances was to have the machine on a cycle with it on for five seconds and off for 15 seconds for the duration of the experience – about four minutes. This ensured that the participant could smell the scent for a majority of the experience and was not overwhelmed by the strong fragrance.
4. Future Directions
The next steps for continuing this research would be using more perfumes in the VR interaction. Expanding from only fresh to also including floral, warm and spicy, and woody would offer more data about responses to smells which people like and do not like.
Afterwards, expanding to a more dynamic set of scents would also be a possible future direction. By implementing scents that correspond to one’s unique environment, an even more immersive experience would be gained from the users. Smells in an environment could be identified by an E-nose and then transferred to another user using a similar system as ours, but the actual replication of the scent and as well the current capabilities of an E-nose limit this process. Future advancements in these areas would be needed to further the inclusion of dynamic scents in VR or any other platform.
We would like to acknowledge Professor Tsachy Weissman of Stanford’s Electrical Engineering Department and head of the Stanford Compression Forum for his guidance and help with the project. We also want to thank Devon Baur for being the mentor of our group and helping us with designing the project, accessing and getting materials, and running the experiment. Additionally, we also want to thank Professors Subhashish Mitra, Gordon Wetzstein, and Debbie Senesky for all of their help with the project. Also, we want to thank the OWidgets group in London for giving us a scent machine and helping us set it up and troubleshoot. Lastly, we want to thank the Institute of Art and Olfaction in Los Angeles for providing us with the scents we used for the project.
 Vermetten E & Bremner JD. Olfaction as a traumatic reminder in posttraumatic stress disorder: case reports and review. The Journal of Clinical Psychiatry 64 (2003), 202-207.
 Camille Ferdenzi, S. Craig Roberts, Annett Schirmer, Sylvain Delplanque, Sezen Cekic, Christelle Porcherot, Isabelle Cayeux, David Sander, Didier Grandjean, Variability of Affective Responses to Odors: Culture, Gender, and Olfactory Knowledge, Chemical Senses, Volume 38, Issue 2, February 2013, Pages 175–186, https://doi.org/10.1093/chemse/bjs083
 Bhana, Yusuf. “Localising Smells: How Different Cultures Perceive Scents.” TranslateMedia, 7 Dec. 2017, http://www.translatemedia.com/translation-blog/localising-smells-different-cultures-perceive-scents/
 Mensing J., Beck C. (1988) The psychology of fragrance selection. In: Van Toller S., Dodd G.H. (eds) Perfumery. Springer, Dordrecht
 Ahrndt, Sebastian & Fähndrich, Johannes & Albayrak, Sahin. (2015). Modelling of Personality in Agents: From Psychology to Implementation.
Alex Chin and Alan Yu
Georgina Cortez, Nicole Krukova, Thuan Le, and Suraj Thangellapally
Abstract:Nanopore technology is a modern way of sequencing DNA, in other words, it determines the order of nucleotides in a DNA strand . First, a DNA strand passes through a motor protein that serves as an inhibitor to improve the accuracy of the readings. Then, the strand runs through a nano-sized hole that is experiencing an ionic current. As the DNA strand passes through the nanopore, the individual nucleotides cause disruptions in the electric current which help determine the nucleic acid sequence . Approximately 250 bases can be sequenced per second by Nanopore technology . However, the results are “noisy” or not completely reliable because of several factors including the inconsistency of the nucleotides’ dwell times, which are the time periods of the central bases in the nanopore. The wavelength of the ionic current depends on the dwell time; inconsistent dwell times produce different wavelengths that make it difficult to accurately determine the nucleic acid sequence.
We believe the central base’s dwell time depends on the nucleotides present in the motor protein at that moment. The central base is the nucleotide currently in the nanopore whose dwell time we are focusing on. A k-mer (e.g. “AAT” or “CGATC”) consists of multiple nucleotides that correspond to the central base’s dwell time. To characterize the inconsistencies of the dwell times, we analyzed the data to find correlations between the k-mers and their dwell times. These correlations can help future researchers assign specific characteristics to k-mers and their impact upon dwell times.
We conducted our research with sequenced DNA data previously acquired from the lambda bacteria, which has approximately 48,000 nucleotides. Through Python, we manipulated the data to determine trends and correlations between the k-mers and their corresponding dwell times. Our main goal in the programming was to create visual representations of the data in the form of plots. These plots allowed us to discern noticeable trends in the data that we otherwise might not have observed. Our initial plots were difficult to analyze because there was too much data that made it nearly impossible to understand any trends. To resolve this issue, we organized the data into a more digestible format, by only plotting the average dwell time for each k-mer and sorting the points from lowest dwell time to highest. This allowed us to more easily spot any outliers or trends in the data and compare different k-mers.
The plot and table showed that the data for all of the bases was very similar as there were barely any differences between their means, standard deviations, etc. These results validated our belief that the dwell times did not depend on the base in the pore but rather that the dwell times are dependent on the following bases that are falling through the motor protein since it controls the speed of the DNA strand. To test this idea, we moved on to creating plots of different k-mer’s dwell times to see if any correlations became more apparent.
We began with an unshifted 3-mer as practice to construct our function and begin experimenting with associating k-mers to the central base’s dwell time. As expected, there were no significant trends in our data because the unshifted 3-mer has only one nucleotide following the central base. We believed that there would be correlations with a longer k-mer and its dwell time since, as our hypothesis claims, the central base’s dwell time depends on the nucleotides passing through the motor protein in that moment. Following on with this observation, we created a dataset with shifted 3-mers, where we shifted the 3-mer one space to the right.
Since the error bars for the dwell times of the 3-mer are so significant, the data for the 3-mer plot should not be used as it is unreliable. This further affirmed our focus on the 5-mer as it holds more reliable data.
We observed that the outlying dwell times below 7 units and above 9 units had a consecutive repetition of a base in their k-mers. The dwell times in these ranges varied greatly from the “normal” dwell time in the 7 to 9 unit range. The k-mers in this “normal” range had a minimal amount of repetition in comparison to the outlying ranges. This lead us to believe that the consecutive repetition of a nucleotide causes an irregular dwell time.
Similar to the previous sample of 5-mer data, there was a repetition of a single base in k-mers with irregular dwell times. In the 6.5 to 6.7 unit range of the dwell time there was a repetition of a single base in each k-mer. We also noticed that there was a higher prevalence of C’s in the lower spectrum of the dwell times. Moving into the higher 10 to 12 unit range of the dwell time, we observed that there was a consistent repetition of the nucleotide G in each k-mer. G’s are known to be particularly disruptive so we believe that the repetition of them caused the change in the dwell time. This data supported that the repetition of a nucleotide causes irregularities in the dwell times.
The first plot models that there is no difference between the prior k-mers and the 7-mer. This indicates that in order to find some trend, we need to increase the length of the k-mer and keep shifting it. There are 16,384 possible 7-mers combinations, so we created a plot of their averages so there would be a better visualization. The second plot above represents the averages of the first 10,000 bases of the lambda data collected, sorted from least to greatest. However, after analyzing the plot, we were sure that we would need a longer k-mer to spot any significant trend. For future reference, researchers should look to analyzing the repetition of nucleotides, focusing on specific ranges, and any other possible trends that could lead to a discovery.
Our project confirmed that the dwell time is not dependant on the central base but rather on the following bases in the motor protein. A significant discovery was that the consecutive repetition of a nucleotide causes irregular dwell times. The nucleotides G and C also appear to affect the consistency of the dwell time. With this information, future researchers should continue to create different shifts of k-mers and gather more data about the impact of base repetition on the dwell time. By understanding this relationship, researchers will be able to characterize and predict the effect a k-mer will have upon the dwell time. This can lead to more accurate readings of the sequenced DNA which will allow for the expansion of Nanopore technology.
Advancements in Nanopore technology will expand the applications of DNA sequencing in fields such as healthcare and data compression. Increased reliability will permit for the technology to have a more widespread role in personalized medicine, where treatment is unique to one’s genetics, and DNA data storage, an incredibly efficient way of storing a substantial amount of data in a microscopic space.
 Python: The Ultimate Beginner’s Guide!. Scotts Valley: CreateSpace Publishing, 2016.
 “Python Numpy Tutorial”, Cs231n.github.io, 2019. [Online]. Available: http://cs231n.github.io/python-numpy-tutorial/#numpy-arrays. [Accessed: 08- Jul- 2019].
 “Data Analysis and Visualization with Python for Social Scientists alpha: Extracting row and columns”, Datacarpentry.org, 2017. [Online]. Available: https://datacarpentry.org/python-socialsci/09-extracting-data/index.html. [Accessed: 08-Jul-2019].
 G. Templeton, “How DNA data storage works – ExtremeTech”, ExtremeTech, 2016. [Online]. Available: https://www.extremetech.com/extreme/231343-how-dna-data-storage-works-as-scientis ts-create-the-first-dna-ram. [Accessed: 08-Jul-2019].
 G. Templeton, “How DNA sequencing works – ExtremeTech”, ExtremeTech, 2015. [Online]. Available: https://www.extremetech.com/extreme/214647-how-does-dna-sequencing-work. [Accessed: 08-Jul-2019].
 “Replacing strings with numbers in Python for Data Analysis – GeeksforGeeks”, GeeksforGeeks, 2018. [Online]. Available: https://www.geeksforgeeks.org/replacing-strings-with-numbers-in-python-for-data-analysis/. [Accessed: 08-Jul-2019].
 N. Jetha, C. Feehan, M. Wiggin, V. Tabard-Cossa and A. Marziali, “Long Dwell-Time Passage of DNA through Nanometer-Scale Pores: Kinetics and Sequence Dependence of Motion”, Biophysical Journal, vol. 100, no. 12, pp. 2974-2980, 2011. Available: 10.1016/j.bpj.2011.05.007.
 T. Petrou, “Selecting Subsets of Data in Pandas: Part 1 – Dunder Data – Medium”, Medium, 2017. [Online]. Available: https://medium.com/dunder-data/selecting-subsets-of-data-in-pandas-6fcd0170be9c. [Accessed: 11-Jul-2019].
 “How it works”, Oxford Nanopore Technologies, 2019. [Online]. Available: https://nanoporetech.com/how-it-works. [Accessed: 11-Jul-2019].
 S. Lynn, “Using iloc, loc, & ix to select rows and columns in Pandas DataFrames”, Shane Lynn, 2018. [Online]. Available: https://www.shanelynn.ie/select-pandas-dataframe-rows-and-columns-using-iloc-loc-an d-ix/#loc-selection. [Accessed: 11-Jul-2019].
 “Python Pandas Tutorial 7. Group By (Split Apply Combine)”, YouTube, 2017. [Online]. Available: https://www.youtube.com/watch?v=Wb2Tp35dZ-I. [Accessed: 14-Jul-2019].
 “How do I apply multiple filter criteria to a pandas DataFrame?”, YouTube, 2016. [Online]. Available: https://www.youtube.com/watch?v=YPItfQ87qjM. [Accessed: 14-Jul-2019].
 “Sequencing DNA (or RNA) | Real-time, Ultra Long-Reads, Scalable Technology from Oxford Nanopore”, YouTube, 2017. [Online]. Available: https://www.youtube.com/watch?v=GUb1TZvMWsw. [Accessed: 14-Jul-2019]
 “Introduction to nanopore sequencing”, YouTube, 2019. [Online]. Available:
https://www.youtube.com/watch?v=I9BOF8Hla5s. [Accessed: 14-Jul-2019].
 lambda data accessed from Weissman’s lab