The NASA Task Book

Project Title: HCAAM VNSCOR: Virtual Assistant for Spacecraft Anomaly Treatment During Long Duration Exploration Missions

Reduce

Images:

Fiscal Year: FY 2025
Division: Human Research
Research Discipline/Element:
HRP HFBP:Human Factors & Behavioral Performance (IRP Rev H)

Start Date: 03/06/2019
End Date: 08/31/2025
Task Last Updated: 01/10/2025

Download Task Book report in PDF pdf

Principal Investigator/Affiliation: Selva, Daniel Ph.D. / Texas A&M University

Address:

Aerospace Engineering Department
701 Ross St 3141 TAMU
College Station , TX 77843-0001

Email: dselva@tamu.edu
Phone: 607-255-6351
Congressional District: 17
Web:

Organization Type: UNIVERSITY
Organization Name: Texas A&M University
Joint Agency:
Comments:

Co-Investigator(s)
Affiliation:

Diaz Artiles, Ana Ph.D.	Texas A&M Engineering Experiment Station
Dunbar, Bonnie Ph.D.	Texas A&M Engineering Experiment Station
Wong, Raymond Ka Wai Ph.D.	Texas A & M, College Station

Project Information: Grant/Contract No. 80NSSC19K0656

Responsible Center: NASA JSC Grant Monitor: Whitmire, Alexandra Center Contact: alexandra.m.whitmire@nasa.gov Unique ID: 12299	Solicitation / Funding Source: 2017-2018 HERO 80JSC017N0001-BPBA Topics in Biological, Physiological, and Behavioral Adaptations to Spaceflight. Appendix C Grant/Contract No.: 80NSSC19K0656 Project Type: Ground Flight Program: TechPort: Yes	No. of Post Docs: 0 No. of PhD Candidates: 2 No. of Master's Candidates: 1 No. of Bachelor's Candidates: 1	No. of PhD Degrees: 1 No. of Master's Degrees: 0 No. of Bachelor's Degrees: 0

Human Research Program Elements:	(1) HFBP:Human Factors & Behavioral Performance (IRP Rev H)

Human Research Program Risks:	(1) HSIA:Risk of Adverse Outcomes Due to Inadequate Human Systems Integration Architecture

Human Research Program Gaps:	(1) HSIA-101:We need to identify the Human Systems Integration (HSI) – relevant crew health and performance outcomes, measures, and metrics, needed to characterize and mitigate risk, for future exploration missions. (2) HSIA-201:We need to evaluate the demands of future exploration habitat/vehicle systems and mission scenarios (e.g. increased automation, multi-modal communication) on individuals and teams, and determine the risks these demands pose to crew health and performance. (3) HSIA-401:We need to determine how HSI can be applied in the vehicle/habitat and computer interface Design Phase to mitigate potential decrements in operationally-relevant performance (e.g. problem-solving, execution procedures), during increasingly earth-independent, future exploration missions (including in-mission and at landing). (4) HSIA-701:We need to determine how human-automation-robotic systems can be optimized for effective enhancement and monitoring of crew capabilities, health, and performance, during increasingly earth-independent, future exploration missions (including in-mission and at landing).

Flight Assignment/Project Notes:	NOTE: End date changed to 08/31/2025 per A. Beitman/JSC (Ed., 9/19/24) NOTE: End date changed to 02/28/2025 per A. Beitman/JSC (Ed., 1/25/23)

Task Description:	This task is part of the Human Capabilities Assessments for Autonomous Missions (HCAAM) Virtual NASA Specialized Center of Research (VNSCOR). The research objective of this proposal is to investigate the impact of using Virtual Assistants (VA) to support crewmembers in the context of anomaly treatment during Long Duration Exploration Missions (LDEM), when ground support will be limited. A VA will be developed building upon the software architecture from existing VAs developed by the Principal Investigator (PI) for similar purposes. The VA will provide support for various aspects of anomaly treatment, including detecting and diagnosing the anomaly, as well as recommending a course of action. It will also have the ability to take initiative in the dialog with the user (mixed-initiative mode), and the ability to provide explanations for its actions. The impact of the VA on performance, cognitive workload, situational awareness, and trust, will be assessed through a set of three experiments with human subjects in a laboratory environment. The first experiment will establish the baseline impact (master-slave, no explanations), and subsequent experiments will study the effect of switching to the mixed-initiative mode and adding explanations. The system will also be deployed and tested in the Human Exploration Research Analog (HERA) analog environment.

Research Impact/Earth Benefits:	This project will provide standards and guidelines that will help NASA design similar virtual assistants to support astronauts during future long duration exploration missions. Such standards and guidelines will concern both the functionality and the user interface of the virtual assistant.

Task Progress & Bibliography Information FY2025

Task Progress:

In Year 6 of this project, we have worked primarily on Specific Aim 3, supporting the conduct of the HERA C7 campaign. The goal of Campaign 7 was to test a more advanced version of the assistant with more complex anomaly scenarios. In addition, we have worked on publications regarding Specific Aims 1 and 2. Finally, we started working on some guidelines for the future development of similar VAs.

Specific Aim 3: HERA C7 campaign Concerning Specific Aim 3, we prepared for and supported the HERA C7 campaign, which was completed in December 2024. Data analysis is in progress as of January 10, 2025. In preparation for the campaign, we conducted two major tasks:

We added several new “complex” anomaly scenarios, consisting of two and in some cases 3 anomalies. These anomalies were cascading in most cases, with the second one being phased in a few seconds or minutes after the first one. In a few cases, the individual anomalies were simultaneous. These scenarios were meant to simulate both independent and common-cause failures, as well as cases where the first failure causes the second one. The new scenarios were designed by HERA personnel with input from us and were largely combinations of existing (C6) scenarios. The design of these anomaly scenarios was subject to operational constraints on duration, number of operators, etc. Developing these scenarios necessitated incorporating them into Daphne’s knowledge graph and making changes to the Daphne diagnosis skill to allow the calculation of likelihood scores for these complex scenarios.

We also used the “enhanced” version of Daphne in C7. Compared to the “baseline” version used in C6, the enhanced version had the following added functionalities: 1) provide explanations for the likelihood scores of each anomaly; 2) a more advanced question-answering system leveraging large language models (GPT4).

With data from the first 3 missions (N=12), the major findings from the HERA C7 campaign are relatively similar to those of the C6 campaign:

–No significant differences in any of the main measures –Time to diagnosis (min): 9.29±1.48 (without VA) vs 8.51±1.73 (with VA) –Situational Awareness Rating Technique (SART) 25.33±2.10 (without VA) vs 24.69±1.65 (with VA) –NASA Task Load Index (TLX) 30.13±3.21 (without VA) vs 29.02±2.99 (with VA)

Insights from the exit interviews are largely consistent with those of C6. By and large, subjects thought the scenarios became easy and repetitive towards the end of the mission, due to a lack of variety. Indeed, many scenarios designed by HERA were different combinations of the same few C6 scenarios (CDRA failure, cabin fan failure, MOXIE failure).

All subjects established trust in Daphne very quickly and cited the tool’s accuracy as the primary factor for this. Some subjects also identified Daphne’s ability to provide explanations for its diagnoses as a positive factor that contributed to the establishment of trust.

Almost none of them found the question-answering capabilities essential because the task was clear, they were trying to finish the task as quickly as possible, and they didn’t feel like they needed to ask any questions. Some subjects did mention that question-answering was more useful in cases where Daphne recommended more than one diagnosis with the same confidence level, and thought it would be even more useful if the anomaly was completely new and unknown.

Overall, crewmembers felt that in more complex scenarios, a more iterative approach to diagnosis with Daphne asking questions to the crewmembers to try to gather more evidence and refine its diagnosis would be useful.

Note that detailed data regarding the interactions between the crewmembers and Daphne has not been analyzed yet.

Guidelines We have developed a list of 20+ potential guidelines. Each guideline consists of a short title, a recommendation, and some justification/evidence. An example guideline is provided below.

Short title: On-demand adaptive explanations Recommendation: AI agents should give succinct or on-demand explanations for users who already know and trust the tool and longer explanations for more novice users who are expected to not trust the tool yet.

Justification: More explanations for novices can accelerate training and develop trust, but long explanations can be frustrating and lead to a higher workload for advanced users (Bansal et al., 2021). In C7 exit interviews, several crewmembers mentioned that they don’t need a symptom comparison table because they already trust the model and they want to go as fast as possible.

The complete list of current guidelines is provided below. This list is still in progress and the guidelines are being more fully developed.

–Ranked anomaly scenarios: Provide a list of plausible anomaly causes ranked by likelihood –Symptom comparison table: Show a table comparing the signature of current vs potential anomalies –Diagnosis confidence: Agent should convey the degree of uncertainty in its diagnosis –Anomaly signature changes: Alert the user when the signature of the anomaly has changed –Automatic TM display: Automatically pull up time series for relevant parameters when an anomaly is detected –Potential misdiagnosis: Alert the user when asking to diagnose symptoms that were likely created by user actions –Agent information space: AI agent should have access to 1) the telemetry feed; 2) a database of the most recent N days of telemetry; 3) a knowledge base of subsystems, anomalies, procedures; 4) a historical anomaly database; 5) information about its own internal workings –Voice interface: Agent should have a voice interface to help the users, particularly with following procedures –Sufficient training: For proper agent trust, users should receive both hands-on training and theoretical training on 1) the tool and how it works internally; 2) the habitat systems –Latency of the QA system: The QA system shall be fast enough to provide immediate answers –Flexibility of the QA system: Use an LLM to increase flexibility and scalability. However, it should be a local LLM that can be downloaded and fine-tuned on a relevant dataset –Top-down Workflow: Users find a top-down workflow (detection, diagnosis, and resolution) intuitive, with the chatbot on the right –Evidence from other agents: Agent should incorporate additional evidence provided by the user and provide an updated diagnosis –Bayesian diagnosis: Agent should be able to update initial beliefs based on all evidence provided –Propose actions to reduce diagnostic uncertainty: Agent should recommend actions to gather additional evidence to refine the diagnosis in case of uncertainty –Agent learning: Agent should be able to learn from its own experience –Explanation types: The agent should provide explanations based on both expert knowledge and historical anomaly occurrence –Outlier detection: The agent should flag outlier signatures even if no thresholds are exceeded –Multiple anomalies: The agent should be able to deal with multiple simultaneous anomalies –Anomaly priority: In case of multiple anomalies, the agent should help the user prioritize the most important ones to attend to first –Full-wave diagnosis: The agent should diagnose anomalies based on the current and past values of the parameters in the telemetry feed (as opposed to the thresholds exceeded)

Bibliography:

Description: (Last Updated: 02/21/2025)

Show Cumulative Bibliography


Abstracts for Journals and Proceedings	Selva D, Dutta P, Josan PK, Dunbar BJ, Wong RKW, Diaz-Artiles A. "Virtual assistant for anomaly resolution in long duration exploration missions: Final results and lessons learned." 2025 NASA Human Research Program Investigators' Workshop, Galveston, Texas, January 28-31, 2025. Abstracts. 2025 NASA Human Research Program Investigators' Workshop, Galveston, Texas, January 28-31, 2025. , Jan-2025

Articles in Other Journals or Periodicals	Dutta P, Josan PK, Wong RKW, Dunbar BJ, Diaz-Artiles A, Selva D. "Effects of explanations and accuracy on human performance and trust in AI-assisted anomaly diagnosis tasks." Journal of Cognitive Engineering and Decision Making. Conditionally accepted November 14, 2024. , Nov-2024

Articles in Peer-reviewed Journals	Josan PK, Dutta P, Abbott R, Martin AV, Dunbar BJ, Wong RKW, Selva D, Diaz-Artiles A. "Virtual assistant for spacecraft anomaly resolution: Effects on human performance metrics." J Aerosp Inform Syst. 2025 Jan 3. Online ahead of print. https://doi.org/10.2514/1.I011449 , Jan-2025

Back to Search Results

Project Title: HCAAM VNSCOR: Virtual Assistant for Spacecraft Anomaly Treatment During Long Duration Exploration Missions

Reduce

Images:

Fiscal Year: FY 2024
Division: Human Research
Research Discipline/Element:
HRP HFBP:Human Factors & Behavioral Performance (IRP Rev H)

Start Date: 03/06/2019
End Date: 08/31/2025
Task Last Updated: 01/07/2024

Download Task Book report in PDF pdf

Principal Investigator/Affiliation: Selva, Daniel Ph.D. / Texas A&M University

Address:

Aerospace Engineering Department
701 Ross St 3141 TAMU
College Station , TX 77843-0001

Email: dselva@tamu.edu
Phone: 607-255-6351
Congressional District: 17
Web:

Organization Type: UNIVERSITY
Organization Name: Texas A&M University
Joint Agency:
Comments:

Co-Investigator(s)
Affiliation:

Diaz Artiles, Ana Ph.D.	Texas A&M Engineering Experiment Station
Dunbar, Bonnie Ph.D.	Texas A&M Engineering Experiment Station
Wong, Raymond Ka Wai Ph.D.	Texas A & M, College Station

Project Information: Grant/Contract No. 80NSSC19K0656

Responsible Center: NASA JSC Grant Monitor: Whitmire, Alexandra Center Contact: alexandra.m.whitmire@nasa.gov Unique ID: 12299	Solicitation / Funding Source: 2017-2018 HERO 80JSC017N0001-BPBA Topics in Biological, Physiological, and Behavioral Adaptations to Spaceflight. Appendix C Grant/Contract No.: 80NSSC19K0656 Project Type: Ground Flight Program: TechPort: Yes	No. of Post Docs: 0 No. of PhD Candidates: 2 No. of Master's Candidates: 1 No. of Bachelor's Candidates: 1	No. of PhD Degrees: 0 No. of Master's Degrees: 0 No. of Bachelor's Degrees: 0

Human Research Program Elements:	(1) HFBP:Human Factors & Behavioral Performance (IRP Rev H)

Human Research Program Risks:	(1) HSIA:Risk of Adverse Outcomes Due to Inadequate Human Systems Integration Architecture

Human Research Program Gaps:	(1) HSIA-101:We need to identify the Human Systems Integration (HSI) – relevant crew health and performance outcomes, measures, and metrics, needed to characterize and mitigate risk, for future exploration missions. (2) HSIA-201:We need to evaluate the demands of future exploration habitat/vehicle systems and mission scenarios (e.g. increased automation, multi-modal communication) on individuals and teams, and determine the risks these demands pose to crew health and performance. (3) HSIA-401:We need to determine how HSI can be applied in the vehicle/habitat and computer interface Design Phase to mitigate potential decrements in operationally-relevant performance (e.g. problem-solving, execution procedures), during increasingly earth-independent, future exploration missions (including in-mission and at landing). (4) HSIA-701:We need to determine how human-automation-robotic systems can be optimized for effective enhancement and monitoring of crew capabilities, health, and performance, during increasingly earth-independent, future exploration missions (including in-mission and at landing).

Flight Assignment/Project Notes:	NOTE: End date changed to 08/31/2025 per A. Beitman/JSC (Ed., 9/19/24) NOTE: End date changed to 02/28/2025 per A. Beitman/JSC (Ed., 1/25/23)

Task Description:	This task is part of the Human Capabilities Assessments for Autonomous Missions (HCAAM) Virtual NASA Specialized Center of Research (VNSCOR). The research objective of this proposal is to investigate the impact of using Virtual Assistants (VA) to support crewmembers in the context of anomaly treatment during Long Duration Exploration Missions (LDEM), when ground support will be limited. A VA will be developed building upon the software architecture from existing VAs developed by the Principal Investigator (PI) for similar purposes. The VA will provide support for various aspects of anomaly treatment, including detecting and diagnosing the anomaly, as well as recommending a course of action. It will also have the ability to take initiative in the dialog with the user (mixed-initiative mode), and the ability to provide explanations for its actions. The impact of the VA on performance, cognitive workload, situational awareness, and trust, will be assessed through a set of three experiments with human subjects in a laboratory environment. The first experiment will establish the baseline impact (master-slave, no explanations), and subsequent experiments will study the effect of switching to the mixed-initiative mode and adding explanations. The system will also be deployed and tested in the Human Exploration Research Analog (HERA) analog environment.

Research Impact/Earth Benefits:	This project will provide standards and guidelines that will help NASA design similar virtual assistants to support astronauts during future long duration exploration missions. Such standards and guidelines will concern both the functionality and the user interface of the virtual assistant.

Task Progress & Bibliography Information FY2024

Task Progress:

In Year 5 of this project, we have worked primarily on Specific Aims 2 (Enhanced VA with self-explaining abilities) and 3 (validation in analog), although we have also worked on revisions for the journal paper corresponding to Specific Aim 1 (validation of the baseline agent in a lab environment).

Concerning Specific Aim 2, we concluded Lab Experiment 2 on the effect of explanations and have submitted the corresponding two journal papers. This experiment addresses the following research question: How do explanations affect human performance, trust, cognitive workload (CW), situational awareness (SA), user satisfaction, and self-confidence for different levels of agent accuracy and uncertainty in human-AI collaborative anomaly diagnosis? The first paper focuses on the effects of explanations for different levels of agent accuracy, while the second paper focuses on the effects of uncertainty.

The protocol for this experiment was as follows. Subjects start with a background survey and test and then they proceed to do two sessions, one in each explanation condition (order counterbalanced). In each session they work on 8 anomaly scenarios, 4 with high uncertainty and 4 with low uncertainty, in random order. After each anomaly, subjects fill out a confidence survey, a Jian survey, and a satisfaction survey. After each session, they also fill out a NASA Task Load Index (TLX), a Situational Awareness Rating Technique (SART) questionnaire, another Jian survey, and another satisfaction survey.

30 subjects were recruited and performed the experiment. The major findings from the experiment are as follows:

Effect of explanations (within-subjects comparison using surveys done after each condition) 1. Explanations improve trust (p<0.001) 2. Explanations improve #anomalies correctly diagnosed (p=0.0039), but do not significantly change the time to diagnosis (p=0.37) 3. Explanations improve SA (p=0.009) 4. Explanations slightly increase CW, but effect is not significant (p=0.108) 5. Explanations improve user satisfaction (p<0.001) 6. Explanations improve user confidence (p<0.001)

Effect of uncertainty (within-subjects comparison using surveys done after each anomaly) 1. (High) Uncertainty decreases trust (p<0.001) 2. (High) Uncertainty decreases #anomalies correctly diagnosed (p<0.001) and time to diagnosis (p<0.001) 3. (High) Uncertainty decreases user satisfaction (p<0.001) 4. (High) Uncertainty decreases user confidence (p<0.001)

The effect of uncertainty on CW and SA could not be measured due to the experimental design – they were only measured once with explanations and once without, averaging over the effects of uncertainty in both cases.

The effects of accuracy were not significant, presumably due to individual differences leading to large variability between subjects. The only exception is that accuracy significantly improved the number of anomalies correctly diagnosed (p<0.001). The interaction effect between explanations and accuracy was also not significant.

Concerning Specific Aim 3, we completed the C6 campaign and prepared for the C7 campaign.

Results from the HERA C6 campaign: The major findings from the HERA C6 campaign (N=16) are as follows: • All subjects correctly resolved all scenarios. • Primary results did now show significant effects of the VA on any metrics. • The only exception is that attentional demand (a component of SA) was significantly higher with VA than without (n=16, V=21, p=0.03). • For group scenarios only, CW was higher with Daphne (t (15) = -6.0207, p <0.001).

Some of the insights we got from the exit interviews are as follows. The crew generally exhibited strong interest in using VAs for anomaly resolution, and enjoyed using Daphne. Many subjects showed an interest in the social aspects of VA and mentioned that they “attributed a personality to Daphne” and “talked about her as if she were another crewmember”. They all mentioned establishing trust very quickly once and for all thanks to Daphne “getting it right the first 3 times or so”. Almost none of them found the question answering capabilities essential because they were “going for speed” and didn’t feel like they needed to ask any questions. However, many subjects mentioned that question answering would be very useful in cases where Daphne recommended more than one diagnosis with the same confidence level. Moreover, crewmembers expressed an interest in the more interactive diagnosis and advanced explanations capabilities we are currently developing as something that would “significantly increase the usefulness” of the tool. Finally, they all confirmed that the scenarios generally felt very easy to diagnose and adding some more complexity would make it more interesting and fun. Note that detailed data regarding the interactions between the crewmembers and Daphne has not been analyzed yet.

Preparation of HERA C7: In C7, we will test a more advanced version of the Daphne agent with improved self-explaining abilities while also testing more complex anomaly scenarios in which there are cascading and simultaneous failures. The timeline for the campaign is shown below. • Phase 1 – Group sessions to study the effect of time delay (crew allowed to talk to MCC) – MD1-4: Group study with and without VA at 0s time delay – 2x1h – MD5-10: Group study with and without VA at 5s time delay – 2x1h – MD11-12: Group study with and without VA at 1min time delay – 2x1h • Phase 2 – Individual sessions for main study (not allowed to talk to MCC) – Week 3: Individual training, 1h per crewmember, 2 simple scenarios, 1 with VA and 1 without – Weeks 4-6: 2 sessions per crewmember per week, 1 with VA and 1 without, 3x4x2x1h=24h

As shown above, the group scenarios from C6 have been replaced with a shorter study on the effect of cislunar time delays (5sec) compared to no delays or longer delays (1 min). For these scenarios, the crew will also be allowed to communicate with MCC, to gain insight into the roles a VA can play for cislunar operators given that MCC is available albeit with a non-negligible time delay of a few seconds.

The second phase is the main phase of individual scenarios and follows the same design as in C6, with the caveat that a (re)-training session is done for each crewmember in week 3 before they start their individual tests.

Finally, in the next year, we also plan on finalizing the 3 publications under review (which should finalize work related to Specific Aims 1 and 2) and submitting a fourth one on the results of C6.

Bibliography:

Description: (Last Updated: 02/21/2025)

Show Cumulative Bibliography


Abstracts for Journals and Proceedings	Selva D, Dutta P, Josan PK, Dunbar BJ, Wong KRW, Diaz-Artiles A. "Virtual assistant for anomaly resolution in long duration exploration missions: Overview of results so far and next steps." 2024 Human Research Program Investigators’ Workshop, Galveston, Texas, February 13-16, 2024. Abstracts. 2024 Human Research Program Investigators’ Workshop, Galveston, Texas, February 13-16, 2024. , Feb-2024

Abstracts for Journals and Proceedings	Dutta P, Josan PK, Dunbar BJ, Wong KRW, Diaz-Artiles A, Selva D. "Effect of agent accuracy, confidence, and transparency on the evolution of trust in AI-assisted anomaly diagnosis." 2024 Human Research Program Investigators’ Workshop, Galveston, Texas, February 13-16, 2024. Abstracts. 2024 Human Research Program Investigators’ Workshop, Galveston, Texas, February 13-16, 2024. , Feb-2024

Abstracts for Journals and Proceedings	Josan PK, Dutta P, Dunbar BJ, Wong KRW, Selva D, Diaz-Artiles A. "Results from Hera analog testing of a virtual assistant for anomaly resolution." 2024 Human Research Program Investigators’ Workshop, Galveston, Texas, February 13-16, 2024. Abstracts. 2024 Human Research Program Investigators’ Workshop, Galveston, Texas, February 13-16, 2024. , Feb-2024

Back to Search Results

Project Title: HCAAM VNSCOR: Virtual Assistant for Spacecraft Anomaly Treatment During Long Duration Exploration Missions

Reduce

Images:

Fiscal Year: FY 2023
Division: Human Research
Research Discipline/Element:
HRP HFBP:Human Factors & Behavioral Performance (IRP Rev H)

Start Date: 03/06/2019
End Date: 02/28/2025
Task Last Updated: 01/10/2023

Download Task Book report in PDF pdf

Principal Investigator/Affiliation: Selva, Daniel Ph.D. / Texas A&M University

Address:

Aerospace Engineering Department
701 Ross St 3141 TAMU
College Station , TX 77843-0001

Email: dselva@tamu.edu
Phone: 607-255-6351
Congressional District: 17
Web:

Organization Type: UNIVERSITY
Organization Name: Texas A&M University
Joint Agency:
Comments:

Co-Investigator(s)
Affiliation:

Diaz Artiles, Ana Ph.D.	Texas A&M Engineering Experiment Station
Dunbar, Bonnie Ph.D.	Texas A&M Engineering Experiment Station
Wong, Raymond Ka Wai Ph.D.	Texas A & M, College Station

Project Information: Grant/Contract No. 80NSSC19K0656

Responsible Center: NASA JSC Grant Monitor: Whitmire, Alexandra Center Contact: alexandra.m.whitmire@nasa.gov Unique ID: 12299	Solicitation / Funding Source: 2017-2018 HERO 80JSC017N0001-BPBA Topics in Biological, Physiological, and Behavioral Adaptations to Spaceflight. Appendix C Grant/Contract No.: 80NSSC19K0656 Project Type: Ground Flight Program: TechPort: Yes	No. of Post Docs: 1 No. of PhD Candidates: 2 No. of Master's Candidates: 0 No. of Bachelor's Candidates: 2	No. of PhD Degrees: 0 No. of Master's Degrees: 0 No. of Bachelor's Degrees: 2

Human Research Program Elements:	(1) HFBP:Human Factors & Behavioral Performance (IRP Rev H)

Human Research Program Risks:	(1) HSIA:Risk of Adverse Outcomes Due to Inadequate Human Systems Integration Architecture

Human Research Program Gaps:	(1) HSIA-101:We need to identify the Human Systems Integration (HSI) – relevant crew health and performance outcomes, measures, and metrics, needed to characterize and mitigate risk, for future exploration missions. (2) HSIA-201:We need to evaluate the demands of future exploration habitat/vehicle systems and mission scenarios (e.g. increased automation, multi-modal communication) on individuals and teams, and determine the risks these demands pose to crew health and performance. (3) HSIA-401:We need to determine how HSI can be applied in the vehicle/habitat and computer interface Design Phase to mitigate potential decrements in operationally-relevant performance (e.g. problem-solving, execution procedures), during increasingly earth-independent, future exploration missions (including in-mission and at landing). (4) HSIA-701:We need to determine how human-automation-robotic systems can be optimized for effective enhancement and monitoring of crew capabilities, health, and performance, during increasingly earth-independent, future exploration missions (including in-mission and at landing).

Flight Assignment/Project Notes:	NOTE: End date changed to 02/28/2025 per A. Beitman/JSC (Ed., 1/25/23)

Task Description:	This task is part of the Human Capabilities Assessments for Autonomous Missions (HCAAM) Virtual NASA Specialized Center of Research (VNSCOR). The research objective of this proposal is to investigate the impact of using Virtual Assistants (VA) to support crewmembers in the context of anomaly treatment during Long Duration Exploration Missions (LDEM), when ground support will be limited. A VA will be developed building upon the software architecture from existing VAs developed by the Principal Investigator (PI) for similar purposes. The VA will provide support for various aspects of anomaly treatment, including detecting and diagnosing the anomaly, as well as recommending a course of action. It will also have the ability to take initiative in the dialog with the user (mixed-initiative mode), and the ability to provide explanations for its actions. The impact of the VA on performance, cognitive workload, situational awareness, and trust, will be assessed through a set of three experiments with human subjects in a laboratory environment. The first experiment will establish the baseline impact (master-slave, no explanations), and subsequent experiments will study the effect of switching to the mixed-initiative mode and adding explanations. The system will also be deployed and tested in the Human Exploration Research Analog (HERA) analog environment.

Research Impact/Earth Benefits:	This project will provide standards and guidelines that will help NASA design similar virtual assistants to support astronauts during future long duration exploration missions. Such standards and guidelines will concern both the functionality and the user interface of the virtual assistant.

Task Progress & Bibliography Information FY2023

Task Progress:

In Year 4 of this project, we worked primarily on specific Aims 2 and 3, although we also submitted the journal paper corresponding to specific Aim 1 (validation of the baseline agent in a lab environment).

Concerning Specific Aim 2, we added the explanation capabilities to the VA and started data collection and analysis for Lab Experiment 2. This experiment addresses the following research question: How do explanations affect human performance, trust, cognitive workload (CW), situational awareness (SA), user satisfaction, and self-confidence for different levels of accuracy and uncertainty in human-AI collaborative anomaly diagnosis?

The protocol is as follows. Subjects start with a background survey and test, and then they proceed to do two sessions, one in each explanation condition (order counterbalanced). In each session, they work on 8 anomaly scenarios, 4 with high uncertainty and 4 with low uncertainty, in random order. After each anomaly, subjects fill out a confidence survey, a Jian survey, and a satisfaction survey. After each session, they also fill out a NASA Task Load Index (TLX), a Situational Awareness Rating Technique (SART) questionnaire, another Jian survey, and another satisfaction survey. A Situation Awareness Global Assessment Technique (SAGAT) test is done twice per session (once per uncertainty level).

So far, 18 subjects have been recruited and performed the experiment, but data for 3 subjects had to be deleted because of a procedural mistake during those sessions. Currently, data collection is halted while we wait for approval of an IRB modification regarding the details of this experiment’s protocol and compensation. The major findings from the preliminary results (N=15) are as follows: 1. Explanations improve trust (p=0.01) 2. Explanations do not significantly improve performance; #anomalies correctly diagnosed increases (p=0.14), but time to diagnosis also increases (p=0.30) 3. Explanations improve SA (p=0.018), primarily by improving understanding (P<0.01) 4. Explanations increase CW but effect is not significant (p=0.14) 5. Explanations improve user satisfaction (p<0.01) 6. Explanations improve user confidence (p<0.01) 7. Performance is higher for low uncertainty (p<0.01) 8. Trust is higher for low uncertainty (p<0.01)

A more detailed analysis, including the accuracy levels, will be done with the full dataset (N=36).

Concerning Specific Aim 3, we completed the C6M3 mission and will complete the last mission by the end of this year’s period of performance, which should get us close to completing the nominal scope of Specific Aim 3, pending publication of the results. The major findings from the HERA C6M1-M3 missions (N=12) are as follows: 1. All subjects correctly resolved all scenarios. 2. Time to resolution is lower without Daphne but not significant (p=0.2). We attribute this to some logon issues we encountered, especially in mission 1. Our perception is that the diagnosis was very fast, both with and without Daphne, suggesting that the anomaly scenarios were “too easy.” The subjects did spend considerable time conducting the procedures to resolve the anomalies, but that was not the focus of our study. 3. CW is higher with Daphne, but this difference was not significant (p=0.3). 4. SA is lower with Daphne, but this difference was not significant (p=0.19).

Some of the insights we got from the exit interviews are as follows: The crew generally exhibited a strong interest in using VAs for anomaly resolution and enjoyed using Daphne. Most of them showed an interest in the social aspects of VA and mentioned that they “attributed a personality to Daphne” and “talked about her as if she were another crewmember.” They all mentioned establishing trust very quickly once and for all thanks to Daphne “getting it right the first 3 times or so.” Almost none of them found the question-answering capabilities essential because they were “going for speed” and didn’t feel like they needed to ask any questions. However, many subjects mentioned that question-answering would be very useful in cases where Daphne recommended more than one diagnosis with the same confidence level. Moreover, crewmembers expressed an interest in the more interactive diagnosis and advanced explanations capabilities we are currently developing as something that would “significantly increase the usefulness” of the tool. Finally, they all confirmed that the scenarios generally felt very easy to diagnose and adding some more complexity would make it more interesting and fun. Note that detailed data regarding the interactions between the crewmembers and Daphne has not been analyzed yet.

The final data analysis will be done once the C6M4 mission is complete. Of note, we have extended the scope of specific Aim 3 through an extension of the grant that will allow us to participate in the HERA C7 campaign, which we have started to prepare. In C7, we will be able to test a more advanced version of the Daphne agent while also testing more complex anomaly scenarios.

In terms of publications, we submitted a journal paper on Specific Aim 1 to the Human Factors journal, which is currently under a second round of review and presented a paper on Specific Aim 2 at the 2022 Applied Human Factors and Ergonomics (AHFE) conference [15]. We will also have one talk and one poster presentation at the 2023 Human Research Project Investigator's Workshop (HRP IWS) in February.

As for the next steps, in Year 4, we will finalize the C6 campaign and start C7 (Specific Aim 3). We plan on publishing a journal paper with the results of C6. Otherwise, most of the activity will focus on finalizing lab experiments 2 and 3. We will work on additional journal papers on lab experiment, lab experiment 3, and the C7 campaign.

Bibliography:

Description: (Last Updated: 02/21/2025)

Show Cumulative Bibliography


Abstracts for Journals and Proceedings	Selva D, Dutta P, Josan PK, Dunbar, BJ, Wong RKW, Diaz-Artiles A. "Virtual Assistant for anomaly resolution in long duration exploration missions: Preliminary results and next steps." 2023 NASA Human Research Program Investigators' Workshop, Galveston, Texas, February 7-9, 2023. Abstracts, 2023 NASA Human Research Program Investigators' Workshop, Galveston, Texas, February 7-9, 2023. , Feb-2023

Abstracts for Journals and Proceedings	Dutta P, Josan PK, Dunbar, BJ, Wong RKW, Diaz-Artiles A, Selva D. "Effects of explanations by virtual assistant for anomaly resolution tasks in long duration exploration missions." 2023 NASA Human Research Program Investigators' Workshop, Galveston, Texas, February 7-9, 2023. Abstracts, 2023 NASA Human Research Program Investigators' Workshop, Galveston, Texas, February 7-9, 2023. , Feb-2023

Papers from Meeting Proceedings	Dutta P, Josan PL, Wong RKK, Dunbar BJ, Diaz Artiles A, Selva D. "Effect of Explanations in AI-Assisted Anomaly Treatment for Human Spaceflight Missions." 2022 Human Factors and Ergonomics Society Annual Meeting, Atlanta, GA, Oct 10 - 14, 2022. 2022 Human Factors and Ergonomics Society Annual Meeting, Atlanta, GA, Oct 10 - 14, 2022.. Vol 66(1), pp. 697-701. , Oct-2022

Back to Search Results

Project Title: HCAAM VNSCOR: Virtual Assistant for Spacecraft Anomaly Treatment During Long Duration Exploration Missions

Reduce

Images:

Fiscal Year: FY 2022
Division: Human Research
Research Discipline/Element:
HRP HFBP:Human Factors & Behavioral Performance (IRP Rev H)

Start Date: 03/06/2019
End Date: 03/05/2023
Task Last Updated: 01/05/2022

Download Task Book report in PDF pdf

Principal Investigator/Affiliation: Selva, Daniel Ph.D. / Texas A&M University

Address:

Aerospace Engineering Department
701 Ross St 3141 TAMU
College Station , TX 77843-0001

Email: dselva@tamu.edu
Phone: 607-255-6351
Congressional District: 17
Web:

Organization Type: UNIVERSITY
Organization Name: Texas A&M University
Joint Agency:
Comments:

Co-Investigator(s)
Affiliation:

Diaz Artiles, Ana Ph.D.	Texas A&M Engineering Experiment Station
Dunbar, Bonnie Ph.D.	Texas A&M Engineering Experiment Station
Wong, Raymond Ka Wai Ph.D.	Texas A & M, College Station

Project Information: Grant/Contract No. 80NSSC19K0656

Responsible Center: NASA JSC Grant Monitor: Whitmire, Alexandra Center Contact: alexandra.m.whitmire@nasa.gov Unique ID: 12299	Solicitation / Funding Source: 2017-2018 HERO 80JSC017N0001-BPBA Topics in Biological, Physiological, and Behavioral Adaptations to Spaceflight. Appendix C Grant/Contract No.: 80NSSC19K0656 Project Type: Ground Flight Program: TechPort: Yes	No. of Post Docs: 1 No. of PhD Candidates: 2 No. of Master's Candidates: 0 No. of Bachelor's Candidates: 2	No. of PhD Degrees: 0 No. of Master's Degrees: 0 No. of Bachelor's Degrees: 0

Human Research Program Elements:	(1) HFBP:Human Factors & Behavioral Performance (IRP Rev H)

Human Research Program Risks:	(1) HSIA:Risk of Adverse Outcomes Due to Inadequate Human Systems Integration Architecture

Human Research Program Gaps:	(1) HSIA-101:We need to identify the Human Systems Integration (HSI) – relevant crew health and performance outcomes, measures, and metrics, needed to characterize and mitigate risk, for future exploration missions. (2) HSIA-201:We need to evaluate the demands of future exploration habitat/vehicle systems and mission scenarios (e.g. increased automation, multi-modal communication) on individuals and teams, and determine the risks these demands pose to crew health and performance. (3) HSIA-401:We need to determine how HSI can be applied in the vehicle/habitat and computer interface Design Phase to mitigate potential decrements in operationally-relevant performance (e.g. problem-solving, execution procedures), during increasingly earth-independent, future exploration missions (including in-mission and at landing). (4) HSIA-701:We need to determine how human-automation-robotic systems can be optimized for effective enhancement and monitoring of crew capabilities, health, and performance, during increasingly earth-independent, future exploration missions (including in-mission and at landing).

Task Description:	This task is part of the Human Capabilities Assessments for Autonomous Missions (HCAAM) Virtual NASA Specialized Center of Research (VNSCOR). The research objective of this proposal is to investigate the impact of using Virtual Assistants (VA) to support crewmembers in the context of anomaly treatment during Long Duration Exploration Missions (LDEM), when ground support will be limited. A VA will be developed building upon the software architecture from existing VAs developed by the Principal Investigator (PI) for similar purposes. The VA will provide support for various aspects of anomaly treatment, including detecting and diagnosing the anomaly, as well as recommending a course of action. It will also have the ability to take initiative in the dialog with the user (mixed-initiative mode), and the ability to provide explanations for its actions. The impact of the VA on performance, cognitive workload, situational awareness, and trust, will be assessed through a set of three experiments with human subjects in a laboratory environment. The first experiment will establish the baseline impact (master-slave, no explanations), and subsequent experiments will study the effect of switching to the mixed-initiative mode and adding explanations. The system will also be deployed and tested in the Human Exploration Research Analog (HERA) analog environment.

Research Impact/Earth Benefits:	This project will provide standards and guidelines that will help NASA design similar virtual assistants to support astronauts during future long duration exploration missions. Such standards and guidelines will concern both the functionality and the user interface of the virtual assistant.

Task Progress & Bibliography Information FY2022

Task Progress:

In Year 3 of this project, we have completed the first lab experiment, thus achieving Specific Aim 1, pending publication of the journal paper with the result. A total of 12 participants (mean age ± SD = 24.8 ± 1.9 years) were selected from an astronaut-like population. The participants were largely comprised of students at Texas A&M University, who had either already completed or were currently pursuing a graduate degree in a science, technology, engineering, mathematics (STEM) field. In addition, most participants had previous experience working with technical procedures. Each participant completed two experiment sessions: a control session without Daphne and a treatment session with Daphne. During both conditions, subjects were tasked with detecting, diagnosing, and resolving Environmental Control and Life Support Subsystem (ECLSS) anomaly scenarios. We selected two different anomaly scenario groups (5 anomaly scenarios per group) based on the perceived difficulty of resolving that particular anomaly and completing its corresponding resolution procedure. We counterbalanced participants with respect to which anomaly group they received first and in which experiment session they could use Daphne.

We measured performance metrics by evaluating the number of anomalies that participants could successfully resolve with and without the use of Daphne-AT. Additional performance metrics included the time it took for each subject to solve each anomaly, whether or not the subject detected the anomaly, diagnosed the root cause, and selected the correct anomaly resolution procedure. We also recorded the number of attempts for the latter three metrics. In addition, we measured cognitive workload by employing the NASA Task Load Index (TLX) survey, and situational awareness using the Situational Awareness Rating Technique (SART). Finally, the participants’ trust in autonomous systems was measured using Jian's trust scale.

The summary of the results is as follows. For performance, a significant difference (p=0.002) was found in the number of anomalies correctly resolved with Daphne (4.1±0.3) vs. without Daphne (2.7±0.4). This supports our hypothesis that a VA can improve performance in anomaly resolution. For cognitive workload, a significant difference (p=0.002) was found in the TLX score with Daphne (45.53±6.67) vs. without Daphne (65.97±4.81). This supports our hypothesis that a VA can reduce cognitive workload in anomaly resolution. For situational awareness, a marginally significant difference (p=0.052) was found in the SART score with Daphne (5.19±0.56) vs. without Daphne (4.41±0.71). This does not fully support our hypothesis that a VA can improve situational awareness in anomaly resolution. Finally, the responses to the Jian scale items showed generally high trust in automation. A much more detailed description of the experiment and the results is in the paper about to be submitted to the Human Factors and Ergonomics Society (HFES) journal. The draft paper will be provided to the NASA Human Research Program (HRP) as soon as it is submitted.

This year we also completed the first HERA mission Campaign 6, Mission XXII (C6M1). Campaign 6 / C6M2 will also be complete by the time the period of performance of this report is exhausted, which gets us about halfway in meeting Specific Aim 3. Our HERA study was divided into two phases. In Phase 1, each crewmember performed two experiment sessions per week. In each session, the crewmember worked on a single anomaly scenario, either with Daphne or without. Thus, at the end of Phase 1, each subject had worked individually on 3 anomalies with Daphne and 3 without. Because of a technical issue with Daphne, one of the crewmembers ended up with only 2 anomalies with Daphne and 4 without Daphne. In Phase 2, crew members worked together as a team. They conducted two sessions per week, and worked on one anomaly scenario per session. This resulted in a total of 3 anomalies with Daphne and 3 without. Preliminary results are as follows.

Concerning performance, all scenarios were successfully resolved and therefore this was not an interesting metric. We are currently looking at the time to diagnosis of the anomaly to see if there are any interesting trends there. Trends appear to indicate that in some cases the crew took more time to solve scenarios with Daphne than without, especially in Phase 2, but we attribute this to some logon issues we encountered towards the end of the mission. Our perception is that the diagnosis was very fast both with and without Daphne, suggesting that the anomaly scenarios were “too easy”. The subjects did spend considerable time conducting the procedures to resolve the anomalies, but that was not the focus of our study.

Concerning cognitive workload, we observed a small difference between the TLX scores with Daphne (23.42±5.17) and without Daphne (25.20±3.54) but this difference was not significant with only 4 subjects (p=0.42). It remains to be seen if the effect becomes significant with the full 16 subjects. It is also interesting to note that these scores are very low compared to the scores obtained in the laboratory experiments. This could be due to the shorter training and less experience of the participants of the lab experiments compared to the HERA crew. This is also consistent with the explanation that the scenarios were too “easy” for the HERA crew.

Concerning situational awareness, we observed the opposite trend compared to the lab experiments, although in both cases the effects are not significant. In this case, SART scores were slightly lower with Daphne (20.46±3.35) than without Daphne (21.21±3.50), for a p-value of 0.41. We note that the effects were not consistent among crewmembers, and the overall result is driven by one of the 4 crewmembers, who reported a significantly higher level of situational awareness without Daphne vs. with Daphne. We are looking in more detail into the components of the SART score (understanding, demand, supply) because the trends appear to be conflicting there too. While the literature has reported cases where using some kind of assistance results in a decrease of situational awareness (e.g., autopilots in commercial aircraft), this may be due to other factors (e.g., minor technical issues with the VA) as well. More research is needed to understand this – we hope that missions 2,3,4 will help us elucidate this issue. We also note that crewmembers mentioned in the exit debrief interviews that adding explanations would likely significantly increase situational awareness with Daphne, and this is something that we plan on testing in our lab experiments 2 and 3.

Concerning trust, the results were similar to the ones from the lab experiments, showing high trust in the VA across the board. In the exit interviews, crewmembers indicated that they trusted Daphne “right away”, after the first few of her recommendations were proven to be correct.

Some of the insights we got from the exit interviews are as follows. The crew generally exhibited strong interest in using VAs for anomaly resolution, and enjoyed using Daphne. Most of them showed an interest in the social aspects of VA and mentioned that they “attributed a personality to Daphne” and “talked about her as if she were another crewmember”. They all mentioned establishing trust very quickly, once and for all, thanks to Daphne “getting it right the first 3 times or so”. Almost none of them found the question answering capabilities essential because they were “going for speed” and didn’t feel like they needed to ask any questions. However, all of them mentioned that question answering would be very useful in cases where Daphne recommended more than one diagnosis with the same confidence level. Moreover, crewmembers expressed an interest in the more interactive diagnosis and advanced explanations capabilities we are currently developing as something that would “significantly increase the usefulness” of the tool. Finally, they all confirmed that the scenarios generally felt very easy to diagnose and adding some more complexity would make it more interesting and fun.

As for next steps, in Year 4 we will support the remaining missions of C6, thus finalizing Specific Aim 3. We plan on publishing a journal paper with the results. Otherwise, most of the activity will focus on achieving Specific Aim 2: we will finalize the development of the enhanced VA capabilities (explanations and mixed initiative) and conduct lab experiments 2 and 3.

Bibliography:

Description: (Last Updated: 02/21/2025)

Show Cumulative Bibliography


Abstracts for Journals and Proceedings	Selva D, Josan P, Dutta P, Abbott R, Viros i Martin A, York K, Dunbar B, Wong RKW, Diaz Artiles A. "Virtual assistant for anomaly resolution In long duration exploration missions: Baseline effects on performance, cognitive workload, and situational awareness." 2022 NASA Human Research Program Investigators' Workshop, Virtual. February 7-10, 2022. Abstracts. 2022 NASA Human Research Program Investigators' Workshop, Virtual. February 7-10, 2022 (Abstract #1133-000353). , Feb-2022

Abstracts for Journals and Proceedings	Josan P, Dutta P, Abbott R, Viros A, York K, Dunbar B, Wong R, Selva D, Diaz-Artiles A. "Results from first laboratory testing of virtual assistant Daphne-AT. " 2022 NASA Human Research Program Investigators' Workshop, Virtual. February 7-10, 2022. Abstracts. 2022 NASA Human Research Program Investigators' Workshop, Virtual. February 7-10, 2022 (Abstract #1133-000351). , Feb-2022

Abstracts for Journals and Proceedings	Dutta P, Kaur PK, Viros A, York K, Abbott R, Dunbar B, Wong R, Diaz-Artiles A, Selva D. "Virtual assistants for anomaly treatment – lessons learned and path forward." 2022 NASA Human Research Program Investigators' Workshop, Virtual. February 7-10, 2022. Abstracts. 2022 NASA Human Research Program Investigators' Workshop, Virtual. February 7-10, 2022 (Abstract #1133-000410). , Feb-2022

Back to Search Results

Project Title: HCAAM VNSCOR: Virtual Assistant for Spacecraft Anomaly Treatment During Long Duration Exploration Missions

Reduce

Images:

Fiscal Year: FY 2021
Division: Human Research
Research Discipline/Element:
HRP HFBP:Human Factors & Behavioral Performance (IRP Rev H)

Start Date: 03/06/2019
End Date: 03/05/2023
Task Last Updated: 01/06/2021

Download Task Book report in PDF pdf

Principal Investigator/Affiliation: Selva, Daniel Ph.D. / Texas A&M University

Address:

Aerospace Engineering Department
701 Ross St 3141 TAMU
College Station , TX 77843-0001

Email: dselva@tamu.edu
Phone: 607-255-6351
Congressional District: 17
Web:

Organization Type: UNIVERSITY
Organization Name: Texas A&M University
Joint Agency:
Comments:

Co-Investigator(s)
Affiliation:

Diaz Artiles, Ana Ph.D.	Texas A&M Engineering Experiment Station
Dunbar, Bonnie Ph.D.	Texas A&M Engineering Experiment Station
Wong, Raymond Ka Wai Ph.D.	Texas A & M, College Station

Project Information: Grant/Contract No. 80NSSC19K0656

Responsible Center: NASA JSC Grant Monitor: Whitmire, Alexandra Center Contact: alexandra.m.whitmire@nasa.gov Unique ID: 12299	Solicitation / Funding Source: 2017-2018 HERO 80JSC017N0001-BPBA Topics in Biological, Physiological, and Behavioral Adaptations to Spaceflight. Appendix C Grant/Contract No.: 80NSSC19K0656 Project Type: Ground Flight Program: TechPort: Yes	No. of Post Docs: 0 No. of PhD Candidates: 2 No. of Master's Candidates: 1 No. of Bachelor's Candidates: 2	No. of PhD Degrees: 0 No. of Master's Degrees: 0 No. of Bachelor's Degrees: 0

Human Research Program Elements:	(1) HFBP:Human Factors & Behavioral Performance (IRP Rev H)

Human Research Program Risks:	(1) HSIA:Risk of Adverse Outcomes Due to Inadequate Human Systems Integration Architecture

Human Research Program Gaps:	(1) HSIA-101:We need to identify the Human Systems Integration (HSI) – relevant crew health and performance outcomes, measures, and metrics, needed to characterize and mitigate risk, for future exploration missions. (2) HSIA-201:We need to evaluate the demands of future exploration habitat/vehicle systems and mission scenarios (e.g. increased automation, multi-modal communication) on individuals and teams, and determine the risks these demands pose to crew health and performance. (3) HSIA-401:We need to determine how HSI can be applied in the vehicle/habitat and computer interface Design Phase to mitigate potential decrements in operationally-relevant performance (e.g. problem-solving, execution procedures), during increasingly earth-independent, future exploration missions (including in-mission and at landing). (4) HSIA-701:We need to determine how human-automation-robotic systems can be optimized for effective enhancement and monitoring of crew capabilities, health, and performance, during increasingly earth-independent, future exploration missions (including in-mission and at landing).

Task Description:	This task is part of the Human Capabilities Assessments for Autonomous Missions (HCAAM) Virtual NASA Specialized Center of Research (VNSCOR). The research objective of this proposal is to investigate the impact of using Virtual Assistants (VA) to support crew members in the context of anomaly treatment during Long Duration Exploration Missions (LDEM), when ground support will be limited. A VA will be developed building upon the software architecture from existing VAs developed by the Principal Investigator (PI) for similar purposes. The VA will provide support for various aspects of anomaly treatment, including detecting and diagnosing the anomaly, as well as recommending a course of action. It will also have the ability to take initiative in the dialog with the user (mixed-initiative mode), and the ability to provide explanations for its actions. The impact of the VA on performance, cognitive workload, situational awareness, and trust, will be assessed through a set of three experiments with human subjects in a laboratory environment. The first experiment will establish the baseline impact (master-slave, no explanations), and subsequent experiments will study the effect of switching to the mixed-initiative mode and adding explanations. The system will also be deployed and tested in the Human Exploration Research Analog (HERA) analog environment.

Research Impact/Earth Benefits:	This project will provide standards and guidelines that will help NASA design similar virtual assistants to support astronauts during future long duration exploration missions. Such standards and guidelines will concern both the functionality and the user interface of the virtual assistant.

Task Progress & Bibliography Information FY2021

Task Progress:

In Year 2 of this project, we have completed the first round of testing of the Daphne-AT prototype and conducted a pilot study. We worked closely with HERA throughout the integration and testing of our study as part of the C6 campaign. We are now getting ready for a final training documentation delivery to NASA this month, and a first system-level test of our study with the entire team at Johnson Space Center (JSC), assuming the COVID situation permits it.

To be able to do experiments at Texas A&M, we developed an immersive 3d user interface to simulate the Human Exploration Research Analog (HERA) habitat during our experiments at TAMU. The goal was for the simulator to have reasonable fidelity in terms of the general dimensions and shape as well as the main components present in the habitat related to the ECLSS system. The simulator is basically a first-person game developed in Unreal Engine 4 (UE4). All of the major components of the HERA Environmental Control and Life Support Subsystem (ECLSS) system are present and the user can interact with them (e.g., press buttons, open/close panels) as required by the anomaly resolution procedures. More details about this simulator are provided in

Woodruff R, Beebe N, Josan PK, Wong RKW, Dunbar BJ, Selva D, Diaz-Artiles A. (2021). 3D Interactive Model of HERA to support ECLSS anomaly resolution using a Virtual Assistant. 2021 IEEE Aerospace Conference Proceedings.

After validation of the UE4 simulator, we designed and conducted a pilot study of the first “Baseline” prototype of the assistant. It included five male subjects within the age range of 22-40 years old, and with a Standard Deviation (SD) of ±7.2. Prior to the experimental sessions, subjects were provided a 2-hour virtual training session about all the experiment elements (Daphne-AT, ECLSS subsystems, 3D virtual HERA environment, and experimental protocols) and they were guided through an example anomaly scenario. After the training, subjects participated in two in-person experiment sessions: one with Daphne-AT and the other one without Daphne-AT (counterbalanced among subjects). These sessions were conducted in two consecutive days at approximately the same time of the day. In each one of the sessions, the subjects solved four anomalies. Once the experiment sessions were over, the subjects completed a set of surveys.

Dependent variables included performance, cognitive workload, situational awareness, and trust in the context of ECLSS anomaly treatment, with and without the use of a VA. Performance is described as an individual’s ability to successfully treat the anomaly. On average, subjects solved 3.2 ± 0.37 anomalies using Daphne-AT and 2.0 ± 0.without Daphne-AT. Based on these limited data, our preliminary results support the hypothesis that the use of Daphne-AT will increase performance in the context of ECLSS anomaly resolution operations.

The NASA Task Load Index (TLX) survey was used to measure the effect of using Daphne-AT on cognitive workload. NASA TLX is an evaluation technique developed by NASA to assess relative importance of six pre-determined factors in determining how much workload a subject experience during a particular task, in this case, the task of anomaly treatment. These six factors include mental demand, physical demand, temporal demand, perceived performance, perceived effort, and frustration level. On average, subjects indicated higher cognitive workload during the sessions without Daphne-AT (57.2 ± 5.78 with Daphne-AT and 66 ± 7.31 without Daphne). These preliminary results support the hypothesis that the use of Daphne-AT will decrease cognitive workload. More details about this pilot study can be found in:

Josan PK, Dutta P, Woodruff R, Beebe N, York K, Balcells-Quintana O, Kluis L, Viros-i-Martin A, Dunbar BJ, Wong RKW, Selva D, Diaz-Artiles A. (2021). Experimental Design & Pilot Testing for ECLSS Anomaly Resolution using Daphne-AT Virtual Assistant. 2021 IEEE Aerospace Conference Proceedings.

As a result of the pilot testing and HERA integration activities, we have substantially refined the VA interface and added functionality to the back end. Specifically: 1) We adapted the telemetry feed to the new version of the HERA Environmental Control and Life Support System (ECLSS) simulator (now called Habitat System Simulator or HSS). 2) We updated the anomaly database to include more information for each anomaly (e.g., a time signature). 3) We expanded the knowledge base with more information about ECLSS anomalies, their associated risks, and how to resolve them. 4) We expanded the anomaly detection functions beyond basic thresholding to be able to detect outliers in the telemetry feed. 5) We made slight changes to the knowledge graph and procedures to adapt to HERA C6 schedule constraints.

In terms of publications, we had two papers accepted at the 2021 IEEE Aerospace Conference which will be presented in March. We also gave a presentation at the 2020 American Institute of Aeronautics and Astronautics (AIAA) ASCEND conference (Ed. note 1/7/2021: see FY2020 Bibliography in Task Book report) and will have one talk presentation and one poster presentation at the 2021 Human Research Program workshop in February.

Bibliography:

Description: (Last Updated: 02/21/2025)

Show Cumulative Bibliography


Papers from Meeting Proceedings	Woodruff R, Beebe N, Josan PK, Wong RKW, Dunbar BJ, Selva D, Diaz-Artiles A. "A 3D Interactive Model of HERA to Support ECLSS Anomaly Resolution Using a Virtual Assistant." 2021 IEEE Aerospace Conference, Big Sky, MT, Virtual, March 6-13, 2021. 2021 IEEE Aerospace Conference, Big Sky, MT, Virtual, March 6-13, 2021. Meeting paper. https://doi.org/10.1109/AERO50100.2021.9438341 , Mar-2021

Papers from Meeting Proceedings	Josan PK, Dutta P, Woodruff R, Beebe N, York K, Balcells-Quintana O, Kluis L, Viros-i-Martin A, Dunbar BJ, Wong RKW, Selva D, Diaz-Artiles A. "Experimental Design & Pilot Testing for ECLSS Anomaly Resolution Using Daphne-AT Virtual Assistant." 2021 IEEE Aerospace Conference, Big Sky, MT, Virtual, March 6-13, 2021. 2021 IEEE Aerospace Conference, Big Sky, MT, Virtual, March 6-13, 2021. Meeting paper. https://doi.org/10.1109/AERO50100.2021.9438497 , Mar-2021

Back to Search Results

Project Title: HCAAM VNSCOR: Virtual Assistant for Spacecraft Anomaly Treatment During Long Duration Exploration Missions

Reduce

Images:

Fiscal Year: FY 2020
Division: Human Research
Research Discipline/Element:
HRP HFBP:Human Factors & Behavioral Performance (IRP Rev H)

Start Date: 03/06/2019
End Date: 03/05/2023
Task Last Updated: 01/06/2020

Download Task Book report in PDF pdf

Principal Investigator/Affiliation: Selva, Daniel Ph.D. / Texas A&M University

Address:

Aerospace Engineering Department
701 Ross St 3141 TAMU
College Station , TX 77843-0001

Email: dselva@tamu.edu
Phone: 607-255-6351
Congressional District: 17
Web:

Organization Type: UNIVERSITY
Organization Name: Texas A&M University
Joint Agency:
Comments:

Co-Investigator(s)
Affiliation:

Diaz Artiles, Ana Ph.D.	Texas A&M Engineering Experiment Station
Dunbar, Bonnie Ph.D.	Texas A&M Engineering Experiment Station
Wong, Raymond Ka Wai Ph.D.	Texas A & M, College Station

Project Information: Grant/Contract No. 80NSSC19K0656

Responsible Center: NASA JSC Grant Monitor: Williams, Thomas Center Contact: 281-483-8773 thomas.j.will1@nasa.gov Unique ID: 12299	Solicitation / Funding Source: 2017-2018 HERO 80JSC017N0001-BPBA Topics in Biological, Physiological, and Behavioral Adaptations to Spaceflight. Appendix C Grant/Contract No.: 80NSSC19K0656 Project Type: Ground Flight Program: TechPort: Yes	No. of Post Docs: 0 No. of PhD Candidates: 2 No. of Master's Candidates: 0 No. of Bachelor's Candidates: 2	No. of PhD Degrees: 0 No. of Master's Degrees: 0 No. of Bachelor's Degrees: 0

Human Research Program Elements:	(1) HFBP:Human Factors & Behavioral Performance (IRP Rev H)

Human Research Program Risks:	(1) HSIA:Risk of Adverse Outcomes Due to Inadequate Human Systems Integration Architecture

Human Research Program Gaps:	(1) HSIA-101:We need to identify the Human Systems Integration (HSI) – relevant crew health and performance outcomes, measures, and metrics, needed to characterize and mitigate risk, for future exploration missions. (2) HSIA-201:We need to evaluate the demands of future exploration habitat/vehicle systems and mission scenarios (e.g. increased automation, multi-modal communication) on individuals and teams, and determine the risks these demands pose to crew health and performance. (3) HSIA-401:We need to determine how HSI can be applied in the vehicle/habitat and computer interface Design Phase to mitigate potential decrements in operationally-relevant performance (e.g. problem-solving, execution procedures), during increasingly earth-independent, future exploration missions (including in-mission and at landing). (4) HSIA-701:We need to determine how human-automation-robotic systems can be optimized for effective enhancement and monitoring of crew capabilities, health, and performance, during increasingly earth-independent, future exploration missions (including in-mission and at landing).

Task Description:	This task is part of the Human Capabilities Assessments for Autonomous Missions (HCAAM) Virtual NASA Specialized Center of Research (VNSCOR). The research objective of this proposal is to investigate the impact of using Virtual Assistants (VA) to support crew members in the context of anomaly treatment during Long Duration Exploration Missions (LDEM), when ground support will be limited. A VA will be developed building upon the software architecture from existing VAs developed by the Principal Investigator (PI) for similar purposes. The VA will provide support for various aspects of anomaly treatment, including detecting and diagnosing the anomaly, as well as recommending a course of action. It will also have the ability to take initiative in the dialog with the user (mixed-initiative mode), and the ability to provide explanations for its actions. The impact of the VA on performance, cognitive workload, situational awareness, and trust, will be assessed through a set of three experiments with human subjects in a laboratory environment. The first experiment will establish the baseline impact (master-slave, no explanations), and subsequent experiments will study the effect of switching to the mixed-initiative mode and adding explanations. The system will also be deployed and tested in the Human Exploration Research Analog (HERA) analog environment.

Research Impact/Earth Benefits:	This project will provide standards and guidelines that will help NASA design similar virtual assistants to support astronauts during future long duration exploration missions. Such standards and guidelines will concern both the functionality and the user interface of the virtual assistant.

Task Progress & Bibliography Information FY2020

Task Progress:

This project is one of seven projects in the Virtual NASA Specialized Center of Research (VNSCOR) for Human Capabilities Assessments for Autonomous Missions (HCAAM). The goal of our project is to develop a virtual assistant to help astronauts with anomaly resolution in long duration exploration missions.

In the first year of this project, we have completed a project definition phase in which we have more crisply defined the scientific objectives and the technical approach to design the virtual assistant. We started by selecting the Environmental Control and Life Support Subsystem (ECLSS) as the technical scope for the virtual assistant. We then proceeded to elicit high level requirements and use cases and perform the early design of the virtual assistant, which we are getting ready to test in our laboratory at Texas A&M University in the next month.

Specifically, in this first year, we have accomplished the following: 1) We have defined the high-level requirements and overall software architecture of the virtual assistant. 2) We have defined how the virtual assistant will interface with the Human Exploration Research Analog (HERA) infrastructure. 3) We have developed the first very basic prototype of the virtual assistant which includes: a) a simple anomaly database; b) a simple database containing knowledge about how the ECLSS system works in HERA and how to perform some maintenance and repair operations; c) an interface with the HERA simulated telemetry feed, which provides a data stream with all the measurements of the ECLSS subsystem; d) a web-based graphical user interface containing a main plot window displaying the telemetry feed and a chat box to chat with the virtual assistant; e) the natural language processing layer that the assistant uses to process the natural language requests; f) the machine learning layer that is used to understand the intent of the question and send it to the relevant part of the software, i.e., the one that knows how to answer it; f) the first version of the back end of the software, which is precisely the part of the software that performs the calculations needed to answer the questions and requests from the user. While this first prototype is not sophisticated in terms of the breadth and depth of questions it can answer, it still has most of the main functionality expected of the assistant. Specifically: 1) it warns the user when a measurement from the telemetry feed exceed some user-defined thresholds; 2) it also performs statistical analysis on the data stream and warns the user if the measurement is considered an outlier, even if it is within the user-defined thresholds; 3) once an anomaly is detected (e.g., increase in CO2 concentration), it provides the users with some possible root causes for it, in order of likelihood (e.g., a malfunctioning filter or a leak); 4) once the anomaly has been diagnosed (faulty filter), it provides the user with relevant procedures to reconfigure the system (how to change the CO2 filter).

In parallel of the software development, we have also started planning and designing the experiments. We are currently getting ready to start a pilot experiment at Texas A&M in order to refine both the software implementation and the experimental design. The results of this pilot experiment will help us get ready for the HERA campaign, which is currently scheduled for August 2020.

Since the beginning of the project, we have been working with the HERA team to prepare for the experiment. We obtained approval from the Institutional Review Boards of both NASA and Texas A&M. We provided our input to the experiment support specialist to create our science requirements document and are currently working on the investigator working group, getting ready for a software delivery to NASA by March.

We have also been in constant communication with the rest of the VNSCOR to ensure compatibility of our experiments and try to make the most of any potential synergies.

Finally, in terms of publications, we are about to present a peer-reviewed full paper at the 2020 American Institute of Aeronautics and Astronautics (AIAA) Scitech conference. We also have two poster presentations at the 2020 Human Research Program workshop later this month.

In the next few months, we will keep adding functionality to the assistant so that it can answer more questions better and provide more useful information to the user. We anticipate that some rework of the telemetry feed interface will be necessary to adapt to some changes on the HERA side. We will also refine the user interface with the results from the pilot. We are getting ready to conduct a preliminary design review in March before delivery to HERA. Between March and August 2020 we will work on integrating our software with HERA. We will conduct a Critical Design Review in June 2020 and an Operational Readiness Review in August 2020, right before ingress. Between August 2020 and September 2021 we will support the 4 missions of the C6 HERA campaign.

Bibliography:

Description: (Last Updated: 02/21/2025)

Show Cumulative Bibliography

Papers from Meeting Proceedings

Dutta P, Balcells Quintana O, Viros A, Whittle RS, Poonampreet KJ, Beebe N, Dunbar BJ, Wong RK, Diaz-Artiles A, Diaz-Artiles SD, Ana Selva D. "Virtual Assistant for Anomaly Treatment in Long Duration Exploration Missions." Presented at the 2020 AIAA SciTech Forum, Orlando, FL, January 6-10, 2020.

Paper AIAA 2020-2255. 2020 AIAA SciTech Forum, Orlando, FL, January 6-10, 2020. https://doi.org/10.2514/6.2020-2255 , Jan-2020

Back to Search Results

Project Title: HCAAM VNSCOR: Virtual Assistant for Spacecraft Anomaly Treatment During Long Duration Exploration Missions

Reduce

Images:

Fiscal Year: FY 2019
Division: Human Research
Research Discipline/Element:
HRP HFBP:Human Factors & Behavioral Performance (IRP Rev H)

Start Date: 03/06/2019
End Date: 03/05/2023
Task Last Updated: 04/22/2019

Download Task Book report in PDF pdf

Principal Investigator/Affiliation: Selva, Daniel Ph.D. / Texas A&M University

Address:

Aerospace Engineering Department
701 Ross St 3141 TAMU
College Station , TX 77843-0001

Email: dselva@tamu.edu
Phone: 607-255-6351
Congressional District: 17
Web:

Organization Type: UNIVERSITY
Organization Name: Texas A&M University
Joint Agency:
Comments:

Co-Investigator(s)
Affiliation:

Diaz Artiles, Ana Ph.D.	Texas A&M Engineering Experiment Station
Dunbar, Bonnie Ph.D.	Texas A&M Engineering Experiment Station
Wong, Raymond Ka Wai Ph.D.	Texas A & M, College Station

Project Information: Grant/Contract No. 80NSSC19K0656

Responsible Center: NASA JSC Grant Monitor: Williams, Thomas Center Contact: 281-483-8773 thomas.j.will1@nasa.gov Unique ID: 12299	Solicitation / Funding Source: 2017-2018 HERO 80JSC017N0001-BPBA Topics in Biological, Physiological, and Behavioral Adaptations to Spaceflight. Appendix C Grant/Contract No.: 80NSSC19K0656 Project Type: Ground Flight Program: TechPort: Yes	No. of Post Docs: No. of PhD Candidates: No. of Master's Candidates: No. of Bachelor's Candidates:	No. of PhD Degrees: No. of Master's Degrees: No. of Bachelor's Degrees:

Human Research Program Elements:	(1) HFBP:Human Factors & Behavioral Performance (IRP Rev H)

Human Research Program Risks:	(1) HSIA:Risk of Adverse Outcomes Due to Inadequate Human Systems Integration Architecture

Human Research Program Gaps:	(1) HSIA-101:We need to identify the Human Systems Integration (HSI) – relevant crew health and performance outcomes, measures, and metrics, needed to characterize and mitigate risk, for future exploration missions. (2) HSIA-201:We need to evaluate the demands of future exploration habitat/vehicle systems and mission scenarios (e.g. increased automation, multi-modal communication) on individuals and teams, and determine the risks these demands pose to crew health and performance. (3) HSIA-401:We need to determine how HSI can be applied in the vehicle/habitat and computer interface Design Phase to mitigate potential decrements in operationally-relevant performance (e.g. problem-solving, execution procedures), during increasingly earth-independent, future exploration missions (including in-mission and at landing). (4) HSIA-701:We need to determine how human-automation-robotic systems can be optimized for effective enhancement and monitoring of crew capabilities, health, and performance, during increasingly earth-independent, future exploration missions (including in-mission and at landing).

Task Description:	This task is part of the Human Capabilities Assessments for Autonomous Missions (HCAAM) Virtual NASA Specialized Center of Research (VNSCOR). The research objective of this proposal is to investigate the impact of using Virtual Assistants (VA) to support crew members in the context of anomaly treatment during Long Duration Exploration Missions (LDEM), when ground support will be limited. A VA will be developed building upon the software architecture from an existing VA developed by the Principal Investigator. The VA will provide support for various aspects of anomaly treatment, including detecting and diagnosing the anomaly, as well as recommending a course of action. It will also have the ability to take initiative in the dialog with the user (mixed-initiative mode), and the ability to provide explanations for its actions. The impact of the VA on performance, cognitive workload (CW), situational awareness (SA), and trust, will be assessed through a set of three experiments with human subjects in a laboratory environment. The first experiment will establish the baseline impact (master-slave, no explanations), and subsequent experiments will study the effect of switching to the mixed-initiative mode and adding explanations. Finally, the system will be deployed in an analog environment.

Research Impact/Earth Benefits:

Task Progress & Bibliography Information FY2019

Task Progress:

New project for FY2019.

Bibliography:

Description: (Last Updated: 02/21/2025)

Show Cumulative Bibliography

None in FY 2019

Back to Search Results

Menu