[PDF] ExTRA: Explainable Therapy-Related Annotations





Previous PDF Next PDF



Calculation of Chi-Square to Test the No Three-Factor Interaction

CALCULATION OF CHI-SQUARE TO TEST THE NO. THREE-FACTOR INTERACTION HYPOTHESIS. MARVIN A. KASTENBAUM. Mathematics Panel Oak Ridge National Laboratory.



An interactive FORTRAN IV program for calculating aspects of

interactive program (see Appendix) for helping plan experiments with dichotomous data when the usual method of analysis is chi square.



The making of the Fittest: Natural Selection and Adaptation

Click on the interactive stickleback fish. Describe where its spines are For each chi-square calculation how many degrees of freedom are there? df=1.



Gene-gene Interaction Analysis by IAC (Interaction Analysis by Chi

A chi-square test is done by pooling high-risk interaction counts (dominant- dominant) and low risk (recessive-recessive) interaction counts to calculate 



Is Interactive Open Access Publishing Able to Identify High-Impact

6 Oct 2010 respectively using Pearson's chi-square test (Agresti



Interactive Annotation Learning with Indirect Feature Voting

as mutual information and chi-square have often been used to identify the most discriminant features. (Manning et al. 2008). However





Implementation of Chi Square Automatic Interaction Detection

The best independent variable that will form the first branch in the resulting tree diagram. Before the process of calculating the CHAID algorithm [10] divides 



Interaction Tests for 2 × s × t Contingency Tables

Calculation of chi-square to test the no three-factor interaction hypothesis. Biometrics 15 107-115. LANCASTER



ExTRA: Explainable Therapy-Related Annotations

Proceedings of the 2nd Workshop on Interactive Natural Language Technology for Explainable node the Chi-square test for association is applied.



Social Science Statistics - PSY 210: Basic Statistics for

Chi-Square Test Calculator This is a easy chi-square calculator for a contingency table that has up to five rows and five columns (for alternative chi-square calculators see the column to your right) The calculation takes three steps allowing you to see how the chi-square statistic is calculated



Chi Square Test Online Tool - [100% Verified]

•The most popular and commonly used approach of nonparametrics is called chi-square (?2) • Our use of the test will always involve testing hypotheses about frequencies (although ?2 has other uses) • The two main uses of chi-square are called goodness-of-fit and test for independence



Chi-Square Effect Size Calculator - NCSS

Chi-Square Effect Size Calculator Introduction This procedure calculates the effect size of the Chi-square test Based on your input the procedure provides effect size estimates for Chi-square goodness-of-fit tests and for Chi-square tests of independence



Social Science Statistics - PSY 210: Basic Statistics for

Chi-Square Calculator for Goodness of Fit This is a chi-square calculator for goodness of fit (for alternative chi-square calculators see the column to your right) Explanation The first stage is to enter category information into the text boxes below (this calculator allows up to five categories - or levels - but fewer is fine)



28 Chi-square test for goodness of fit on the calculator

28 Chi-square test for goodness of fit on the calculator You can use the TI-Nspire to perform the calculations for a chi-square test for goodness of fit We'll use the data from the hockey and birthdays example to illustrate the steps 1 Enter the observed counts and expected counts in two separate columns in a Lists & Spreadsheet page



Searches related to interactive chi square calculator filetype:pdf

Calculate ” and the calculator will generate the chi-square statistic the degrees of freedom (df) and the p-value Cate- gory Alber Camil Jimm Susar Observed Frequency 100 90 115 95 Reset Expected Frequency 100 100 100 100 Calculate Expected Proportion Percentage Deviation 0 -10 +15 -5 Standardized Residuals Sums: Observed

How do I use the chi-square calculator?

    You can use this chi-square calculator as part of a statistical analysis test to determine if there is a significant difference between observed and expected frequencies. To use the calculator, simply input the true and expected values (on separate lines) and click on the "Calculate" button to generate the results.

How many steps does it take to calculate the chi-square?

    The calculation takes three steps, allowing you to see how the chi-square statistic is calculated. Chi Square Calculator for 2x2 This simple chi-square calculator tests for association between two categorical variables - for example, sex (males and females).

Is the chi square test online effective?

    After all, the chi square test online is simple and effective and allows you to analyze categorical data (data that can be divided into categories). Take a look at the best statistics calculators. One of the things that you need to understand about the chi square test online is that it isn’t suited to work with …

How to perform a chi-square test for goodness of fit?

    Perform a chi-square test for goodness of fit. page. Name the columns and dialogue box will appear. Enter the values as shown in the box below.e to and press ·. spreadsheet containing the test statistic,P-value,and df. If you check theShade P value marked and shaded area corresponding to theP-value.

Proceedings of the 2nd Workshop on Interactive Natural Language Technology for Explainable Artificial Intelligence (NL4XAI 2020), pages 11-15,Dublin, Ireland, 18 December 2020.c

2020 Association for Computational LinguisticsExTRA: Explainable Therapy-Related Annotations

Mat Rawsthorne

University of NottinghamTahseen Jilani

HDR-UK

Digital Research Service

University of NottinghamJacob Andrews

University of NottinghamYunfei Long

University of Essex

J

´er´emie Clos

University of NottinghamSam Malins

Institute of Mental Health

University of NottinghamDaniel Hunt

University of Nottingham

Abstract

In this paper we report progress on a novel

explainable artificial intelligence (XAI) ini- tiative applying Natural Language Processing (NLP) with elements of co-design to develop a text classifier for application in psychother- apy training and practice. The task is to pro- duce a tool that will automatically label psy- chotherapy transcript text with levels of inter- action for patient activation in known psycho- logical processes. The purpose is to enable therapists to review the effectiveness of their therapy session content. We use XAI to in- crease trust in the model"s suggestions and pre- dictions of the client"s outcome trajectory. Af- ter pre-processing of the language features ex- tracted from professionally annotated therapy session transcripts, we apply a supervised ma- chine learning approach (CHAID) to classify interaction labels (negative, neutral or positive in terms of patient activation). Weighted sam- ples are used to overcome class imbalanced data. The results show this initial model can make useful distinctions among the three la- bels of patient activation with 74% accuracy and provide insight into its reasoning. This on- going project will additionally evaluate which

XAI approaches are best for increasing the

transparency of the tool to end users and ex- plore whether direct involvement of stakehold- ers improves usability of the XAI interface and therefore trust in the solution.

1 IntroductionIt takes a lot of manual effort to quality-assure

psychotherapy sessions (

Tseng et al.

2017
), and therefore assessments of quality are rarely used routinely in psychotherapy practice. This work seeks to produce a tool that can automatically code psychotherapy transcripts, in line with a coding scheme developed by psychotherapists, known to characterise predictors of recovery (

Malins et al.

2020a
). The tool is also being developed to present explanations of the reasons for the coding deci- sions it makes. Explaining algorithms to those taking actions based on their outputs is recognised as good practice in data-driven health and care tech- nology ( DHSC 2019
). The ExTRA-PPOLATE1 project is the first step in building tools to optimise scarce resources for provision of mental healthcare

Lorenzo-Luaces et al.

2017
) by enabling thera- pists to adhere to good practice (

Waller and Turner

2016
) and deliver care tailored to the patient ( Del- gadillo et al. 2016

2 Overall Aims of Programme

The long-term objectives that we aim to achieve

throughout our programme are threefold: Aim 1

To build the foundation for unobtrusive,

objective, transdiagnostic measures of patient acti- vation. Aim 2

To understand the practical trade-offs be-

tween classifier accuracy and explainability. Aim 3

To explore the relationship amongst co-

production, transparency and trust in algorithm- informed clinical decision making.

3 Methods

Core project team members were separately sur-

veyed as to their initial hypotheses for key lan- guage markers of client-therapist interaction that is deemed helpful, focusing on generating features from different perspectives (see Table 1 ). These were then reviewed by the whole team and coded into a Python script to extract them from a cor- pus of transcripts of 120 health anxiety sessions.

This created a simple model for identifying key

interaction-types of interest (engagement in partic- ular types of conversation) which are predictive of1 Explainable Therapy Related Annotations: Patient & Practitioner Oriented Learning Assisting Trust & Engagement11 clinical outcomes (Malins et al.,2020a ) and could be compared to detailed labels that had been ap- plied to the data by specialist raters in nVivo using the Clinical Interaction Coding Scheme (CICS)

Malins et al.

2020b
). Further information on this coding scheme is provided in Appendix A.

4 Data Analysis

Data distribution and model selection

The data was skewed, and it was necessary to collapse some similar categories to ensure sufficient rep- resentation. We employed Chi-square Automatic

Interaction Detection (CHAID), a type of decision

tree (DT) classification model that can handle both categorical and numeric data sets. It does not re- quire common statistical assumptions such as nor- mality and non-collinearity ( Kass 1980
). For im- balanced data, DT models allow weighting samples according to their importance. A sub-category of the outcome variable having smaller number of samples is assigned higher weight than as compare to other category with larger number of samples.

Since positive and neutral category ratings were

more common in the dataset than negative ratings, negatively categorised data were weighted for bal- ance.

How Decision Trees Work

DT models work by

recursively partitioning the samples into a number of subsets. The starting node (at the top of the tree) is termed as "root". Any node with outgoing nodes is termed as an internal node, while the nodes with- out further branches are called "leaves". At each node, the Chi-square test for association is applied and the variable having the strongest association with the outcome variable is selected for further split into leaves. The chosen variable is the one that expresses the strongest discrimination between the different levels of outcome variable. The algo- rithm keeps dividing the full data set into subsets using the depth-first approach until the stopping criterion is not met (

Magidson

1994

Validation

For internal validation of the model

or when no validation data set is available, the model can perform K-fold cross validation. Finally, the results from different K folds were merged to produce a single DT estimation. DT models also offer tree pruning and feature selection based on the Chi-Squared test to prevent overfitting of the model. A "minimum cases" criteria is used for de- ciding further split of a branch. Discrimination of the original and cross-validated models was evalu- ated through the generation of Receiver Operating

Characteristic (ROC) curves and calculation of C-

statistics.

5 Initial Findings

CHAID label classification results are summarised

in the classification matrix in Table 2. The over- all accuracy of the model was 74% with the high- est correct sample classified in the Neutral cate- gory. There were a total of 681 negative labels out of a total of 25,823 samples (2.6%). Of these,

60.4% were correctly classified. A larger total of

16,713 samples were recorded for positive labels

(64.7% of the total), with a correct classification rate of 69.5%. The performance of the classifi- cation could be further enhanced through a more detailed exploration of the language features from the session transcripts, using improved oversam- pling techniques such as SMOTE and deeper ma- chine learning modelling such as random forest and convolutional neural networks. Furthermore, the interdisciplinary engagement with the data has already helped deepen understanding of both the

CICS framework and the classifier model (

P´aez,

2019
) and generated ideas for their refinement.

6 Tool Development

The project uses a fusion of techniques to apply

Responsible Research & Innovation (RRI) to the

tool"s development, specifically:

Incorporating a range of perspectives

at mul- tiple levels: The core project team combines the lived experience of a Service User Researcher and Involvement Volunteers (skilled in instrumentation design and plain English summaries) from the Institute of Mental Health, with specialist Clinical

Psychology knowledge, Statistical Machine Learn-

ing, Psychometrics, Computer Science and Corpus

Linguistics expertise. This diversity of experts

in the formal and informal language of mental health provide triangulation to ensure the methods and findings make sense (

Ernala et al.

2019

Additionally we engaged a Patient & Practitioner

Reference Group (PPRG), comprised of 12 people,

balanced across key stakeholder groups: patients and carers, clinical psychologists, therapy trainers, and mental health service managers. Dissemina- tion will be via interactive "roadshow" events with

PPRG peer groups to gauge whether they feel the12

Perspective Feature Impact Coding

Patient absolute words, profanity negative customised dictionaries Clinician positive sentiment positive valence and polarity Linguist first person pronouns negative ratio singular:plural

NLP researcher utterance length positive word, character countsTable 1: Table Examples of Candidate Language Features

Perspective: professional alignment of the core project team member suggesting the language feature.

Impact: expected relationship between the feature and level of patient engagement in the interaction.

Coding: method used to extract from the text using Python [details available from authors on request].Observed Predicted Negative Predicted Neutral Predicted Positive Percent Correct

Negative 411 94 176 60.4%

Neutral 99 7,766 564 92.1%

Positive 1,223 3,871 11,619 69.5%

Overall Percentage6.7 45.4 47.974%Table 2: Initial Results for Classification of Level of Clinical Engagementco-design process adds to the credibility of the tool.

Agile Science Approach(Hekler et al.,2016 )

Repeated engagement with end-users is intended

to build trust ( Carr 2020
) and emulates industry best practice. The project leverages specialist support from a social enterprise2on coproduction aspects (

Hickey et al.

2018
), and a digital healthquotesdbs_dbs17.pdfusesText_23
[PDF] interactive louvre map

[PDF] interactive pdf javascript

[PDF] interactive rail map of germany

[PDF] interactive reader and study guide world history answers

[PDF] interactive teaching techniques

[PDF] interchange 5th edition pdf

[PDF] intercompany inventory transactions solutions

[PDF] intercompany profit elimination example

[PDF] intercompany sale of land

[PDF] interest rate benchmark reform

[PDF] interest rate benchmark reform (amendments to ifrs 9 ias 39 and ifrs 7)

[PDF] interest rate benchmark reform phase 2

[PDF] interest rate benchmark reform ey

[PDF] interest rate benchmark reform iasb

[PDF] interest rate benchmark reform pwc