Weekend batch
Avijeet is a Senior Research Analyst at Simplilearn. Passionate about Data Analytics, Machine Learning, and Deep Learning, Avijeet is also interested in politics, cricket, and football.
Free eBook: Top Programming Languages For A Data Scientist
Normality Test in Minitab: Minitab with Statistics
Machine Learning Career Guide: A Playbook to Becoming a Machine Learning Engineer
The bottom line.
Hypothesis testing, sometimes called significance testing, is an act in statistics whereby an analyst tests an assumption regarding a population parameter. The methodology employed by the analyst depends on the nature of the data used and the reason for the analysis.
Hypothesis testing is used to assess the plausibility of a hypothesis by using sample data. Such data may come from a larger population or a data-generating process. The word "population" will be used for both of these cases in the following descriptions.
In hypothesis testing, an analyst tests a statistical sample, intending to provide evidence on the plausibility of the null hypothesis. Statistical analysts measure and examine a random sample of the population being analyzed. All analysts use a random population sample to test two different hypotheses: the null hypothesis and the alternative hypothesis.
The null hypothesis is usually a hypothesis of equality between population parameters; e.g., a null hypothesis may state that the population mean return is equal to zero. The alternative hypothesis is effectively the opposite of a null hypothesis. Thus, they are mutually exclusive , and only one can be true. However, one of the two hypotheses will always be true.
The null hypothesis is a statement about a population parameter, such as the population mean, that is assumed to be true.
If an individual wants to test that a penny has exactly a 50% chance of landing on heads, the null hypothesis would be that 50% is correct, and the alternative hypothesis would be that 50% is not correct. Mathematically, the null hypothesis is represented as Ho: P = 0.5. The alternative hypothesis is shown as "Ha" and is identical to the null hypothesis, except with the equal sign struck-through, meaning that it does not equal 50%.
A random sample of 100 coin flips is taken, and the null hypothesis is tested. If it is found that the 100 coin flips were distributed as 40 heads and 60 tails, the analyst would assume that a penny does not have a 50% chance of landing on heads and would reject the null hypothesis and accept the alternative hypothesis.
If there were 48 heads and 52 tails, then it is plausible that the coin could be fair and still produce such a result. In cases such as this where the null hypothesis is "accepted," the analyst states that the difference between the expected results (50 heads and 50 tails) and the observed results (48 heads and 52 tails) is "explainable by chance alone."
Some statisticians attribute the first hypothesis tests to satirical writer John Arbuthnot in 1710, who studied male and female births in England after observing that in nearly every year, male births exceeded female births by a slight proportion. Arbuthnot calculated that the probability of this happening by chance was small, and therefore it was due to “divine providence.”
Hypothesis testing helps assess the accuracy of new ideas or theories by testing them against data. This allows researchers to determine whether the evidence supports their hypothesis, helping to avoid false claims and conclusions. Hypothesis testing also provides a framework for decision-making based on data rather than personal opinions or biases. By relying on statistical analysis, hypothesis testing helps to reduce the effects of chance and confounding variables, providing a robust framework for making informed conclusions.
Hypothesis testing relies exclusively on data and doesn’t provide a comprehensive understanding of the subject being studied. Additionally, the accuracy of the results depends on the quality of the available data and the statistical methods used. Inaccurate data or inappropriate hypothesis formulation may lead to incorrect conclusions or failed tests. Hypothesis testing can also lead to errors, such as analysts either accepting or rejecting a null hypothesis when they shouldn’t have. These errors may result in false conclusions or missed opportunities to identify significant patterns or relationships in the data.
Hypothesis testing refers to a statistical process that helps researchers determine the reliability of a study. By using a well-formulated hypothesis and set of statistical tests, individuals or businesses can make inferences about the population that they are studying and draw conclusions based on the data presented. All hypothesis testing methods have the same four-step process, which includes stating the hypotheses, formulating an analysis plan, analyzing the sample data, and analyzing the result.
Sage. " Introduction to Hypothesis Testing ," Page 4.
Elder Research. " Who Invented the Null Hypothesis? "
Formplus. " Hypothesis Testing: Definition, Uses, Limitations and Examples ."
Hypothesis testing involves formulating assumptions about population parameters based on sample statistics and rigorously evaluating these assumptions against empirical evidence. This article sheds light on the significance of hypothesis testing and the critical steps involved in the process.
A hypothesis is an assumption or idea, specifically a statistical claim about an unknown population parameter. For example, a judge assumes a person is innocent and verifies this by reviewing evidence and hearing testimony before reaching a verdict.
Hypothesis testing is a statistical method that is used to make a statistical decision using experimental data. Hypothesis testing is basically an assumption that we make about a population parameter. It evaluates two mutually exclusive statements about a population to determine which statement is best supported by the sample data.
To test the validity of the claim or assumption about the population parameter:
Example: You say an average height in the class is 30 or a boy is taller than a girl. All of these is an assumption that we are assuming, and we need some statistical way to prove these. We need some mathematical conclusion whatever we are assuming is true.
Hypothesis testing is an important procedure in statistics. Hypothesis testing evaluates two mutually exclusive population statements to determine which statement is most supported by sample data. When we say that the findings are statistically significant, thanks to hypothesis testing.
One tailed test focuses on one direction, either greater than or less than a specified value. We use a one-tailed test when there is a clear directional expectation based on prior knowledge or theory. The critical region is located on only one side of the distribution curve. If the sample falls into this critical region, the null hypothesis is rejected in favor of the alternative hypothesis.
There are two types of one-tailed test:
A two-tailed test considers both directions, greater than and less than a specified value.We use a two-tailed test when there is no specific directional expectation, and want to detect any significant difference.
Example: H 0 : [Tex]\mu = [/Tex] 50 and H 1 : [Tex]\mu \neq 50 [/Tex]
To delve deeper into differences into both types of test: Refer to link
In hypothesis testing, Type I and Type II errors are two possible errors that researchers can make when drawing conclusions about a population based on a sample of data. These errors are associated with the decisions made regarding the null hypothesis and the alternative hypothesis.
Null Hypothesis is True | Null Hypothesis is False | |
---|---|---|
Null Hypothesis is True (Accept) | Correct Decision | Type II Error (False Negative) |
Alternative Hypothesis is True (Reject) | Type I Error (False Positive) | Correct Decision |
Step 1: define null and alternative hypothesis.
State the null hypothesis ( [Tex]H_0 [/Tex] ), representing no effect, and the alternative hypothesis ( [Tex]H_1 [/Tex] ), suggesting an effect or difference.
We first identify the problem about which we want to make an assumption keeping in mind that our assumption should be contradictory to one another, assuming Normally distributed data.
Select a significance level ( [Tex]\alpha [/Tex] ), typically 0.05, to determine the threshold for rejecting the null hypothesis. It provides validity to our hypothesis test, ensuring that we have sufficient data to back up our claims. Usually, we determine our significance level beforehand of the test. The p-value is the criterion used to calculate our significance value.
Gather relevant data through observation or experimentation. Analyze the data using appropriate statistical methods to obtain a test statistic.
The data for the tests are evaluated in this step we look for various scores based on the characteristics of data. The choice of the test statistic depends on the type of hypothesis test being conducted.
There are various hypothesis tests, each appropriate for various goal to calculate our test. This could be a Z-test , Chi-square , T-test , and so on.
We have a smaller dataset, So, T-test is more appropriate to test our hypothesis.
T-statistic is a measure of the difference between the means of two groups relative to the variability within each group. It is calculated as the difference between the sample means divided by the standard error of the difference. It is also known as the t-value or t-score.
In this stage, we decide where we should accept the null hypothesis or reject the null hypothesis. There are two ways to decide where we should accept or reject the null hypothesis.
Comparing the test statistic and tabulated critical value we have,
Note: Critical values are predetermined threshold values that are used to make a decision in hypothesis testing. To determine critical values for hypothesis testing, we typically refer to a statistical distribution table , such as the normal distribution or t-distribution tables based on.
We can also come to an conclusion using the p-value,
Note : The p-value is the probability of obtaining a test statistic as extreme as, or more extreme than, the one observed in the sample, assuming the null hypothesis is true. To determine p-value for hypothesis testing, we typically refer to a statistical distribution table , such as the normal distribution or t-distribution tables based on.
At last, we can conclude our experiment using method A or B.
To validate our hypothesis about a population parameter we use statistical functions . We use the z-score, p-value, and level of significance(alpha) to make evidence for our hypothesis for normally distributed data .
When population means and standard deviations are known.
[Tex]z = \frac{\bar{x} – \mu}{\frac{\sigma}{\sqrt{n}}}[/Tex]
T test is used when n<30,
t-statistic calculation is given by:
[Tex]t=\frac{x̄-μ}{s/\sqrt{n}} [/Tex]
Chi-Square Test for Independence categorical Data (Non-normally distributed) using:
[Tex]\chi^2 = \sum \frac{(O_{ij} – E_{ij})^2}{E_{ij}}[/Tex]
Let’s examine hypothesis testing using two real life situations,
Imagine a pharmaceutical company has developed a new drug that they believe can effectively lower blood pressure in patients with hypertension. Before bringing the drug to market, they need to conduct a study to assess its impact on blood pressure.
Let’s consider the Significance level at 0.05, indicating rejection of the null hypothesis.
If the evidence suggests less than a 5% chance of observing the results due to random variation.
Using paired T-test analyze the data to obtain a test statistic and a p-value.
The test statistic (e.g., T-statistic) is calculated based on the differences between blood pressure measurements before and after treatment.
t = m/(s/√n)
then, m= -3.9, s= 1.8 and n= 10
we, calculate the , T-statistic = -9 based on the formula for paired t test
The calculated t-statistic is -9 and degrees of freedom df = 9, you can find the p-value using statistical software or a t-distribution table.
thus, p-value = 8.538051223166285e-06
Step 5: Result
Conclusion: Since the p-value (8.538051223166285e-06) is less than the significance level (0.05), the researchers reject the null hypothesis. There is statistically significant evidence that the average blood pressure before and after treatment with the new drug is different.
Let’s create hypothesis testing with python, where we are testing whether a new drug affects blood pressure. For this example, we will use a paired T-test. We’ll use the scipy.stats library for the T-test.
Scipy is a mathematical library in Python that is mostly used for mathematical equations and computations.
We will implement our first real life problem via python,
import numpy as np from scipy import stats # Data before_treatment = np . array ([ 120 , 122 , 118 , 130 , 125 , 128 , 115 , 121 , 123 , 119 ]) after_treatment = np . array ([ 115 , 120 , 112 , 128 , 122 , 125 , 110 , 117 , 119 , 114 ]) # Step 1: Null and Alternate Hypotheses # Null Hypothesis: The new drug has no effect on blood pressure. # Alternate Hypothesis: The new drug has an effect on blood pressure. null_hypothesis = "The new drug has no effect on blood pressure." alternate_hypothesis = "The new drug has an effect on blood pressure." # Step 2: Significance Level alpha = 0.05 # Step 3: Paired T-test t_statistic , p_value = stats . ttest_rel ( after_treatment , before_treatment ) # Step 4: Calculate T-statistic manually m = np . mean ( after_treatment - before_treatment ) s = np . std ( after_treatment - before_treatment , ddof = 1 ) # using ddof=1 for sample standard deviation n = len ( before_treatment ) t_statistic_manual = m / ( s / np . sqrt ( n )) # Step 5: Decision if p_value <= alpha : decision = "Reject" else : decision = "Fail to reject" # Conclusion if decision == "Reject" : conclusion = "There is statistically significant evidence that the average blood pressure before and after treatment with the new drug is different." else : conclusion = "There is insufficient evidence to claim a significant difference in average blood pressure before and after treatment with the new drug." # Display results print ( "T-statistic (from scipy):" , t_statistic ) print ( "P-value (from scipy):" , p_value ) print ( "T-statistic (calculated manually):" , t_statistic_manual ) print ( f "Decision: { decision } the null hypothesis at alpha= { alpha } ." ) print ( "Conclusion:" , conclusion )
T-statistic (from scipy): -9.0 P-value (from scipy): 8.538051223166285e-06 T-statistic (calculated manually): -9.0 Decision: Reject the null hypothesis at alpha=0.05. Conclusion: There is statistically significant evidence that the average blood pressure before and after treatment with the new drug is different.
In the above example, given the T-statistic of approximately -9 and an extremely small p-value, the results indicate a strong case to reject the null hypothesis at a significance level of 0.05.
Data: A sample of 25 individuals is taken, and their cholesterol levels are measured.
Cholesterol Levels (mg/dL): 205, 198, 210, 190, 215, 205, 200, 192, 198, 205, 198, 202, 208, 200, 205, 198, 205, 210, 192, 205, 198, 205, 210, 192, 205.
Populations Mean = 200
Population Standard Deviation (σ): 5 mg/dL(given for this problem)
As the direction of deviation is not given , we assume a two-tailed test, and based on a normal distribution table, the critical values for a significance level of 0.05 (two-tailed) can be calculated through the z-table and are approximately -1.96 and 1.96.
The test statistic is calculated by using the z formula Z = [Tex](203.8 – 200) / (5 \div \sqrt{25}) [/Tex] and we get accordingly , Z =2.039999999999992.
Step 4: Result
Since the absolute value of the test statistic (2.04) is greater than the critical value (1.96), we reject the null hypothesis. And conclude that, there is statistically significant evidence that the average cholesterol level in the population is different from 200 mg/dL
import scipy.stats as stats import math import numpy as np # Given data sample_data = np . array ( [ 205 , 198 , 210 , 190 , 215 , 205 , 200 , 192 , 198 , 205 , 198 , 202 , 208 , 200 , 205 , 198 , 205 , 210 , 192 , 205 , 198 , 205 , 210 , 192 , 205 ]) population_std_dev = 5 population_mean = 200 sample_size = len ( sample_data ) # Step 1: Define the Hypotheses # Null Hypothesis (H0): The average cholesterol level in a population is 200 mg/dL. # Alternate Hypothesis (H1): The average cholesterol level in a population is different from 200 mg/dL. # Step 2: Define the Significance Level alpha = 0.05 # Two-tailed test # Critical values for a significance level of 0.05 (two-tailed) critical_value_left = stats . norm . ppf ( alpha / 2 ) critical_value_right = - critical_value_left # Step 3: Compute the test statistic sample_mean = sample_data . mean () z_score = ( sample_mean - population_mean ) / \ ( population_std_dev / math . sqrt ( sample_size )) # Step 4: Result # Check if the absolute value of the test statistic is greater than the critical values if abs ( z_score ) > max ( abs ( critical_value_left ), abs ( critical_value_right )): print ( "Reject the null hypothesis." ) print ( "There is statistically significant evidence that the average cholesterol level in the population is different from 200 mg/dL." ) else : print ( "Fail to reject the null hypothesis." ) print ( "There is not enough evidence to conclude that the average cholesterol level in the population is different from 200 mg/dL." )
Reject the null hypothesis. There is statistically significant evidence that the average cholesterol level in the population is different from 200 mg/dL.
Hypothesis testing stands as a cornerstone in statistical analysis, enabling data scientists to navigate uncertainties and draw credible inferences from sample data. By systematically defining null and alternative hypotheses, choosing significance levels, and leveraging statistical tests, researchers can assess the validity of their assumptions. The article also elucidates the critical distinction between Type I and Type II errors, providing a comprehensive understanding of the nuanced decision-making process inherent in hypothesis testing. The real-life example of testing a new drug’s effect on blood pressure using a paired T-test showcases the practical application of these principles, underscoring the importance of statistical rigor in data-driven decision-making.
1. what are the 3 types of hypothesis test.
There are three types of hypothesis tests: right-tailed, left-tailed, and two-tailed. Right-tailed tests assess if a parameter is greater, left-tailed if lesser. Two-tailed tests check for non-directional differences, greater or lesser.
Null Hypothesis ( [Tex]H_o [/Tex] ): No effect or difference exists. Alternative Hypothesis ( [Tex]H_1 [/Tex] ): An effect or difference exists. Significance Level ( [Tex]\alpha [/Tex] ): Risk of rejecting null hypothesis when it’s true (Type I error). Test Statistic: Numerical value representing observed evidence against null hypothesis.
Statistical method to evaluate the performance and validity of machine learning models. Tests specific hypotheses about model behavior, like whether features influence predictions or if a model generalizes well to unseen data.
Pytest purposes general testing framework for Python code while Hypothesis is a Property-based testing framework for Python, focusing on generating test cases based on specified properties of the code.
Similar reads.
Generate hypotheses for your research.
Ai sentence rephraser, ai article outliner, ai writing style adapter.
Volume 48, 1997, review article, creative hypothesis generating in psychology: some useful heuristics.
To correct a common imbalance in methodology courses, focusing almost entirely on hypothesis-testing issues to the neglect of hypothesis-generating issues which are at least as important, 49 creative heuristics are described, divided into 5 categories and 14 subcategories. Each of these heuristics has often been used to generate hypotheses in psychological research, and each is teachable to students. The 49 heuristics range from common sense perceptiveness of the oddity of natural occurrences to use of sophisticated quantitative data analyses in ways that provoke new insights.
Article metrics loading...
Full text loading...
Literature Cited
Data & Media loading...
Most cited most cited rss feed, job burnout, executive functions, social cognitive theory: an agentic perspective, on happiness and human potentials: a review of research on hedonic and eudaimonic well-being, sources of method bias in social science research and recommendations on how to control it, mediation analysis, missing data analysis: making it work in the real world, grounded cognition, personality structure: emergence of the five-factor model, motivational beliefs, values, and goals.
Hypothesis Definition, Format, Examples, and Tips
Verywell / Alex Dos Diaz
Falsifiability of a hypothesis.
Hypotheses examples.
A hypothesis is a tentative statement about the relationship between two or more variables. It is a specific, testable prediction about what you expect to happen in a study. It is a preliminary answer to your question that helps guide the research process.
Consider a study designed to examine the relationship between sleep deprivation and test performance. The hypothesis might be: "This study is designed to assess the hypothesis that sleep-deprived people will perform worse on a test than individuals who are not sleep-deprived."
A hypothesis is crucial to scientific research because it offers a clear direction for what the researchers are looking to find. This allows them to design experiments to test their predictions and add to our scientific knowledge about the world. This article explores how a hypothesis is used in psychology research, how to write a good hypothesis, and the different types of hypotheses you might use.
In the scientific method , whether it involves research in psychology, biology, or some other area, a hypothesis represents what the researchers think will happen in an experiment. The scientific method involves the following steps:
The hypothesis is a prediction, but it involves more than a guess. Most of the time, the hypothesis begins with a question which is then explored through background research. At this point, researchers then begin to develop a testable hypothesis.
Unless you are creating an exploratory study, your hypothesis should always explain what you expect to happen.
In a study exploring the effects of a particular drug, the hypothesis might be that researchers expect the drug to have some type of effect on the symptoms of a specific illness. In psychology, the hypothesis might focus on how a certain aspect of the environment might influence a particular behavior.
Remember, a hypothesis does not have to be correct. While the hypothesis predicts what the researchers expect to see, the goal of the research is to determine whether this guess is right or wrong. When conducting an experiment, researchers might explore numerous factors to determine which ones might contribute to the ultimate outcome.
In many cases, researchers may find that the results of an experiment do not support the original hypothesis. When writing up these results, the researchers might suggest other options that should be explored in future studies.
In many cases, researchers might draw a hypothesis from a specific theory or build on previous research. For example, prior research has shown that stress can impact the immune system. So a researcher might hypothesize: "People with high-stress levels will be more likely to contract a common cold after being exposed to the virus than people who have low-stress levels."
In other instances, researchers might look at commonly held beliefs or folk wisdom. "Birds of a feather flock together" is one example of folk adage that a psychologist might try to investigate. The researcher might pose a specific hypothesis that "People tend to select romantic partners who are similar to them in interests and educational level."
So how do you write a good hypothesis? When trying to come up with a hypothesis for your research or experiments, ask yourself the following questions:
Before you come up with a specific hypothesis, spend some time doing background research. Once you have completed a literature review, start thinking about potential questions you still have. Pay attention to the discussion section in the journal articles you read . Many authors will suggest questions that still need to be explored.
To form a hypothesis, you should take these steps:
In the scientific method , falsifiability is an important part of any valid hypothesis. In order to test a claim scientifically, it must be possible that the claim could be proven false.
Students sometimes confuse the idea of falsifiability with the idea that it means that something is false, which is not the case. What falsifiability means is that if something was false, then it is possible to demonstrate that it is false.
One of the hallmarks of pseudoscience is that it makes claims that cannot be refuted or proven false.
A variable is a factor or element that can be changed and manipulated in ways that are observable and measurable. However, the researcher must also define how the variable will be manipulated and measured in the study.
Operational definitions are specific definitions for all relevant factors in a study. This process helps make vague or ambiguous concepts detailed and measurable.
For example, a researcher might operationally define the variable " test anxiety " as the results of a self-report measure of anxiety experienced during an exam. A "study habits" variable might be defined by the amount of studying that actually occurs as measured by time.
These precise descriptions are important because many things can be measured in various ways. Clearly defining these variables and how they are measured helps ensure that other researchers can replicate your results.
One of the basic principles of any type of scientific research is that the results must be replicable.
Replication means repeating an experiment in the same way to produce the same results. By clearly detailing the specifics of how the variables were measured and manipulated, other researchers can better understand the results and repeat the study if needed.
Some variables are more difficult than others to define. For example, how would you operationally define a variable such as aggression ? For obvious ethical reasons, researchers cannot create a situation in which a person behaves aggressively toward others.
To measure this variable, the researcher must devise a measurement that assesses aggressive behavior without harming others. The researcher might utilize a simulated task to measure aggressiveness in this situation.
The hypothesis you use will depend on what you are investigating and hoping to find. Some of the main types of hypotheses that you might use include:
A hypothesis often follows a basic format of "If {this happens} then {this will happen}." One way to structure your hypothesis is to describe what will happen to the dependent variable if you change the independent variable .
The basic format might be: "If {these changes are made to a certain independent variable}, then we will observe {a change in a specific dependent variable}."
Once a researcher has formed a testable hypothesis, the next step is to select a research design and start collecting data. The research method depends largely on exactly what they are studying. There are two basic types of research methods: descriptive research and experimental research.
Descriptive research such as case studies , naturalistic observations , and surveys are often used when conducting an experiment is difficult or impossible. These methods are best used to describe different aspects of a behavior or psychological phenomenon.
Once a researcher has collected data using descriptive methods, a correlational study can examine how the variables are related. This research method might be used to investigate a hypothesis that is difficult to test experimentally.
Experimental methods are used to demonstrate causal relationships between variables. In an experiment, the researcher systematically manipulates a variable of interest (known as the independent variable) and measures the effect on another variable (known as the dependent variable).
Unlike correlational studies, which can only be used to determine if there is a relationship between two variables, experimental methods can be used to determine the actual nature of the relationship—whether changes in one variable actually cause another to change.
The hypothesis is a critical part of any scientific exploration. It represents what researchers expect to find in a study or experiment. In situations where the hypothesis is unsupported by the research, the research still has value. Such research helps us better understand how different aspects of the natural world relate to one another. It also helps us develop new hypotheses that can then be tested in the future.
Thompson WH, Skau S. On the scope of scientific hypotheses . R Soc Open Sci . 2023;10(8):230607. doi:10.1098/rsos.230607
Taran S, Adhikari NKJ, Fan E. Falsifiability in medicine: what clinicians can learn from Karl Popper [published correction appears in Intensive Care Med. 2021 Jun 17;:]. Intensive Care Med . 2021;47(9):1054-1056. doi:10.1007/s00134-021-06432-z
Eyler AA. Research Methods for Public Health . 1st ed. Springer Publishing Company; 2020. doi:10.1891/9780826182067.0004
Nosek BA, Errington TM. What is replication ? PLoS Biol . 2020;18(3):e3000691. doi:10.1371/journal.pbio.3000691
Aggarwal R, Ranganathan P. Study designs: Part 2 - Descriptive studies . Perspect Clin Res . 2019;10(1):34-36. doi:10.4103/picr.PICR_154_18
Nevid J. Psychology: Concepts and Applications. Wadworth, 2013.
By Kendra Cherry, MSEd Kendra Cherry, MS, is a psychosocial rehabilitation specialist, psychology educator, and author of the "Everything Psychology Book."
Educational resources and simple solutions for your research journey
Any research begins with a research question and a research hypothesis . A research question alone may not suffice to design the experiment(s) needed to answer it. A hypothesis is central to the scientific method. But what is a hypothesis ? A hypothesis is a testable statement that proposes a possible explanation to a phenomenon, and it may include a prediction. Next, you may ask what is a research hypothesis ? Simply put, a research hypothesis is a prediction or educated guess about the relationship between the variables that you want to investigate.
It is important to be thorough when developing your research hypothesis. Shortcomings in the framing of a hypothesis can affect the study design and the results. A better understanding of the research hypothesis definition and characteristics of a good hypothesis will make it easier for you to develop your own hypothesis for your research. Let’s dive in to know more about the types of research hypothesis , how to write a research hypothesis , and some research hypothesis examples .
Table of Contents
A hypothesis is based on the existing body of knowledge in a study area. Framed before the data are collected, a hypothesis states the tentative relationship between independent and dependent variables, along with a prediction of the outcome.
Young researchers starting out their journey are usually brimming with questions like “ What is a hypothesis ?” “ What is a research hypothesis ?” “How can I write a good research hypothesis ?”
A research hypothesis is a statement that proposes a possible explanation for an observable phenomenon or pattern. It guides the direction of a study and predicts the outcome of the investigation. A research hypothesis is testable, i.e., it can be supported or disproven through experimentation or observation.
Here are the characteristics of a good hypothesis :
A study begins with the formulation of a research question. A researcher then performs background research. This background information forms the basis for building a good research hypothesis . The researcher then performs experiments, collects, and analyzes the data, interprets the findings, and ultimately, determines if the findings support or negate the original hypothesis.
Let’s look at each step for creating an effective, testable, and good research hypothesis :
Remember that creating a research hypothesis is an iterative process, i.e., you might have to revise it based on the data you collect. You may need to test and reject several hypotheses before answering the research problem.
When you start writing a research hypothesis , you use an “if–then” statement format, which states the predicted relationship between two or more variables. Clearly identify the independent variables (the variables being changed) and the dependent variables (the variables being measured), as well as the population you are studying. Review and revise your hypothesis as needed.
An example of a research hypothesis in this format is as follows:
“ If [athletes] follow [cold water showers daily], then their [endurance] increases.”
Population: athletes
Independent variable: daily cold water showers
Dependent variable: endurance
You may have understood the characteristics of a good hypothesis . But note that a research hypothesis is not always confirmed; a researcher should be prepared to accept or reject the hypothesis based on the study findings.
Following from above, here is a 10-point checklist for a good research hypothesis :
By following this research hypothesis checklist , you will be able to create a research hypothesis that is strong, well-constructed, and more likely to yield meaningful results.
Different types of research hypothesis are used in scientific research:
A null hypothesis states that there is no change in the dependent variable due to changes to the independent variable. This means that the results are due to chance and are not significant. A null hypothesis is denoted as H0 and is stated as the opposite of what the alternative hypothesis states.
Example: “ The newly identified virus is not zoonotic .”
This states that there is a significant difference or relationship between the variables being studied. It is denoted as H1 or Ha and is usually accepted or rejected in favor of the null hypothesis.
Example: “ The newly identified virus is zoonotic .”
This specifies the direction of the relationship or difference between variables; therefore, it tends to use terms like increase, decrease, positive, negative, more, or less.
Example: “ The inclusion of intervention X decreases infant mortality compared to the original treatment .”
While it does not predict the exact direction or nature of the relationship between the two variables, a non-directional hypothesis states the existence of a relationship or difference between variables but not the direction, nature, or magnitude of the relationship. A non-directional hypothesis may be used when there is no underlying theory or when findings contradict previous research.
Example, “ Cats and dogs differ in the amount of affection they express .”
A simple hypothesis only predicts the relationship between one independent and another independent variable.
Example: “ Applying sunscreen every day slows skin aging .”
A complex hypothesis states the relationship or difference between two or more independent and dependent variables.
Example: “ Applying sunscreen every day slows skin aging, reduces sun burn, and reduces the chances of skin cancer .” (Here, the three dependent variables are slowing skin aging, reducing sun burn, and reducing the chances of skin cancer.)
An associative hypothesis states that a change in one variable results in the change of the other variable. The associative hypothesis defines interdependency between variables.
Example: “ There is a positive association between physical activity levels and overall health .”
A causal hypothesis proposes a cause-and-effect interaction between variables.
Example: “ Long-term alcohol use causes liver damage .”
Note that some of the types of research hypothesis mentioned above might overlap. The types of hypothesis chosen will depend on the research question and the objective of the study.
Here are some good research hypothesis examples :
“The use of a specific type of therapy will lead to a reduction in symptoms of depression in individuals with a history of major depressive disorder.”
“Providing educational interventions on healthy eating habits will result in weight loss in overweight individuals.”
“Plants that are exposed to certain types of music will grow taller than those that are not exposed to music.”
“The use of the plant growth regulator X will lead to an increase in the number of flowers produced by plants.”
Characteristics that make a research hypothesis weak are unclear variables, unoriginality, being too general or too vague, and being untestable. A weak hypothesis leads to weak research and improper methods.
Some bad research hypothesis examples (and the reasons why they are “bad”) are as follows:
“This study will show that treatment X is better than any other treatment . ” (This statement is not testable, too broad, and does not consider other treatments that may be effective.)
“This study will prove that this type of therapy is effective for all mental disorders . ” (This statement is too broad and not testable as mental disorders are complex and different disorders may respond differently to different types of therapy.)
“Plants can communicate with each other through telepathy . ” (This statement is not testable and lacks a scientific basis.)
If a research hypothesis is not testable, the results will not prove or disprove anything meaningful. The conclusions will be vague at best. A testable hypothesis helps a researcher focus on the study outcome and understand the implication of the question and the different variables involved. A testable hypothesis helps a researcher make precise predictions based on prior research.
To be considered testable, there must be a way to prove that the hypothesis is true or false; further, the results of the hypothesis must be reproducible.
1. What is the difference between research question and research hypothesis ?
A research question defines the problem and helps outline the study objective(s). It is an open-ended statement that is exploratory or probing in nature. Therefore, it does not make predictions or assumptions. It helps a researcher identify what information to collect. A research hypothesis , however, is a specific, testable prediction about the relationship between variables. Accordingly, it guides the study design and data analysis approach.
2. When to reject null hypothesis ?
A null hypothesis should be rejected when the evidence from a statistical test shows that it is unlikely to be true. This happens when the test statistic (e.g., p -value) is less than the defined significance level (e.g., 0.05). Rejecting the null hypothesis does not necessarily mean that the alternative hypothesis is true; it simply means that the evidence found is not compatible with the null hypothesis.
3. How can I be sure my hypothesis is testable?
A testable hypothesis should be specific and measurable, and it should state a clear relationship between variables that can be tested with data. To ensure that your hypothesis is testable, consider the following:
4. How do I revise my research hypothesis if my data does not support it?
If your data does not support your research hypothesis , you will need to revise it or develop a new one. You should examine your data carefully and identify any patterns or anomalies, re-examine your research question, and/or revisit your theory to look for any alternative explanations for your results. Based on your review of the data, literature, and theories, modify your research hypothesis to better align it with the results you obtained. Use your revised hypothesis to guide your research design and data collection. It is important to remain objective throughout the process.
5. I am performing exploratory research. Do I need to formulate a research hypothesis?
As opposed to “confirmatory” research, where a researcher has some idea about the relationship between the variables under investigation, exploratory research (or hypothesis-generating research) looks into a completely new topic about which limited information is available. Therefore, the researcher will not have any prior hypotheses. In such cases, a researcher will need to develop a post-hoc hypothesis. A post-hoc research hypothesis is generated after these results are known.
6. How is a research hypothesis different from a research question?
A research question is an inquiry about a specific topic or phenomenon, typically expressed as a question. It seeks to explore and understand a particular aspect of the research subject. In contrast, a research hypothesis is a specific statement or prediction that suggests an expected relationship between variables. It is formulated based on existing knowledge or theories and guides the research design and data analysis.
7. Can a research hypothesis change during the research process?
Yes, research hypotheses can change during the research process. As researchers collect and analyze data, new insights and information may emerge that require modification or refinement of the initial hypotheses. This can be due to unexpected findings, limitations in the original hypotheses, or the need to explore additional dimensions of the research topic. Flexibility is crucial in research, allowing for adaptation and adjustment of hypotheses to align with the evolving understanding of the subject matter.
8. How many hypotheses should be included in a research study?
The number of research hypotheses in a research study varies depending on the nature and scope of the research. It is not necessary to have multiple hypotheses in every study. Some studies may have only one primary hypothesis, while others may have several related hypotheses. The number of hypotheses should be determined based on the research objectives, research questions, and the complexity of the research topic. It is important to ensure that the hypotheses are focused, testable, and directly related to the research aims.
9. Can research hypotheses be used in qualitative research?
Yes, research hypotheses can be used in qualitative research, although they are more commonly associated with quantitative research. In qualitative research, hypotheses may be formulated as tentative or exploratory statements that guide the investigation. Instead of testing hypotheses through statistical analysis, qualitative researchers may use the hypotheses to guide data collection and analysis, seeking to uncover patterns, themes, or relationships within the qualitative data. The emphasis in qualitative research is often on generating insights and understanding rather than confirming or rejecting specific research hypotheses through statistical testing.
Editage All Access is a subscription-based platform that unifies the best AI tools and services designed to speed up, simplify, and streamline every step of a researcher’s journey. The Editage All Access Pack is a one-of-a-kind subscription that unlocks full access to an AI writing assistant, literature recommender, journal finder, scientific illustration tool, and exclusive discounts on professional publication services from Editage.
Based on 22+ years of experience in academia, Editage All Access empowers researchers to put their best research forward and move closer to success. Explore our top AI Tools pack, AI Tools + Publication Services pack, or Build Your Own Plan. Find everything a researcher needs to succeed, all in one place – Get All Access now starting at just $14 a month !
BMC Cancer volume 24 , Article number: 1037 ( 2024 ) Cite this article
359 Accesses
1 Altmetric
Metrics details
CD19-targeted chimeric antigen receptors (CAR) T cells are one of the most remarkable cellular therapies for managing B cell malignancies. However, long-term disease-free survival is still a challenge to overcome. Here, we evaluated the influence of different hinge, transmembrane (TM), and costimulatory CAR domains, as well as manufacturing conditions, cellular product type, doses, patient’s age, and tumor types on the clinical outcomes of patients with B cell cancers treated with CD19 CAR T cells. The primary outcome was defined as the best complete response (BCR), and the secondary outcomes were the best objective response (BOR) and 12-month overall survival (OS). The covariates considered were the type of hinge, TM, and costimulatory domains in the CAR, CAR T cell manufacturing conditions, cell population transduced with the CAR, the number of CAR T cell infusions, amount of CAR T cells injected/Kg, CD19 CAR type (name), tumor type, and age. Fifty-six studies (3493 patients) were included in the systematic review and 46 (3421 patients) in the meta-analysis. The overall BCR rate was 56%, with 60% OS and 75% BOR. Younger patients displayed remarkably higher BCR prevalence without differences in OS. The presence of CD28 in the CAR’s hinge, TM, and costimulatory domains improved all outcomes evaluated. Doses from one to 4.9 million cells/kg resulted in better clinical outcomes. Our data also suggest that regardless of whether patients have had high objective responses, they might have survival benefits from CD19 CAR T therapy. This meta-analysis is a critical hypothesis-generating instrument, capturing effects in the CD19 CAR T cells literature lacking randomized clinical trials and large observational studies.
Peer Review reports
Chimeric antigen receptors (CARs) are artificial cell membrane receptors responsible for immune cell activation. They are constituted by an extracellular binding domain selected against an antigen, usually in the form of a single-chain variable fragment (scFv), a hinge sequence, and a transmembrane domain fused to intracellular costimulatory and stimulatory signaling domains. First-generation CARs had only one CD3ζ chain in the intracellular domain for T cell activation. Second- and third-generation CARs harbor one and two additional intracellular costimulatory domains, respectively, eliciting complete T cell activation. Fourth-generation CARs are based on second or third-generation CARs designed in a vector able to induce the expression of additional transgenic products, constitutively or by induction, such as cytokines or monoclonal antibodies. The CAR expression has been vastly explored in T cells (CAR T cells), and is evolving in other immune cell types, such as NK cells, dendritic cells, and macrophages, ushering in a new era for the treatment of cancer and other diseases [ 1 , 2 ]. In clinical trials, the main domains constituting the hinge part of a CAR are CD28, CD8 alpha, IgG4, or IgG1, while for the transmembrane domain (TM), CD28 or CD8 alpha are the most applied. The costimulatory domains more extensively applied in the clinical setting are CD28 and 4-1BB. CD28 incorporation into the costimulatory domain of CD19 CAR elicits tumor eradication, glycolysis, effector memory maturation, and T cell exhaustion, whereas 4-1BB signaling induces in vivo T cell persistence, mitochondrial biogenesis, and reprogramming towards a central memory T cell phenotype [ 3 ]. Regardless of a few small studies that explored the clinical impact of using different costimulatory domains in the CAR, there is a lack of information about the influence of different hinge or TM domains on the clinical outcomes of patients treated with CAR T cells.
One of the current most effective CAR T cell therapies targets CD19, an antigen expressed by B cells in all stages of development until differentiation in plasmocytes, including B cell malignancies, such as Hodgkin (HL) and non-Hodgkin lymphoma (NHL), acute (ALL) or chronic lymphocytic leukemia (CLL) [ 4 ]. All tumor types treated with this therapy had a high initial complete response (CR) rate, but long-term disease-free survival can still be improved [ 4 ]The therapeutic success of CAR T cells is sometimes discrepant as it is shaped by several factors, boosting the conduction of a comparative analysis to address the global impact of in vivo and ex vivo conditions that influence CD19 CAR T cell performance in clinical trials.
Here, we analyzed the rates of the primary outcome – defined as best complete response (BCR) – and secondary outcomes defined as 12-month overall survival (OS) and best objective response (BOR) of CD19-positive leukemia or lymphoma patients treated with CD19 CAR T cells containing different hinge, transmembrane (TMD), and costimulatory domains. We have also analyzed the impact of different parameters related to CAR T cell manufacturing conditions, such as the type of interleukin used for CAR T cell expansion, CAR T cells activation method, and cell population transduced with the CAR. We have also evaluated the number of CAR T cell infusions, amount of CAR T cells injected/Kg, CD19 CAR type (name), tumor type, and age. This meta-analysis will be helpful as a hypothesis-generating instrument as it tries to capture effects in the literature that is still recent, lacking randomized clinical trials and large observational studies.
We accomplished a systematic review and meta-analysis according to the PRISMA statement [ 5 , 6 ], registered on PROSPERO (CRD42022360268). The main study question is the rate of BCR in patients undergoing treatment for B cell malignancies according to the CD19 CAR T cells hinge, transmembrane, and costimulatory domains. The MEDLINE/PubMed database was searched from the inception until August 2021, using the following keywords: “receptors, chimeric antigen“[MeSH Terms] OR (“receptors“[All Fields] AND “chimeric“[All Fields] AND “antigen“[All Fields]) OR “chimeric antigen receptors“[All Fields] OR (“chimeric“[All Fields] AND “antigen“[All Fields] AND “receptor“[All Fields]) OR “chimeric antigen receptor“[All Fields]) AND “CD19“[All Fields].
The inclusion criteria were patients with CD19-positive leukemia or lymphoma treated with second or third-generation CD19 CAR T cells. Only studies with original data and in English were included. Grey literature and reference lists from included studies were also considered.
The exclusion criteria were studies with (a) no primary outcome reported, (b) dual CAR, (c) other CAR cells types, such as CAR macrophages, (d) combinations with CAR T cells targeting other molecules or with other targeted or non-targeted therapies, such as hematopoietic stem cell transplant, (e) patients with multiple myeloma and other non-hematological tumors, (f) case series, (g) studies such as meta-analyses, reviews, case reports, protocols, books, letters to the editor, comments or specialists’ opinions, abstracts, and (h) pre-clinical studies. Studies ≤ 10 patients were included in the evidence summary but were excluded from the meta-analysis due to statistical constraints.
Data extracted comprised the rate of successful outcomes versus the sample included in the study, and BCR was defined as the primary outcome. The secondary outcomes were OS and BOR. For the meta-analysis, categorical covariates were the types of hinge, TM, and costimulatory (costimulation) domains in the CAR, CAR T cell manufacturing conditions, such as the interleukin used for CAR T cell expansion, CAR T cells activation method, and cell population transduced with the CAR – PBMCs or other specific subsets – (CAR T cell type), as well as the CD19 CAR type (CAR name), and tumor type. Numerical covariates were patient age, number of CAR T cells injected/Kg, and the number of CAR T cells infusions.
Two independent investigators (ERS and NSPC) screened titles and abstracts with ties resolved by a third person (VAP). Three authors (NSPC, VAP, GCPS) independently performed the full-text review and extracted the data, and ERS resolved disagreements.
The data was presented in a summary of evidence and synthesized as forest plots, with studies ordered by publication year. All methodological details of the meta-analysis were included in the Supplementary Methods.
Risk of bias assessment adopted the Modified Institute of Health Economics Tool for bias analysis [ 7 ] and was performed independently by three authors (NSPC, VAP, GCPS).
Statistical analysis was performed with RStudio version 1.1.383 (The R Foundation for Statistical Computing, Vienna, Austria), using meta and metafor packages [ 8 , 9 ].
Fifty-six studies were included in the systematic review with a total of 3493 patients, 2904 treated with CAR T, and 2809 patients analyzed for rate estimation of BCR. Of these patients, 1440 presented a CR, and 1587 had an objective response (OR). We have also evaluated 12 months-OS, having 42 studies with a total of 2992 patients included, 2479 patients treated with CAR T, and 2393 patients analyzed, of whom 1567 were alive at 12 months.
A total of 46 studies with more than or equal to 10 patients were included in the meta-analysis involving 3421 patients, of whom 2837 were treated with CAR T and 2746 patients analyzed for rate estimation of the primary outcome BCR, being 1251 patients presenting CR and 1571 presenting OR, one of the secondary outcomes evaluated. For the other secondary outcome assessed, OS, we had 37 studies with 2949 patients, 2439 patients treated with CAR T, and 2356 patients analyzed for OS, of whom 1547 were alive at 12 months. The PRISM flow diagram is present in Fig. 1 , and the summary of evidence in Table 1 .
Flow chart for eligibility assessment according to PRISMA guidelines. Based on model reference(Page et al., 2021a). For more information, visit: http://www.prisma-statement.org/
General clinical responses of cd19 car t therapy.
The general proportion of BCR was 56% (95%CI: 49 – 63%), the I 2 was 81%, and the τ 2 was 0.7911 indicating a large between-study variance (Fig. 2 ). However, it equals or exceeds 50% in 28 of 46 studies (Fig. 2 ). Table 2 summarizes meta-analysis data for primary outcome BCR (also presented in full version with references as Suppl. Table 1 ). The bias assessment is presented in Fig. 3 .
The Forest Plot represents the overall rate of the primary outcome Best Complete Response (BCR) of patients treated with CD19 CAR T therapy based on the studies included in the meta-analysis
Funnel, Baujat, and Radial plots showing the heterogeneity observed for the primary outcome Best Complete Response (BCR) of patients treated with CD19 CAR T therapy based on the studies included in the meta-analysis
The general proportion of OS was 60% (95%CI: 53 – 67%), the I 2 was 87%, and the τ 2 was 0.5642 (Suppl. Figure 1 and Suppl. Table 2 ) indicating a moderate between study variance. The overall rate of BOR with CD19 CAR T therapy was 75% (95% CI: 68 – 82%, I 2 = 78%) with a very high between-study variance (τ2 = 1.2262) and rates equal to or above 50% in 40 of 46 studies (Suppl. Figure 2 and Suppl. Table 3 ). Together, these data indicate substantial heterogeneity. The bias assessment for OS and BOR are also presented in Suppl. Figure 3 . All the other forest plots are presented as Suppl. Figure 4 to 39 .
Patients under 18 years old had a 79% BCR (95%CI: 65-89%, I 2 :64%), 62% OS (95%CI: 41-80%, I 2 :73%) and 84% BOR (95%CI: 75-90%, I 2 :31%) (Suppl Figs. 4 , 5 and 6 , respectively). Patients above 18 years old presented a 51% BCR (95%CI: 43 − 57%, I 2 :82%), 60% OS (95%CI: 52- 67%, I 2 :88%) and 73% BOR (95%CI: 64-81%, I 2 :79%) (Suppl Figs. 4 , 5 and 6 , respectively).
Considering interleukin used for CAR T cell expansion, when IL-2 was used we found 58% BCR (95%CI: 50-66%, I 2 :76%), 56% OS (95%CI: 45-66%, I 2 :86%) and 79% BOR (95%CI: 68-87%, I 2 :70%) (Suppl Figs. 7 , 8 and 9 , respectively). When other interleukins were applied, we had a 54% BCR (95%CI: 43-65%, I 2 :72%), 63% OS (95%CI: 50-75%, I 2 :91%), and 73% BOR (95%CI: 64-80%, I 2 :71) (Suppl Figs. 7 , 8 and 9 , respectively).
The BCR (Suppl Fig. 10 ), OS (Suppl Fig. 11 ), and BOR (Suppl Fig. 12 ) proportions were similar for activation and expansion of CAR T cells with anti-CD3/CD28 beads or anti-CD3 mAb. Considering the cell population transduced with the CAR, we have found similar BCR (Suppl Fig. 13 ) and BOR (Suppl Fig. 14 ) rates when using full PBMCs or CD4/CD8 1:1, CD8 only, or other specific subsets. OS rate was higher when using full PBMCs (61%; 95%CI: 53–73%, I 2 : 86%) compared to 55% for CD4/CD8 1:1, CD8 only, or other specific subsets (55%; 95%CI: 35–73%, I 2 : 86%) (Suppl Fig. 15 ).
Patients treated with doses between 1 and 4.9 million cells/ kg per injection had BCR rates of 63% (95%CI: 55-71%, I 2 :77%), 60% OS (95%CI: 50-69%, I 2 :85%), and 83% BOR (95%CI: 76-88%, I 2 :74%) (Suppl Figs. 16 , 17 and 18 , respectively). The 5 to 99 million cells/kg group had only three studies and was not considered for comparison (71% BCR; 95%CI: 25-95%, I 2 :62%; 58% OS; 95%CI: 21-88%, I 2 :66%, and 83% BOR, 95%CI:29–98%, I 2 : 64) (Suppl Figs. 16 , 17 and 18 , respectively). Doses superior to 100 million cells/kg showed lower BCR (36%; 95%CI: 28–46%, I 2 :38%), OS (56%, 95%CI: 25–83%, I 2 :94%) and BOR rates (64%, 95%CI: 32-87%, I 2 :69%) (Suppl Figs. 16 , 17 and 18 , respectively).
The proportions for a single cell injection were 55% for BCR (95%CI:48-62%, I 2 : 81%), 61% for OS (95%CI:52-69%, I 2 : 88%) and 78% for BOR (95%CI:69-85%, I 2 : 77%) (Suppl Figs. 19 , 20 and 21 , respectively). For two infusions, the number of studies was meager (65% BCR; 95%CI: 0-100, I 2 : 97%; 70% OS, 95%CI:55-82%, I 2 : not applicable) (Suppl Figs. 19 , 20 and 21 , respectively). Studies with three or more infusions showed a 50% BCR rate (50%; 95%CI: 25–74%, I 2 : 74%) and 72% BOR (95%CI: 40–91%, I 2 : 81%) (Suppl Fig. 19 , and 21 , respectively). For OS, the number of studies was also meager (58% OS, 95%CI: 29–83%, I 2 : 66%) (Suppl Fig. 20 ).
For Axicabtagene ciloleucel (Axi-cel), we have found a 62% BCR (95%CI: 56–67%, I 2 : 52%), 68% OS (95%CI: 59–77%%, I 2 : 80%) and 86% BOR rates (95%CI: 78–91%, I 2 : 46%) (Suppl Figs. 22 , 23 and 24 , respectively). Tisagenlecleucel (Tisa-cel) showed 53% BCR (95%CI:38–67%, I 2 : 66%), 61% OS (95%CI:42–76%, I 2 : 92%) and 70% BOR rates (95%CI:59–79%, I 2 : 55%) (Suppl Figs. 22 , 23 and 24 , respectively). Other CD19 CAR T products more recently tested had a 60% BCR (95%CI:40–78%, I 2 :82%), 57% OS (95%CI:52–62%, I 2 :40%), and 67% BOR rates (95%CI:44–86%, I 2 :80%) (Suppl Figs. 22 , 23 and 24 , respectively).
When CD28 was used to construct the CAR hinge domain, we had a 60% BCR (95%CI:55–66%, I 2 : 52%), 65% OS (95%CI:55–74%, I 2 : 81%) and 83% BOR rates (95%CI:73–90%, I 2 : 66%) (Suppl Figs. 25 , 26 and 27 , respectively). For CD8, we observed 56% BCR (95%CI:42–70%, I 2 : 75%), 59% OS (95%CI:46–71%, I 2 : 89%), and 71% BOR (95%CI:58–82%, I 2 : 66%) (Suppl Figs. 25 , 26 and 27 , respectively). IgG4 resulted in 50% BCR (95%CI:35–66%, I 2 : 85%), 50% OS (95%CI:32–59%, I 2 : 84%) and 71% BOR (95%CI: 54–83%, I 2 : 79%) (Suppl Figs. 25 , 26 and 27 , respectively).
When the CD28 transmembrane domain was used to build the CAR, we found a 58% BCR (95%CI:48–67%, I 2 : 80%), 61% OS (95%CI:51–70%, I 2 : 85%) and 79% BOR (95%CI:69–86%, I 2 : 80%) (Suppl Figs. 28 , 29 and 30 , respectively). CD8 alpha in the transmembrane resulted in 54% BCR (95%CI:40–68%, I 2 : 73%), 59% OS (95%CI:45–72%, I 2 : 90%) and 70% BOR (95%CI:55–82%, I 2 : 67%) (Suppl Figs. 28 , 29 and 30 , respectively).
The CD28 costimulatory domain in the CAR resulted in 60% BCR (95%CI:54–66%, I 2 : 55%), 66% OS (95%CI:57–74%, I 2 : 79%) and 85% BOR rates (95%CI:78–91%, I 2 : 45%), while for 4-1BB we had 56% BCR (95%CI:44–67%, I 2 : 82%), 56% OS (95%CI:45–66%, I 2 : 89%) and 71% BOR (95%CI:61–79%, I 2 : 76%) (Suppl Figs. 31 , 32 and 33 , respectively).
Patients with ALL achieved 73% BCR (95%CI:60–83%, I 2 : 77%), 57% OS (95%CI:45–68%%, I 2 : 67%), and 80% BOR (95%CI:66–89%, I 2 : 64%) %) (Suppl Figs. 34 , 35 and 36 , respectively), while for NHL, the general BCR was 51% (95%CI:45–57%, I 2 : 75%), 59% OS (95%CI:46–72%, I 2 : 92%) and 71% BOR (95%CI:63–78%, I 2 : 74%) (Suppl Figs. 34 , 35 and 36 , respectively).
The meta-regression showed that the group aged above 18 presented a low but significant amount of heterogeneity explained by this variable (H 2 = 7.5535) and that the moderator is inversely related to BCR, suggesting that the effect size favors the younger patient (estimate= -1.3211; p = 0.005). Also, costimulation based on CD28 and third-generation CD28/4-1BB presents a small amount of heterogeneity explained (H 2 = 9.1079), but both were statistically significant moderators ( p = 0.0391 and p = 0.0493, respectively). For BOR, the attributable heterogeneity for costimulatory domains was H 2 = 7.5535, and CD28 and 4-1BB were significant for this observation ( p = 0.0047 and p = 0.0355). The attributable heterogeneity for the CAR T cell product was small (H 2 = 7.4956); however, there was an inverse effect for Tisa-cel and JCAR014 as moderators ( p = 0.0336 and p = 0.0097). Finally, for OS, the attributable heterogeneity for the CAR T cell product was H 2 = 6.0343, and only the treatment with JCAR014 presented an inverse and statistically significant moderator effect ( p = 0.0215).
A predominant low risk of bias was assessed for the primary and the secondary outcomes, presented in Suppl. Figures 37 , 38 , and 39 , respectively.
The pooled 56%BCR found for all CD19 CAR T therapies evaluated herein, with a 60% OS and 75% BOR, corroborates the results found in most CD19 CAR T clinical trials [ 66 ]. However, among the studies included in this meta-analysis, there are also some outliers, such as one published by Ramos et al. (2016), showing only 13% BCR and 19% BOR ( N = 16 patients, no OS reported), that can be explained by the employment of a first-generation CAR, which usually fails to reach effective antitumor responses [ 67 , 68 ].For comparison, a meta-analysis focused on DLBCL conducted in 2022 by Ying and collaborators showed a similar pooled 63% OS rate and 74% BOR, diverging only by an expressively lower 48% BCR [ 69 ].Additionally, another meta-analysis published in 2021 by Aamir et al., focused on ALL patients, reported an 82% BCR rate. Neither OS nor BOR were reported in this study for comparison [ 70 ]. The difference in pooled BCR from these two studies compared with ours can be explained, at least in part, by the mixed tumor types included in our study, such as ALL, CLL, and other NHL subtypes. When we compared ALL and NHL in our sensitivity analysis for tumor type, the most expressive differences between them were also found for BCR (73 versus 51%), followed by BOR (80 versus 71%) rates, while both tumors resulted in similar OS rates (59 versus 57%). Our data also suggest that regardless of whether patients have had high objective responses or not, they might have survival benefits from CD19 CAR T therapy.
Among the CAR T manufacturing conditions evaluated herein, the cell populations chosen to build the CAR product and the cytokine used for T cell expansion promoted the most relevant differences for the clinical outcomes analyzed, mainly for OS. PBMCs had higher OS but similar BOR and BCR rates compared to CD4/CD8 1:1 clustered with CD8 and other specific subsets for analysis. The clustering of CD4/CD8 1:1, CD8 alone, or others might have influenced the results obtained since there is pre-clinical and clinical evidence that CD4:CD8 1:1 seems to outperform other populations. However, we decided to cluster these groups due to the small number of clinical studies available to evaluate each one of these cell populations separately. ILs different from IL-2 used for CD19 CAR T cell manufacturing showed higher OS rates despite similar BCR and lower BOR, evidencing the necessity of running clinical trials using different cytokines for CAR T cell expansion to evaluate their impacts on clinical responses. The CAR T cell activation and expansion methods were equivalent for all outcomes evaluated.
Considering the covariate age, patients under 18 had notably higher BCR and BOR rates but similar OS compared to older patients. CD19 CAR T cell therapy is known to induce a high clinical response rate in children and young adults, especially with B-ALL, but relapses are still a current issue [ 62 ], explaining, at least in part, the similar OS despite the higher BCR rates found in younger patients.
Regarding the CAR T cell dose effect, higher BCR, BOR, and OS rates were found for patients treated with doses between 1 and 4.9 million cells/kg compared to those with doses greater than 100 million cells/kg. The dose-effect might be biased considering the higher BCR and BOR rates found for younger patients, usually treated with lower CAR T cell doses. Nevertheless, the age bias can be ruled out for the higher OS rates observed for lower CAR T doses since OS was not affected by age. When the number of CAR T infusions was evaluated, we noted that three or more infusions presented lower rates for the evaluated outcomes. This result is critical because higher CAR T doses with repeated infusions are known to enhance toxicity [ 71 , 72 ] despite the evident increased manufacturing cost. These results might affect the design of future comparative CD19 CAR T cells-based clinical trials, which can be focused on testing different dose scales up to 100 million cells/kg, limiting the administration to one or two infusions.
The comparison of different molecules used to build the structural CD19-directed CAR hinge (CD8, CD28, or IgG4), transmembrane (CD8 or CD28), and costimulatory domains (CD28 or 4-1BB) showed that the presence of CD28 in these three domains revealed higher rates for all the clinical outcomes evaluated. It might be possible that the different CAR domains act synergistically since they are part of the same functional full costimulatory molecule in human immune cells. However, we cannot affirm or discard this hypothesis based on our data. Particularly considering OS, the most relevant rate difference was found when CD28 was in the CAR’s hinge and costimulatory regions. For BCR, the rate differences between CD28 and other molecules tested were less relevant. A CAR hinge and transmembrane-based comparison with clinical data has never been performed before in the literature, and our meta-analysis gives us some evidence that must be further investigated in future studies to clarify the possibility of synergism combining different/ equal domains. For the costimulatory domains, data recently published in a meta-analysis focused on patients with diffuse large B-cell lymphoma (DLBCL) treated with CD19 CAR T cells corroborated our findings, showing higher BCR and BOR rates of CD28 (57% BCR and 81% BOR) compared to 4-1BB (42% BCR and 70% BOR). However, they found a non-significant statistical difference between CD28 and 4-1BB considering the 12-month OS rate for DLBCL patients [ 69 ].In the same study, the CD28-based Axi-cel had higher rates for all outcomes evaluated compared with the 4-1BB Tisa-cel CAR T for the treatment of DLBCL patients, with a BCR rate of 57% versus 36%, OS rate of 65% versus 49%, and BOR rate of 82% versus 58%, respectively [ 69 ]A clinical trial comparing CD19 CAR-T containing either CD28 or 4-1BB was performed to treat ten ALL patients, five treated with each type of construction in a dose of 0.62 × 10 6 CAR T cells/kg. This study showed similar responses for both treatments, with the CD28 group resulting in 3 CR, 1 PR, and one no response (NR), and the 4-1BB with 3 CR, 0 PR, and 2 NR. Despite the superior number of NR patients in the 4-1BB group, this group had a unique patient with an ongoing anti-tumor response evaluated five months after treatment [ 73 ].This clinical trial was not conclusive due to the limited number of patients. Still considering the costimulatory domain of the CAR, Cappel and Kochenderfer recently reviewed and compared CAR T cell clinical studies based on different targets and having CD28 or 4-1BB as costimulatory domains, including but not limiting CD19 as a target. This general review showed that the available data from clinical trials do not demonstrate a clear advantage of either CD28-costimulated or 4-1BB-costimulated CARs for treating B cell lymphomas or B-ALL, pointing out that more extensive studies and comparative clinical trials must be performed to allow a conclusion about the performance of the different costimulatory domains against B-cell malignancies [ 74 ].
This study is the pioneer in evaluating the impact of the hinge and TMD CAR domains in addition to costimulatory domains in CD19 CAR T cell’s clinical response for B cell leukemia and lymphoma, which is an essential unanswered question in the field. In summary, several covariates analyzed might have a positive impact on all the evaluated clinical outcomes BCR, OS, and BOR of patients treated with CD19 CAR T cell therapies, such as age inferior to 18 years old, injection of 1 to 4.9 million CAR T cells per kg, with one CAR T cell infusion – without discard a potential efficiency using two doses – and CD28 constituting the hinge, transmembrane, and costimulatory domains of the CAR, as in Axi-cel product, and must be better explored in future comparative clinical trials.
The lack of randomized trials or large observational studies on CAR T cells justifies the implementation of this meta-analysis, which intends to provide insights on the ongoing procedures for further research, raising questions and spotting potential aspects of interest in the current approaches. Due to the unavoidable heterogeneity observed, the results of this meta-analysis are not deemed for clinical decision-making but to improve the understanding of this complex and multifaceted treatment instead. The extrapolation and generalization of the results obtained in this meta-analysis should be made with caution since it may be biased by the different study designs and characteristics considering CAR structures, CAR T cell manufacture conditions, doses, tumor type, autologous cells isolated from each individual heavily pretreated, and other variables.
All data generated or analyzed during this study are included in this published article and its supplementary information files.
Acute lymphocytic leukemia
Best complete response
Best objective response
Chimeric antigen receptors
Chronic lymphocytic leukemia
Complete response
Diffuse large B-cell lymphoma
Hodgkin lymphoma
Non-Hodgkin lymphoma
Overall survival
Objective response
Peripheral blood mononuclear cells
Single-chain variable fragment
Transmembrane domains
Dana H, Chalbatani GM, Jalali SA, Mirzaei HR, Grupp SA, Suarez ER, Raposo C, Webster TJ. CAR-T cells: early successes in blood cancer and challenges in solid tumors. Acta Pharm Sin B. 2021;11(5):1129–47.
Article CAS PubMed Google Scholar
Mantovani A, Allavena P, Marchesi F, Garlanda C. Macrophages as tools and targets in cancer therapy. Nat Rev Drug Discov. 2022;21(11):799–820.
Article CAS PubMed PubMed Central Google Scholar
Kawalekar OU, O’Connor RS, Fraietta JA, Guo L, McGettigan SE, Posey AD Jr., Patel PR, Guedan S, Scholler J, Keith B, et al. Distinct signaling of Coreceptors regulates specific metabolism pathways and impacts Memory Development in CAR T cells. Immunity. 2016;44(2):380–90.
Mohty M, Gautier J, Malard F, Aljurf M, Bazarbachi A, Chabannon C, Kharfan-Dabaja MA, Savani BN, Huang H, Kenderian S, et al. CD19 chimeric antigen receptor-T cells in B-cell leukemia and lymphoma: current status and perspectives. Leukemia. 2019;33(12):2767–78.
Article PubMed Google Scholar
Page MJ, Moher D, Bossuyt PM, Boutron I, Hoffmann TC, Mulrow CD, Shamseer L, Tetzlaff JM, Akl EA, Brennan SE, et al. PRISMA 2020 explanation and elaboration: updated guidance and exemplars for reporting systematic reviews. BMJ. 2021;372:n160.
Article PubMed PubMed Central Google Scholar
Page MJ, McKenzie JE, Bossuyt PM, Boutron I, Hoffmann TC, Mulrow CD, Shamseer L, Tetzlaff JM, Akl EA, Brennan SE, et al. The PRISMA 2020 statement: an updated guideline for reporting systematic reviews. J Clin Epidemiol. 2021;134:178–89.
Grigor EJM, Fergusson DA, Haggar F, Kekre N, Atkins H, Shorr R, Holt RA, Hutton B, Ramsay T, Seftel M, et al. Efficacy and safety of chimeric antigen receptor T-cell (CAR-T) therapy in patients with haematological and solid malignancies: protocol for a systematic review and meta-analysis. BMJ Open. 2017;7(12):e019321.
Balduzzi S, Rucker G, Schwarzer G. How to perform a meta-analysis with R: a practical tutorial. Evid Based Ment Health. 2019;22(4):153–60.
Viechtbauer W. Conducting meta-analyses in R with the metafor package. J Stat Softw. 2010;36:1–48.
Article Google Scholar
Brentjens RJ, Riviere I, Park JH, Davila ML, Wang X, Stefanski J, Taylor C, Yeh R, Bartido S, Borquez-Ojeda O, et al. Safety and persistence of adoptively transferred autologous CD19-targeted T cells in patients with relapsed or chemotherapy refractory B-cell leukemias. Blood. 2011;118(18):4817–28.
Kalos M, Levine BL, Porter DL, Katz S, Grupp SA, Bagg A, June CH. T cells with chimeric antigen receptors have potent antitumor effects and can establish memory in patients with advanced leukemia. Sci Transl Med. 2011;3(95):95ra73.
Kochenderfer JN, Dudley ME, Feldman SA, Wilson WH, Spaner DE, Maric I, Stetler-Stevenson M, Phan GQ, Hughes MS, Sherry RM, et al. B-cell depletion and remissions of malignancy along with cytokine-associated toxicity in a clinical trial of anti-CD19 chimeric-antigen-receptor-transduced T cells. Blood. 2012;119(12):2709–20.
Brentjens RJ, Davila ML, Riviere I, Park J, Wang X, Cowell LG, Bartido S, Stefanski J, Taylor C, Olszewska M, et al. CD19-targeted T cells rapidly induce molecular remissions in adults with chemotherapy-refractory acute lymphoblastic leukemia. Sci Transl Med. 2013;5(177):177ra138.
Kochenderfer JN, Dudley ME, Kassim SH, Somerville RP, Carpenter RO, Stetler-Stevenson M, Yang JC, Phan GQ, Hughes MS, Sherry RM, et al. Chemotherapy-refractory diffuse large B-cell lymphoma and indolent B-cell malignancies can be effectively treated with autologous T cells expressing an anti-CD19 chimeric antigen receptor. J Clin Oncol. 2015;33(6):540–9.
Lee DW, Kochenderfer JN, Stetler-Stevenson M, Cui YK, Delbrook C, Feldman SA, Fry TJ, Orentas R, Sabatino M, Shah NN, et al. T cells expressing CD19 chimeric antigen receptors for acute lymphoblastic leukaemia in children and young adults: a phase 1 dose-escalation trial. Lancet. 2015;385(9967):517–28.
Bhoj VG, Arhontoulis D, Wertheim G, Capobianchi J, Callahan CA, Ellebrecht CT, Obstfeld AE, Lacey SF, Melenhorst JJ, Nazimuddin F, et al. Persistence of long-lived plasma cells and humoral immunity in individuals responding to CD19-directed CAR T-cell therapy. Blood. 2016;128(3):360–70.
Kebriaei P, Singh H, Huls MH, Figliola MJ, Bassett R, Olivares S, Jena B, Dawson MJ, Kumaresan PR, Su S, et al. Phase I trials using sleeping beauty to generate CD19-specific CAR T cells. J Clin Invest. 2016;126(9):3363–76.
Ramos CA, Savoldo B, Torrano V, Ballard B, Zhang H, Dakhova O, Liu E, Carrum G, Kamble RT, Gee AP, et al. Clinical responses with T lymphocytes targeting malignancy-associated κ light chains. J Clin Invest. 2016;126(7):2588–96.
Turtle CJ, Hanafi LA, Berger C, Hudecek M, Pender B, Robinson E, Hawkins R, Chaney C, Cherian S, Chen X, et al. Immunotherapy of non-hodgkin’s lymphoma with a defined ratio of CD8 + and CD4 + CD19-specific chimeric antigen receptor-modified T cells. Sci Transl Med. 2016;8(355):355ra116.
Gardner RA, Finney O, Annesley C, Brakke H, Summers C, Leger K, Bleakley M, Brown C, Mgebroff S, Kelly-Spratt KS, et al. Intent-to-treat leukemia remission by CD19 CAR T cells of defined formulation and dose in children and young adults. Blood. 2017;129(25):3322–31.
Hu Y, Wu Z, Luo Y, Shi J, Yu J, Pu C, Liang Z, Wei G, Cui Q, Sun J, et al. Potent anti-leukemia activities of Chimeric Antigen Receptor-Modified T Cells against CD19 in Chinese patients with Relapsed/Refractory Acute lymphocytic leukemia. Clin Cancer Res. 2017;23(13):3297–306.
Locke FL, Neelapu SS, Bartlett NL, Siddiqi T, Chavez JC, Hosing CM, Ghobadi A, Budde LE, Bot A, Rossi JM. Phase 1 results of ZUMA-1: a multicenter study of KTE-C19 anti-CD19 CAR T cell therapy in refractory aggressive lymphoma. Mol Ther. 2017;25(1):285–95.
Neelapu SS, Locke FL, Bartlett NL, Lekakis LJ, Miklos DB, Jacobson CA, Braunschweig I, Oluwole OO, Siddiqi T, Lin Y, et al. Axicabtagene Ciloleucel CAR T-Cell Therapy in Refractory large B-Cell lymphoma. N Engl J Med. 2017;377(26):2531–44.
Schuster SJ, Svoboda J, Chong EA, Nasta SD, Mato AR, Anak Ö, Brogdon JL, Pruteanu-Malinici I, Bhoj V, Landsburg D, et al. Chimeric Antigen Receptor T Cells in Refractory B-Cell Lymphomas. N Engl J Med. 2017;377(26):2545–54.
Turtle CJ, Hay KA, Hanafi LA, Li D, Cherian S, Chen X, Wood B, Lozanski A, Byrd JC, Heimfeld S, et al. Durable molecular remissions in chronic lymphocytic leukemia treated with CD19-Specific chimeric Antigen receptor-modified T cells after failure of Ibrutinib. J Clin Oncol. 2017;35(26):3010–20.
Enblad G, Karlsson H, Gammelgård G, Wenthe J, Lövgren T, Amini RM, Wikstrom KI, Essand M, Savoldo B, Hallböök H, et al. A phase I/IIa trial using CD19-Targeted third-generation CAR T cells for Lymphoma and Leukemia. Clin Cancer Res. 2018;24(24):6185–94.
Geyer MB, Riviere I, Senechal B, Wang X, Wang Y, Purdon TJ, Hsu M, Devlin SM, Halton E, Lamanna N, et al. Autologous CD19-Targeted CAR T cells in patients with residual CLL following initial Purine Analog-based therapy. Mol Ther. 2018;26(8):1896–905.
Jacoby E, Bielorai B, Avigdor A, Itzhaki O, Hutt D, Nussboim V, Meir A, Kubi A, Levy M, Zikich D, et al. Locally produced CD19 CAR T cells leading to clinical remissions in medullary and extramedullary relapsed acute lymphoblastic leukemia. Am J Hematol. 2018;93(12):1485–92.
Maude SL, Laetsch TW, Buechner J, Rives S, Boyer M, Bittencourt H, Bader P, Verneris MR, Stefanski HE, Myers GD, et al. Tisagenlecleucel in Children and Young adults with B-Cell lymphoblastic leukemia. N Engl J Med. 2018;378(5):439–48.
Park JH, Rivière I, Gonen M, Wang X, Sénéchal B, Curran KJ, Sauter C, Wang Y, Santomasso B, Mead E, et al. Long-term follow-up of CD19 CAR therapy in Acute Lymphoblastic Leukemia. N Engl J Med. 2018;378(5):449–59.
Rossi J, Paczkowski P, Shen YW, Morse K, Flynn B, Kaiser A, Ng C, Gallatin K, Cain T, Fan R, et al. Preinfusion polyfunctional anti-CD19 chimeric antigen receptor T cells are associated with clinical outcomes in NHL. Blood. 2018;132(8):804–14.
Svoboda J, Rheingold SR, Gill SI, Grupp SA, Lacey SF, Kulikovskaya I, Suhoski MM, Melenhorst JJ, Loudon B, Mato AR, et al. Nonviral RNA chimeric antigen receptor-modified T cells in patients with Hodgkin lymphoma. Blood. 2018;132(10):1022–6.
Weng J, Lai P, Qin L, Lai Y, Jiang Z, Luo C, Huang X, Wu S, Shao D, Deng C, et al. A novel generation 1928zT2 CAR T cells induce remission in extramedullary relapse of acute lymphoblastic leukemia. J Hematol Oncol. 2018;11(1):25.
Curran KJ, Margossian SP, Kernan NA, Silverman LB, Williams DA, Shukla N, Kobos R, Forlenza CJ, Steinherz P, Prockop S, et al. Toxicity and response after CD19-specific CAR T-cell therapy in pediatric/young adult relapsed/refractory B-ALL. Blood. 2019;134(26):2361–8.
Ghorashian S, Kramer AM, Onuoha S, Wright G, Bartram J, Richardson R, Albon SJ, Casanovas-Company J, Castro F, Popova B, et al. Enhanced CAR T cell expansion and prolonged persistence in pediatric patients with ALL treated with a low-affinity CD19 CAR. Nat Med. 2019;25(9):1408–14.
Hay KA, Gauthier J, Hirayama AV, Voutsinas JM, Wu Q, Li D, Gooley TA, Cherian S, Chen X, Pender BS, et al. Factors associated with durable EFS in adult B-cell ALL patients achieving MRD-negative CR after CD19 CAR T-cell therapy. Blood. 2019;133(15):1652–63.
Hirayama AV, Gauthier J, Hay KA, Voutsinas JM, Wu Q, Gooley T, Li D, Cherian S, Chen X, Pender BS, et al. The response to lymphodepletion impacts PFS in patients with aggressive non-hodgkin lymphoma treated with CD19 CAR T cells. Blood. 2019;133(17):1876–87.
Hirayama AV, Gauthier J, Hay KA, Voutsinas JM, Wu Q, Pender BS, Hawkins RM, Vakil A, Steinmetz RN, Riddell SR, et al. High rate of durable complete remission in follicular lymphoma after CD19 CAR-T cell immunotherapy. Blood. 2019;134(7):636–40.
Locke FL, Ghobadi A, Jacobson CA, Miklos DB, Lekakis LJ, Oluwole OO, Lin Y, Braunschweig I, Hill BT, Timmerman JM, et al. Long-term safety and activity of axicabtagene ciloleucel in refractory large B-cell lymphoma (ZUMA-1): a single-arm, multicentre, phase 1–2 trial. Lancet Oncol. 2019;20(1):31–42.
Ma F, Ho JY, Du H, Xuan F, Wu X, Wang Q, Wang L, Liu Y, Ba M, Wang Y, et al. Evidence of long-lasting anti-CD19 activity of engrafted CD19 chimeric antigen receptor-modified T cells in a phase I study targeting pediatrics with acute lymphoblastic leukemia. Hematol Oncol. 2019;37(5):601–8.
Schuster SJ, Bishop MR, Tam CS, Waller EK, Borchmann P, McGuirk JP, Jäger U, Jaglowski S, Andreadis C, Westin JR, et al. Tisagenlecleucel in adult relapsed or refractory diffuse large B-Cell lymphoma. N Engl J Med. 2019;380(1):45–56.
Ying Z, Huang XF, Xiang X, Liu Y, Kang X, Song Y, Guo X, Liu H, Ding N, Zhang T. A safe and potent anti-CD19 CAR T cell therapy. Nat Med. 2019;25(6):947–53.
Zhang Q, Hu H, Chen SY, Liu CJ, Hu FF, Yu J, Wu Y, Guo AY. Transcriptome and Regulatory Network Analyses of CD19-CAR-T immunotherapy for B-ALL. Genomics Proteom Bioinf. 2019;17(2):190–200.
Article CAS Google Scholar
Abramson JS, Palomba ML, Gordon LI, Lunning MA, Wang M, Arnason J, Mehta A, Purev E, Maloney DG, Andreadis C, et al. Lisocabtagene maraleucel for patients with relapsed or refractory large B-cell lymphomas (TRANSCEND NHL 001): a multicentre seamless design study. Lancet. 2020;396(10254):839–52.
An F, Wang H, Liu Z, Wu F, Zhang J, Tao Q, Li Y, Shen Y, Ruan Y, Zhang Q, et al. Influence of patient characteristics on chimeric antigen receptor T cell therapy in B-cell acute lymphoblastic leukemia. Nat Commun. 2020;11(1):5928.
Benjamin R, Graham C, Yallop D, Jozwik A, Mirci-Danicar OC, Lucchini G, Pinner D, Jain N, Kantarjian H, Boissel N, et al. Genome-edited, donor-derived allogeneic anti-CD19 chimeric antigen receptor T cells in paediatric and adult B-cell acute lymphoblastic leukaemia: results of two phase 1 studies. Lancet. 2020;396(10266):1885–94.
Cappell KM, Sherry RM, Yang JC, Goff SL, Vanasse DA, McIntyre L, Rosenberg SA, Kochenderfer JN. Long-term Follow-Up of Anti-CD19 chimeric Antigen receptor T-Cell therapy. J Clin Oncol. 2020;38(32):3805–15.
Chen YH, Zhang X, Cheng YF, Chen H, Mo XD, Yan CH, Chen Y, Han W, Sun YQ, Wang Y, et al. Long-term follow-up of CD19 chimeric antigen receptor T-cell therapy for relapsed/refractory acute lymphoblastic leukemia after allogeneic hematopoietic stem cell transplantation. Cytotherapy. 2020;22(12):755–61.
Frey NV, Gill S, Hexner EO, Schuster S, Nasta S, Loren A, Svoboda J, Stadtmauer E, Landsburg DJ, Mato A, et al. Long-term outcomes from a randomized dose optimization study of Chimeric Antigen Receptor Modified T Cells in relapsed chronic lymphocytic leukemia. J Clin Oncol. 2020;38(25):2862–71.
Gu R, Liu F, Zou D, Xu Y, Lu Y, Liu B, Liu W, Chen X, Liu K, Guo Y, et al. Efficacy and safety of CD19 CAR T constructed with a new anti-CD19 chimeric antigen receptor in relapsed or refractory acute lymphoblastic leukemia. J Hematol Oncol. 2020;13(1):122.
Jacobson CA, Hunter BD, Redd R, Rodig SJ, Chen PH, Wright K, Lipschitz M, Ritz J, Kamihara Y, Armand P, et al. Axicabtagene Ciloleucel in the Non-trial setting: outcomes and correlates of response, resistance, and toxicity. J Clin Oncol. 2020;38(27):3095–106.
Liu E, Marin D, Banerjee P, Macapinlac HA, Thompson P, Basar R, Nassif Kerbauy L, Overman B, Thall P, Kaplan M, et al. Use of CAR-Transduced Natural Killer cells in CD19-Positive lymphoid tumors. N Engl J Med. 2020;382(6):545–53.
Nastoupil LJ, Jain MD, Feng L, Spiegel JY, Ghobadi A, Lin Y, Dahiya S, Lunning M, Lekakis L, Reagan P, et al. Standard-of-care Axicabtagene Ciloleucel for relapsed or refractory large B-Cell lymphoma: results from the US Lymphoma CAR T Consortium. J Clin Oncol. 2020;38(27):3119–28.
Pasquini MC, Hu ZH, Curran K, Laetsch T, Locke F, Rouce R, Pulsipher MA, Phillips CL, Keating A, Frigault MJ, et al. Real-world evidence of tisagenlecleucel for pediatric acute lymphoblastic leukemia and non-hodgkin lymphoma. Blood Adv. 2020;4(21):5414–24.
Sesques P, Ferrant E, Safar V, Wallet F, Tordo J, Dhomps A, Karlin L, Brisou G, Vercasson M, Hospital-Gustem C, et al. Commercial anti-CD19 CAR T cell therapy for patients with relapsed/refractory aggressive B cell lymphoma in a European center. Am J Hematol. 2020;95(11):1324–33.
Wang W, Jiang J, Wu C. CAR-NK for tumor immunotherapy: clinical transformation and future prospects. Cancer Lett. 2020;472:175–80.
Zhou X, Tu S, Wang C, Huang R, Deng L, Song C, Yue C, He Y, Yang J, Liang Z, et al. Phase I Trial of Fourth-Generation Anti-CD19 Chimeric Antigen Receptor T Cells against Relapsed or refractory B cell Non-hodgkin Lymphomas. Front Immunol. 2020;11:564099.
Baird JH, Epstein DJ, Tamaresis JS, Ehlinger Z, Spiegel JY, Craig J, Claire GK, Frank MJ, Muffly L, Shiraz P, et al. Immune reconstitution and infectious complications following axicabtagene ciloleucel therapy for large B-cell lymphoma. Blood Adv. 2021;5(1):143–55.
Gauthier J, Bezerra ED, Hirayama AV, Fiorenza S, Sheih A, Chou CK, Kimble EL, Pender BS, Hawkins RM, Vakil A, et al. Factors associated with outcomes after a second CD19-targeted CAR T-cell infusion for refractory B-cell malignancies. Blood. 2021;137(3):323–35.
Iacoboni G, Villacampa G, Martinez-Cibrian N, Bailén R, Lopez Corral L, Sanchez JM, Guerreiro M, Caballero AC, Mussetti A, Sancho JM, et al. Real-world evidence of tisagenlecleucel for the treatment of relapsed or refractory large B-cell lymphoma. Cancer Med. 2021;10(10):3214–23.
Mian A, Wei W, Winter AM, Khouri J, Jagadeesh D, Anwer F, Gerds AT, Dean RM, Sobecks R, Pohlman B, et al. Outcomes and factors impacting use of axicabtagene ciloleucel in patients with relapsed or refractory large B-cell lymphoma: results from an intention-to-treat analysis. Leuk Lymphoma. 2021;62(6):1344–52.
Shah BD, Bishop MR, Oluwole OO, Logan AC, Baer MR, Donnellan WB, O’Dwyer KM, Holmes H, Arellano ML, Ghobadi A, et al. KTE-X19 anti-CD19 CAR T-cell therapy in adult relapsed/refractory acute lymphoblastic leukemia: ZUMA-3 phase 1 results. Blood. 2021;138(1):11–22.
Tan Y, Pan J, Deng B, Ling Z, Song W, Xu J, Duan J, Wang Z, Yu X, Chang AH, et al. Toxicity and effectiveness of CD19 CAR T therapy in children with high-burden central nervous system refractory B-ALL. Cancer Immunol Immunother. 2021;70(7):1979–93.
Wang S, Wang X, Ye C, Cheng H, Shi M, Chen W, Qi K, Wang G, Wu Q, Zeng L, et al. Humanized CD19-targeted chimeric antigen receptor T (CAR-T) cells for relapsed/refractory pediatric acute lymphoblastic leukemia. Am J Hematol. 2021;96(5):E162–5.
Ying Z, Yang H, Guo Y, Li W, Zou D, Zhou D, Wang Z, Zhang M, Wu J, Liu H, et al. Relmacabtagene autoleucel (relma-cel) CD19 CAR-T therapy for adults with heavily pretreated relapsed/refractory large B-cell lymphoma in China. Cancer Med. 2021;10(3):999–1011.
Ying Z, Huang XF, Xiang X, Liu Y, Kang X, Song Y, Guo X, Liu H, Ding N, Zhang T, et al. A safe and potent anti-CD19 CAR T cell therapy. Nat Med. 2019;25(6):947–53.
Dai H, Wang Y, Lu X, Han W. Chimeric Antigen receptors modified T-Cells for Cancer Therapy. J Natl Cancer Inst 2016, 108(7).
Ramos CA, Savoldo B, Torrano V, Ballard B, Zhang H, Dakhova O, Liu E, Carrum G, Kamble RT, Gee AP. Clinical responses with T lymphocytes targeting malignancy-associated κ light chains. J Clin Investig. 2016;126(7):2588–96.
Ying Z, Song Y, Zhu J. Effectiveness and safety of Anti-CD19 chimeric Antigen Receptor-T cell immunotherapy in patients with Relapsed/Refractory large B-Cell lymphoma: a systematic review and Meta-analysis. Front Pharmacol. 2022;13:834113.
Aamir S, Anwar MY, Khalid F, Khan SI, Ali MA, Khattak ZE. Systematic review and Meta-analysis of CD19-Specific CAR-T cell therapy in Relapsed/Refractory Acute Lymphoblastic Leukemia in the Pediatric and Young Adult Population: Safety and Efficacy outcomes. Clin Lymphoma Myeloma Leuk. 2021;21(4):e334–47.
Brudno JN, Kochenderfer JN. Toxicities of chimeric antigen receptor T cells: recognition and management. Blood. 2016;127(26):3321–30.
Campos NSP, Souza BS, Silva G, Porto VA, Chalbatani GM, Lagreca G, Janji B, Suarez ER. Carbonic anhydrase IX: a renewed target for Cancer Immunotherapy. Cancers (Basel) 2022, 14(6).
Li S, Zhang J, Wang M, Fu G, Li Y, Pei L, Xiong Z, Qin D, Zhang R, Tian X, et al. Treatment of acute lymphoblastic leukaemia with the second generation of CD19 CAR-T containing either CD28 or 4-1BB. Br J Haematol. 2018;181(3):360–71.
Cappell KM, Kochenderfer JN. A comparison of chimeric antigen receptors containing CD28 versus 4-1BB costimulatory domains. Nat Rev Clin Oncol. 2021;18(11):715–27.
Download references
The authors sincerely thank all authors of all studies included in this meta-analysis.
N.S.P.C had a Brazilian National Council for Scientific and Technological Development (CNPq) scholarship (143179/2021-7; 140514/2022-8). V.A.P was supported by a Federal University of ABC institutional scholarship (PIC- UFABC). E.R.S had a grant from Sao Paulo Research Foundation/Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP), Brazil, number 2018/17656-0 and 2023/03631-3.
Authors and affiliations.
Centro Universitário FMABC, Santo André, 09060-870, SP, Brazil
Erik Montagna
Center for Natural and Human Sciences, Federal University of ABC, Santo Andre, 09210-580, SP, Brazil
Najla Santos Pacheco de Campos, Victoria Alves Porto, Giselle Correia Próspero da Silva & Eloah Rabello Suarez
Graduate Program in Medicine - Hematology and Oncology, Federal University of São Paulo, São Paulo, 04023-062, SP, Brazil
Najla Santos Pacheco de Campos & Eloah Rabello Suarez
You can also search for this author in PubMed Google Scholar
E.M. performed the statistical analysis, manuscript writing and revision; N.S.P.C performed a literature search, provided data extraction, data clarifications and revised the manuscript; V.A.P and G.C.P.S provided data extraction and revised the manuscript; E.R.S. conceived the study, performed literature search, article selection, and manuscript writing and revision.
Correspondence to Eloah Rabello Suarez .
Ethics approval and consent to participate.
Not applicable.
Competing interests.
The authors declare no competing interests.
Publisher’s note.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Below is the link to the electronic supplementary material.
Rights and permissions.
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ . The Creative Commons Public Domain Dedication waiver ( http://creativecommons.org/publicdomain/zero/1.0/ ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
Reprints and permissions
Cite this article.
Montagna, E., de Campos, N.S.P., Porto, V.A. et al. CD19 CAR T cells for B cell malignancies: a systematic review and meta-analysis focused on clinical impacts of CAR structural domains, manufacturing conditions, cellular product, doses, patient’s age, and tumor types. BMC Cancer 24 , 1037 (2024). https://doi.org/10.1186/s12885-024-12651-6
Download citation
Received : 25 May 2023
Accepted : 16 July 2024
Published : 22 August 2024
DOI : https://doi.org/10.1186/s12885-024-12651-6
Anyone you share the following link with will be able to read this content:
Sorry, a shareable link is not currently available for this article.
Provided by the Springer Nature SharedIt content-sharing initiative
ISSN: 1471-2407
Published on 28.8.2024 in Vol 26 (2024)
Authors of this article:
There are no citations yet available for this article according to Crossref .
Suggestions or feedback?
Press contact :.
Previous image Next image
Legal documents are notoriously difficult to understand, even for lawyers. This raises the question: Why are these documents written in a style that makes them so impenetrable?
MIT cognitive scientists believe they have uncovered the answer to that question. Just as “magic spells” use special rhymes and archaic terms to signal their power, the convoluted language of legalese acts to convey a sense of authority, they conclude.
In a study appearing this week in the journal of the Proceedings of the National Academy of Sciences , the researchers found that even non-lawyers use this type of language when asked to write laws.
“People seem to understand that there’s an implicit rule that this is how laws should sound, and they write them that way,” says Edward Gibson, an MIT professor of brain and cognitive sciences and the senior author of the study.
Eric Martinez PhD ’24 is the lead author of the study. Francis Mollica, a lecturer at the University of Melbourne, is also an author of the paper .
Casting a legal spell
Gibson’s research group has been studying the unique characteristics of legalese since 2020, when Martinez came to MIT after earning a law degree from Harvard Law School. In a 2022 study , Gibson, Martinez, and Mollica analyzed legal contracts totaling about 3.5 million words, comparing them with other types of writing, including movie scripts, newspaper articles, and academic papers.
That analysis revealed that legal documents frequently have long definitions inserted in the middle of sentences — a feature known as “center-embedding.” Linguists have previously found that this kind of structure can make text much more difficult to understand.
“Legalese somehow has developed this tendency to put structures inside other structures, in a way which is not typical of human languages,” Gibson says.
In a follow-up study published in 2023, the researchers found that legalese also makes documents more difficult for lawyers to understand. Lawyers tended to prefer plain English versions of documents, and they rated those versions to be just as enforceable as traditional legal documents.
“Lawyers also find legalese to be unwieldy and complicated,” Gibson says. “Lawyers don’t like it, laypeople don’t like it, so the point of this current paper was to try and figure out why they write documents this way.”
The researchers had a couple of hypotheses for why legalese is so prevalent. One was the “copy and edit hypothesis,” which suggests that legal documents begin with a simple premise, and then additional information and definitions are inserted into already existing sentences, creating complex center-embedded clauses.
“We thought it was plausible that what happens is you start with an initial draft that’s simple, and then later you think of all these other conditions that you want to include. And the idea is that once you’ve started, it’s much easier to center-embed that into the existing provision,” says Martinez, who is now a fellow and instructor at the University of Chicago Law School.
However, the findings ended up pointing toward a different hypothesis, the so-called “magic spell hypothesis.” Just as magic spells are written with a distinctive style that sets them apart from everyday language, the convoluted style of legal language appears to signal a special kind of authority, the researchers say.
“In English culture, if you want to write something that’s a magic spell, people know that the way to do that is you put a lot of old-fashioned rhymes in there. We think maybe center-embedding is signaling legalese in the same way,” Gibson says.
In this study, the researchers asked about 200 non-lawyers (native speakers of English living in the United States, who were recruited through a crowdsourcing site called Prolific), to write two types of texts. In the first task, people were told to write laws prohibiting crimes such as drunk driving, burglary, arson, and drug trafficking. In the second task, they were asked to write stories about those crimes.
To test the copy and edit hypothesis, half of the participants were asked to add additional information after they wrote their initial law or story. The researchers found that all of the subjects wrote laws with center-embedded clauses, regardless of whether they wrote the law all at once or were told to write a draft and then add to it later. And, when they wrote stories related to those laws, they wrote in much plainer English, regardless of whether they had to add information later.
“When writing laws, they did a lot of center-embedding regardless of whether or not they had to edit it or write it from scratch. And in that narrative text, they did not use center-embedding in either case,” Martinez says.
In another set of experiments, about 80 participants were asked to write laws, as well as descriptions that would explain those laws to visitors from another country. In these experiments, participants again used center-embedding for their laws, but not for the descriptions of those laws.
The origins of legalese
Gibson’s lab is now investigating the origins of center-embedding in legal documents. Early American laws were based on British law, so the researchers plan to analyze British laws to see if they feature the same kind of grammatical construction. And going back much farther, they plan to analyze whether center-embedding is found in the Hammurabi Code, the earliest known set of laws, which dates to around 1750 BC.
“There may be just a stylistic way of writing from back then, and if it was seen as successful, people would use that style in other languages,” Gibson says. “I would guess that it’s an accidental property of how the laws were written the first time, but we don’t know that yet.”
The researchers hope that their work, which has identified specific aspects of legal language that make it more difficult to understand, will motivate lawmakers to try to make laws more comprehensible. Efforts to write legal documents in plainer language date to at least the 1970s, when President Richard Nixon declared that federal regulations should be written in “layman’s terms.” However, legal language has changed very little since that time.
“We have learned only very recently what it is that makes legal language so complicated, and therefore I am optimistic about being able to change it,” Gibson says.
Press mentions, fast company.
Researchers at MIT have uncovered a possible reason why legal documents can be so difficult to read, finding that “convoluted legalese often acts as a way to convey authority,” reports Joe Berkowitz for Fast Company . The researchers “tested whether nonlawyers would end up using legalese if asked to write legal documents,” explains Berkowitz. “In the end, all subjects wrote their laws with complex, center-embedded clauses.”
Researchers at MIT have found that the use of legalese in writing “to assert authority over those less versed in such language,” reports Noor Al-Sibai for Futurism . “By studying this cryptic take on the English language, the researchers are hoping to make legal documents much easier to read in the future,” explains Al-Sibai.
Previous item Next item
More mit news.
Read full story →
Massachusetts Institute of Technology 77 Massachusetts Avenue, Cambridge, MA, USA
Executive summary, cash-back transactions, benefits and costs to merchants.
Access to cash is a necessary component of a resilient financial system and dynamic economy. Many people rely on cash for day-to-day transactions due its privacy and reliability, and cash accessibility is particularly critical in the case of a disruption or outage of digital payment systems. While people use various means of getting cash, one common method is to get “cash back” at a store when making a purchase with a debit or prepaid card. This option may be particularly important in banking deserts and in areas where banks and ATM operators charge significant fees. Retailers are essentially filling a void in access to cash, which has historically been supplied by banks and credit unions in an affordable way.
Providing cash back is valuable to consumers and merchants. Survey data show that it is a popular method to get money via consumers’ bank debit or prepaid cards. Merchants offer cash back to attract customers and reduce their cash handling costs. In its recent engagement and market monitoring, the CFPB observed that some retailers charge a fee for this transaction.
This spotlight provides an overview of consumers’ use of cash back, the benefits and costs of such transactions to merchants, and the practices of other market actors which do not charge fees for this service. The CFPB also analyzed the cash-back fees of a sample of national retailers.
Fees for cash back may serve as a barrier and reduce people’s access to cash when they need it. The CFPB will continue to monitor developments related to the fees consumers pay for accessing cash, and the underlying failure of banks and credit unions to adequately supply cash throughout the country in an affordable manner.
This section summarizes the importance of cash availability and the use of cash-back as an access point for consumers.
Cash is a critical part of a resilient payment ecosystem. Surveys show people still try to have cash on hand 1 and nearly 90 percent of people used cash in the last 30 days. 2 Cash accessibility is necessary should other types of digital payment systems experience failures, 3 such as in the event of a natural disaster or some other catastrophe, 4 or a technological malfunction at a single company. 5 Additionally, some populations are more reliant on cash than others for day-to-day transactions. For example, cash is more frequently used by people with lower incomes, racial minorities, and older Americans than other populations. 6 As discussed below, cash back is a common method for obtaining cash for many consumers.
Consumers may obtain cash during the completion of a purchase transaction at certain stores when using a PIN-authenticated debit card or prepaid card at the register. Some merchants also provide cash back at self-service registers. Consumers typically must choose from pre-set withdrawal amount options presented at the payment terminal at the time of the transaction. In a cash-back transaction, consumers are usually limited to a maximum withdrawal amount ranging from $5 to $50, though some merchants may allow higher amounts.
CFPB analysis of data from the Diary and Survey of Consumer Payment Choice (Survey) found that from 2017 to 2022, cash withdrawals at retail locations made up 17 percent of all transactions by which people got cash from their checking account, savings account, or prepaid card. As shown in Figure 1, cash withdrawals at retail are second only to ATMs (61%) and more frequently used than bank tellers (14%). The Survey and methodology are discussed in the Tables and Notes section .
Source : CFPB tabulations of the Diary and Survey of Consumer Payment Choice.
The Survey data also show that from 2017 to 2022, cash withdrawals at a retail location (restricted to those where the source of funds was the consumer’s checking, savings, or a prepaid card) had a mean withdrawal amount of $34 (median: $20). 7 By contrast, during this same timeframe, the mean ATM withdrawal among survey participants was $126 (median: $100). 8 A study by researchers at the Federal Reserve Bank of Atlanta utilizing Survey data found that cash withdrawals at a retail store had the lowest average amount of cash withdrawal, and noted that “[t]he amount of cash received at a retail store is constrained by the store’s limits, so the amount of cash received in this way is not necessarily at the discretion of the consumer.” 9
Cash back may serve as a particularly important point of access in the absence of other banking services. A 2014 study by the Federal Reserve Bank of Richmond analyzed cash-back transactions from a national discount retail chain from 2010 to 2012. 10 Looking specifically at the Richmond bank’s district, the area with the highest frequency of cash-back transactions was in the southeastern region of South Carolina, an area “that has been subject to ‘persistent poverty’” and “has some of the sparsest dispersion of bank branches.” 11 The study also illustrated the lucrative nature of cash-back fees: During the course of this study period, the merchant introduced a fee for cash back. Data from this report indicates that the retailer collected approximately $21 million in cash-back fees in a year. 12
Merchants benefit from offering cash back at point-of-sale. First, the service may attract potential shoppers, either people making a purchase in order to get cash back or people who prefer one retail location over another in order to conveniently combine tasks. Second, it reduces merchants’ cash handling costs. 13 Dispensing cash to consumers, such as through cash-back transactions, reduces merchants’ supply of cash and therefore also reduces their cost of handling, transporting, and depositing excess cash.
Merchants incur costs for processing any type of payment transaction, including cash-back transactions. On any purchase using an electronic payment method, including a PIN-authorized debit-card or prepaid card, a merchant will incur a range of fees for processing that payment, such as interchange, network, and processing fees. While the merchant incurs these fees for a consumer’s purchase, there is an additional cost for providing cash back to the consumer.
To assess this additional transaction cost to the merchant for providing cash back, the CFPB modeled potential scenarios based on publicly available data and our market monitoring activities. The model incorporates estimates of merchant-incurred fees, such as interchange, network, processing, and fraud control fees. Methodology is discussed in detail in the Table and Figure Notes. The CFPB estimates that the additional marginal transactional cost to a merchant for processing a typical cash-back debit card transaction may range from a penny to about 20 cents (Table 1).
Example Retailer | Purchase Amount | Merchant Transaction Cost for Purchase Only | Additional Merchant Cost for $10 Cash Back | Additional Merchant Cost for $40 Cash Back |
---|---|---|---|---|
National Discount Chain | $20 | $0.33 | $0.05 | $0.19 |
National Grocery Store | $20 | $0.33 | $0.01 | $0.02 |
Source : CFPB calculations based on public data about industry practices and averages. See Table and Figure Notes below for methodology .
This section provides an analysis of cash-back fee practices of eight national retail chains. It includes a discussion of the variation of these practices among these national chains and other actors, such as local independent grocers. The analysis is supplemented by market monitoring discussions with merchants about fees, costs, and consumer trends, both among merchants who charge cash back fees and those who do not. The CFPB also conducted consumer experience interviews and reviewed consumer complaints submitted to the CFPB. It concludes with a discussion of how these fees appear to function differently than fees for cash withdrawals at ATMs.
As of August 2024, there is no publicly available survey data regarding merchants’ cash-back practices or fees. To establish a baseline, the CFPB documented the fee practices of eight large retail companies. The sample consists of the two largest retail actors, measured by number of locations, across four different sectors: Dollar Stores, Grocery Stores, Drugstores, and Discount Retailers. 14 Using this approach, the eight retailers sampled are: Dollar General and Dollar Tree Inc. (Dollar Stores), Kroger Co. and Albertsons Companies (Grocery Stores), Walgreens and CVS (Drugstores), and Walmart and Target (Discount Retailers).
All retailers in our sample offer cash-back services, but only Dollar General, Dollar Tree Inc., and Kroger Co. brands charge a fee. Other retailers offer cash-back for free, even for withdrawal amounts similar to or larger than those provided by the three retailers who charge. (Table 2). Among the national chains that charge these cash-back fees, the CFPB estimates that they collect over $90 million in fees annually for people to access their cash. 15
Company | U.S. Stores | Fee for Cash Back | Maximum Withdrawal Amount (Per Transaction) |
---|---|---|---|
Dollar General | 20,022 | $1 to $2.50, depending on amount and other variables | $40 |
Dollar Tree Inc. | 16,278 | Family Dollar: $1.50 | $50 |
Kroger Co. | 2,722 | Harris Teeter brand: | Harris Teeter brand: $200 |
Albertsons Brand | 2,271 | No | $200 |
Walmart | 5,214 | No | $100 |
Target | 1,956 | No | $40 |
Walgreens | 8,600 | No | $20 |
CVS | 7,500 | No | $60 |
Source : CFPB analysis of the retail cash-back market. See Table and Figure Notes for methodology .
Beyond these national chains, there are other providers offering cash back as a free service to their customers. Through its market monitoring activities, the CFPB observed that many local independent grocers offer the service, but do not charge a fee. They do not charge a fee even though they are likely to have thinner profit margins and less bargaining power than national chains to negotiate on pricing on costs they incur from wholesalers or fees for payment processors. The U.S. Postal Service also offers cash back on debit transactions, in increments of $10 up to a $50 maximum, free of charge. 16
Among the merchants sampled, Dollar General and Dollar Tree Inc. charge the highest fees for withdrawal amounts under $50. These fees combined with the constrained withdrawal amount may mean that the fee takes up a hefty percentage relative to the amount of cash withdrawn, and people may be less able to limit the impact of the fee by taking out more cash.
Additionally, the geographic distribution of dollar store chains and their primary consumer base raises concerns that these fees may be borne by economically vulnerable populations and those with limited banking access. Dollar stores are prevalent in rural communities, low-income communities, and communities of color – the same communities who may also face challenges in accessing banking services. 17 For example, Dollar General noted that in 2023 “approximately 80% of [its] stores are located in towns of 20,000 or fewer people,” 18 while Dollar Tree Inc. operated at least 810 dual-brand combination stores (Family Dollar and Dollar Tree in a single building) designed specifically “for small towns and rural communities…with populations of 3,000 to 4,000 residents.” 19
Though they are open to and serve consumers of all income levels, dollar stores report that they locate stores specifically to serve their core customers: lower-income consumers. 20 In urban communities, one study shows, “proximity to dollar stores is highly associated with neighborhoods of color even when controlling for other factors.” 21 These same communities may also face challenges in accessing banking services. Low-income communities and communities of color often face barriers to access to banking services, and rural communities are 10 times more likely to meet the definition of a banking desert than urban areas. 22
Though the dollar store concept existed as far back as the 1950s, it has experienced significant expansion and consolidation since the 2000s. 23 Dollar Tree Inc. acquired Family Dollar in 2015. 24 From 2018 to 2021, nearly half of all retail locations opened in the U.S. were dollar stores. 25 In research examining the impact of dollar store expansion, studies indicate that the opening of a dollar store is associated with the closure of nearby local grocery retailers. 26
In its scan of current market practices, the CFPB found variations in fee charges among store locations and brands owned by the same company. For example, as reflected in Table 2, Dollar Tree charges consumers $1 for cash back at Dollar Tree branded stores, but $1.50 in its Family Dollar stores. Similarly, Kroger Co. has two different fee tiers for its brands. In 2019, Kroger Co. rolled out a $0.50 cash-back fee for amounts of $100 or less, and $3.50 for amounts between $100 and $300. This took effect at brands such as Kroger, Fred Meyers, Ralph’s, QFC, Pick ‘N Save, and others. At the time of the rollout, the company noted two exceptions: Electronic benefits transfer (EBT) card users would not be charged a fee, and customers using their Kroger Plus card would not be charged for amounts under $100 but would be charged $0.50 for larger amounts. Kroger Co. acquired the southern grocery chain Harris Teeter in 2014, but it did not begin charging a cash-back fee at those stores until January 2024, at $0.75 for amounts of $100 or less, and $3 for larger amounts. 27
In its engagement with stakeholders, the CFPB learned that Dollar General’s fees appeared to vary in different locations. To better understand this potential variation, in December 2022, the CFPB mystery shopped at nine locations in one state, across a mix of rural, suburban, and urban communities. The CFPB acknowledges this is a small sample and is not intended to be representative. The data collected is based on the knowledge of the store associates at the time of each interaction.
In these findings, the CFPB learned of a range of fee variations across store locations: five of the nine respondents noted that the fee varies depending on the type of card used for the transaction. When probed for the meaning of “type of card,” most noted that it is dependent on the customer’s bank, though it is not exactly clear what fees will be triggered by what card type prior to initiating the transaction. Additionally, reported fees range from $1 to $2.50, with some stores reporting a flat fee structure of $1.50 and others reporting a range that tiered up with larger withdrawal amounts (with a cap of withdrawal amounts at $40). Most stores in this sample had a range of fees between $1.00 and $1.50, although two stores located in small, completely rural counties had a higher range of fees. The store located in the smallest and most isolated county within the sample, with only about 3,600 people, had the highest reported fee amount of $2.50.
One of the market dynamics likely contributing to retailers’ ability to charge these fees is the high fees also charged to consumers for using out-of-network automated teller machines (ATMs). One source estimates that the average out-of-network ATM fee is $4.77, accounting for both the surcharge fee charged by the ATM owner and the foreign fee charged by the consumer’s financial institution. 28 By comparison, a $2 fee for cash back at a retailer may appear cheaper, and usually does not trigger an additional fee by the consumers’ financial institution or prepaid card issuer. Notwithstanding the high ATM fees, there are reasons for focused attention on the consumer risk of cash-back fees charged by retailers, primarily the amount of the fee relative to the value of the cash withdrawal and the distribution of the fee burden across income groups.
In a typical ATM transaction, a consumer has a greater ability to distribute the cost of the fee across a larger amount of cash than with cash back. There may be some exceptions to this for consumers who have only $10 or $20 in their bank account, but as shown in Table 3, low-income consumers and others withdraw greater amounts at ATMs than via cash-back, on average. In cash-back transactions, lower withdrawal limits are in place, and consumers do not have that option to withdraw larger amounts. CFPB analysis of the Diary and Survey of Consumer Payment Choice from 2017 to 2022 show that even among consumers with incomes below $50,000, the amount withdrawn at an ATM is more than double the typical cash-back withdrawal amount. Additionally, for the average and median amounts, across all incomes the ATM withdrawal amounts are larger than cash-back withdrawal amounts. (Table 3).
Income | Average ATM Withdrawal | Average Cash-back Withdrawal | Median ATM Withdrawal | Median Cash-back Withdrawal |
---|---|---|---|---|
Less than $25,000 | $144 | $45 | $65 | $20 |
$25,000 to $49,999 | $113 | $35 | $60 | $25 |
$50,000 to $74,999 | $113 | $29 | $84 | $20 |
$75,000 to $99,000 | $114 | $45 | $100 | $26 |
$100,000 or more | $146 | $33 | $100 | $20 |
|
|
|
|
|
Source: CFPB tabulations of the Diary and Survey of Consumer Payment Choice. See Table and Figure Notes for methodology .
Further, while merchants limit the amount of a single withdrawal, there is no limit on the number of withdrawals. So, if a consumer needs $100 cash at a store which limits a single withdrawal to a maximum amount of $50 with a $2 fee, the consumer would have to make two $50 withdrawals for a $4 fee plus the cost of any otherwise unwanted purchase required to access the cash-back service.
Finally, the burden of cash-back fees may be distributed differently than ATM fee burdens. The share of borrowers who pay ATM fees for cash withdrawals is relatively evenly distributed across income levels, according to a study based on the Diary and Survey of Consumer Payment Choice. 29 The study found little variation in the percentage of consumers who encountered a fee for an ATM cash withdrawal by income quintile, though the study did not look at the amount of the ATM fees paid. Analogous data are not available for cash-back fees, but a similarly even distribution across incomes is unlikely given the demographics of the consumer base served by the largest retailers which charge fees (dollar stores).
While the use of digital payment methods is on the rise, cash accessibility remains a critical component of a resilient financial infrastructure and dynamic economy. Bank mergers, branch closures, and bank fee creep have reduced the supply of free cash access points for consumers. In this void, people may be more reliant on retailers for certain financial services historically provided by banks and credit unions, such as cash access. In this context, we observe that some retailers provide cash back as a helpful service to their customers, while other retailers may be exploiting these conditions by charging fees to their consumers for accessing their cash.
This spotlight examines the presence of retailer cash-back fees and impact to consumers. Cash-back fees are being levied by just a small handful of large retail conglomerates (Dollar General, Dollar Tree Inc., and Kroger Co.) amidst a backdrop of consolidation in these segments. Meanwhile, other larger retailers continue to offer cash-back services free. The CFPB estimates cash-back fees cost consumers about $90 million a year.
The CFPB is concerned that reduced access to cash undermines the resilience of the financial system and deprives consumers of a free, reliable, and private means of engaging in day-to-day transactions. The CFPB will continue to monitor developments related to the fees consumers pay for accessing cash, and work with agencies across the federal government to ensure people have fair and meaningful access to the money that underpins our economy.
Notes for figure 1.
The Federal Reserve Bank of Atlanta’s annual Diary and Survey of Consumer Payment Choice (Survey) tracks consumers’ self-reported payment habits over a three-day period in October using a nationally representative sample. The survey includes a question about whether and how consumers access cash, such as where they made the withdrawal, the source of the cash, and the amount of the withdrawal. Figure 1 provides a percentage of all cash-back withdrawal transactions from a bank account, checking account, or prepaid card reported between 2017 and 2022, by location (ATM, Retail point-of-sale, Bank teller, and Other). The number of observations during this time is 192 transactions. It does not include cash-back transactions made using a credit card cash advance feature or other form of credit.
This model assumes that 80 percent of the merchant transaction cost is due to interchange fees, 15 percent due to network fees, and 5 percent due to payment acquirer fees. It also includes a $0.01 fee for fraud protection. For regulated transactions, the interchange fees are $0.22 + 0.05% of the transaction amount. Regulated transactions are those where the debit card used is issued by a bank with more than $10 billion in assets, and subject to 15 U.S.C. § 1693o-2. Exempt transactions are those not subject to this statutory cap on interchange fees. While Mastercard does not publish its fees for exempt transactions, Visa does. This model uses Visa’s published fees as of October 2023 for card-present transactions: for the National Discount Chain, the fees for Exempt Retail Debit ($0.15 + 0.80%), and for the National Grocery Chain, Exempt Supermarket Debit ($0.30 flat fee). An October 2023 Federal Reserve report on interchange fee revenue found that in 2021, the most recent data available, 56.21 percent of debit transactions were regulated and 43.79 percent were exempt. This composition is reflected in the table.
The storefront counts for each of the retailers come from their websites, last visited on March 28, 2024, or their most recent reports to investors. Fee information was gathered either through publicly available information such as the merchant’s website, and/or verified through the CFPB’s market monitoring activities.
Dollar Tree Inc. announced on March 13, 2024 that it will close 1,000 of its Family Dollar and Dollar Tree brands stores over the course of the year. If those closures occur, Dollar Tree, Inc. will still have over 15,000 storefronts across the country.
In October 2022, Kroger Co. and Albertsons Companies announced their proposal to merge, though on February 26, 2024, the Federal Trade Commission and nine state attorneys general sued to block this proposal, alleging that the deal is anti-competitive. On April 22, 2024, Kroger Co. and Albertsons Companies announced a revised plan in which, if the merger is approved, the combined entity would divest 579 stores to C&S Wholesalers. If the divestiture occurs, the combined entity will still have over 4,400 stores across the country.
See above notes for Figure 1 about the Diary and Survey of Consumer Payment Choice (Survey). Table 3 provides mean and median amounts of ATM and Retail point-of-sale cash withdrawal transactions by income. In the Survey, participants were asked to report the total combined income of all family members over age 15 living in the household during the past 12 months. From these responses, we constructed five income brackets – four of $25,000 each plus a fifth bin for any respondents reporting more than $100,000 in annual household income for each respondent in each year.
See e.g., Jay Lindsay, A Fatal Cash Crash? Conditions Were Ripe for It After the Pandemic Hit, but It Didn’t Happen , Fed. Rsrv. Bank of Boston (Nov. 2, 2023), https://www.bostonfed.org/news-and-events/news/2023/11/cash-crash-pandemic-increasing-credit-card-use-diary-of-consumer-payment-choice.aspx
Kevin Foster, Claire Greene, & Joanna Stavins, The 2023 Survey and Diary of Consumer Payment Choice , Fed. Rsrv Bank of Atlanta (June 2024), https://doi.org/10.29338/rdr2024-01
See e.g., Hilary Allen, Payments Failure, Boston College Law Review, Forthcoming, American University, WCL Research Paper No. 2021- 11, (Feb. 21, 2020) available at https://papers.ssrn.com/sol3/papers.cfm?abstract_id=3539797
See e.g., Scarlett Heinbuch, Cash Is Critical in Times of Crisis , Fed. Rsrv. Bank of Atlanta (Mar. 7, 2022), https://www.atlantafed.org/blogs/take-on-payments/2022/03/07/cash-in-crisis
See e.g., Carly Page, Square Says It Has Resolved Daylong Outage , TechCrunch, (Sept. 8, 2023), https://techcrunch.com/2023/09/08/square-day-long-outage-resolved/ . See also Caroline Haskins, The Global CrowdStrike Outage Triggered a Surprise Return to Cash , Wired, (July 19, 2024), https://www.wired.com/story/microsoft-crowdstrike-outage-cash/ .
See Berhan Bayeh, Emily Cubides and Shaun O’Brien, 2024 Findings from the Diary of Consumer Payment Choice , Fed. Rsrv. (May 13, 2024), https://www.frbservices.org/binaries/content/assets/crsocms/news/research/2024-diary-of-consumer-payment-choice.pdf (findings related to low-income consumers and older Americans use of cash); Emily Cubides and Shaun O’Brian, 2023 Findings from the Diary of Consumer Payment Choice , Fed. Rsrv., (May 19, 2024), https://www.frbsf.org/cash/wp-content/uploads/sites/7/2023-Findings-from-the-Diary-of-Consumer-Payment-Choice.pdf (findings related to unbanked households use of cash), and Michelle Faviero, , More Americans are Joining the ‘Cashless’ Economy ,” Pew Rsch. Ctr, (Oct. 5, 2022), https://www.pewresearch.org/short-reads/2022/10/05/more-americans-are-joining-the-cashless-economy/ (findings related to use of cash by race and other demographics).
Similarly, the average cash-back withdrawal amount was $33 in 2012, the most recent data available from the Federal Reserve Payments Study. The study was based on self-reported information from financial institutions surveyed by the Federal Reserve. Of the reported transactions, 73 percent were debit cards with an average amount of $33 and 27 percent on general purpose prepaid cards with an average withdrawal amount of $19. 2013 Federal Reserve Payments Study: Recent and Long-Term Payment Trends in the United States: 2003 – 2012 , Fed. Rsrv. Bd. (July 2014), https://www.frbservices.org/binaries/content/assets/crsocms/news/research/2013-fed-res-paymt-study-summary-rpt.pdf
The amounts in the Survey are lower than the average ATM withdrawal amounts reported in 2022 Federal Reserve Payments study, which utilizes data from surveying financial institutions. Per this study, in 2021, the average ATM withdrawal was $198. The Federal Reserve Payments Study: 2022 Triennial Initial Data Release , Fed. Rsrv. Bd. (Apr. 21, 2023), https://www.federalreserve.gov/paymentsystems/fr-payments-study.htm
Claire Green and Oz Shy, How Consumers Get Cash: Evidence from a Diary Survey , Fed. Rsrv. Bank of Atlanta, (Apr. 2019), at 5, https://www.atlantafed.org/-/media/documents/banking/consumer-payments/research-data-reports/2019/05/08/how-consumers-get-cash-evidence-from-a-diary-survey/rdr1901.pdf (finding, “For the largest amounts of cash, respondents mostly turned to employers, with an average dollar value of cash received of $227. At bank tellers and ATMs, consumers also received average dollar values greater than the overall average: $159 and $137, respectively. Consumers received smaller amounts from family or friends ($93) and, notably, cash back at a retail store ($34). All these dollar amounts are weighted. The amount of cash received at a retail store is constrained by the store’s limits, so the amount of cash received in this way is not necessarily at the discretion of the consumer.”)
Neil Mitchell and Ann Ramage, The Second Participant in the Consumer to Business Payments Study , Fed. Rsrv. Bank of Richmond (Sept. 15, 2014), https://www.richmondfed.org/~/media/richmondfedorg/banking/payments_services/understanding_payments/pdf/psg_ck_20141118.pdf
Id. at 8, Figures 7 and 8.
See e.g., Stan Sienkiewicz, The Evolution of EFT Networks from ATMs to New On-Line Debit Payment Products , Discussion Paper, Payment Cards Ctr. of the Fed. Rsrv. Bank of Philadelphia (Apr. 2002), https://www.philadelphiafed.org/-/media/frbp/assets/consumer-finance/discussion-papers/eftnetworks_042002.pdf?la=en&hash=88302801FC98A898AB167AC2F9131CE1 (“The cash back option became popular with supermarket retailers, since store owners recognized savings as a result of less cash to count at the end of the day, a chore that represented a carrying cost to the establishment.”).
These market segments and retailers for purposes of markets analysis are similar to those used in other academic literature related to dollar store locations in the context of food access or impact on other market dynamics, such as on local grocers. See e.g., El Hadi Caoui, Brett Hollenbeck, and Matthew Osbourne, The Impact of Dollar Store Expansion on Local Market Structure and Food Access ,” (June 22, 2022), available at https://ssrn.com/abstract=4163102 (finding "In 2021, there were more of these stores operating than all the Walmarts, CVS, Walgreens, and Targets combined by a large margin.”) and Yue Cao, The Welfare Impact of Dollar Stores ,” available at https://yuecao.dev/assets/pdf/YueCaoDollarStore.pdf (last visited Aug. 23, 2024) (using the categories of dollar stores, groceries, and mass merchandise (such as Walmart) for comparisons across retail segments and noting that dollar stores regard these other segments as competitors).
Estimate based on information voluntarily provided in the CFPB's market monitoring activities.
What Forms of Payment are Accepted? U.S. Postal Serv., https://faq.usps.com/s/article/What-Forms-of-Payment-are-Accepted (last visited Aug. 23, 2024).
See generally, Stacy Mitchell, Kennedy Smith, and Susan Holmberg , The Dollar Store Invasion , Inst. for Local Self Reliance (Mar. 2023), https://cdn.ilsr.org/wp-content/uploads/2023/01/ILSR-Report-The-Dollar-Store-Invasion-2023.pdf . There is also extensive research on dollar store locations in other contexts such as food access and impact on consumer spending habits. El Hadi Caoui, Brett Hollenbeck, and Matthew Osbourne, The Impact of Dollar Store Expansion on Local Market Structure and Food Access ,” at 5, (June 22, 2022), available at https://ssrn.com/abstract=4163102
Dollar General Annual Report (Form10-K) at 7 (Mar. 25. 2024), https://investor.dollargeneral.com/websites/dollargeneral/English/310010/us-sec-filing.html?format=convpdf&secFilingId=003b8c70-dfa4-4f21-bfe7-40e6d8b26f63&shortDesc=Annual%20Report .
Dollar Tree, Inc. Annual Report (Form 10-K) at 7 (Mar. 20. 2024), https://corporate.dollartree.com/investors/sec-filings/content/0000935703-23-000016/0000935703-23-000016.pdf
See e.g., Dollar General Annual Report (Form10-K) at 7 (Mar. 25. 2024) (“We generally locate our stores and plan our merchandise selections to best serve the needs of our core customers, the low and fixed income households often underserved by other retailers, and we are focused on helping them make the most of their spending dollar.” And, Dollar Tree, Inc. Annual Report (Form 10-K) at 6 (Mar. 20. 2024), (“Family Dollar primarily serves a lower than average income customer in urban and rural locations, offering great values on everyday items.”)
Dr. Jerry Shannon, Dollar Stores, Retailer Redlining, and the Metropolitan Geographies of Precarious Consumption , Ann. of the Am. Assoc. of Geographers, Vol. 111, No. 4, 1200-1218 (2021), (analyzing over 29,000 storefront locations of Dollar General, Dollar Tree, and Family Dollar locations across the three largest MSA in each of the nine U.S. Census Bureau-defined divisions.)
Kristen Broady, Mac McComas, and Amine Ouazad, An Analysis of Financial Institutions in Black-Majority Communities: Black Borrowers and Depositors Face Considerable Challenges in Accessing Banking Services ,” Brookings Inst., (Nov. 2, 2021), https://www.brookings.edu/articles/an-analysis-of-financial-institutions-in-black-majority-communities-black-borrowers-and-depositors-face-considerable-challenges-in-accessing-banking-services/ and Drew Dahl and Michelle Franke, Banking Deserts Become a Concern as Branches Dry Up , Fed. Rsrv. Bank of St. Louis, https://www.stlouisfed.org/publications/regional-economist/second-quarter-2017/banking-deserts-become-a-concern-as-branches-dry-up (July 25, 2017).
El Hadi Caoui, Brett Hollenbeck, and Matthew Osbourne, The Impact of Dollar Store Expansion on Local Market Structure and Food Access ,” (June 22, 2022), available at https://ssrn.com/abstract=4163102 .
Dollar Tree Completes Acquisition of Family Dollar , Dollar Tree Inc., (July 6, 2015), available at https://corporate.dollartree.com/news-media/press-releases/detail/120/dollar-tree-completes-acquisition-of-family-dollar
El Hadi Caoui, Brett Hollenbeck, and Matthew Osbourne, The Impact of Dollar Store Expansion on Local Market Structure and Food Access ,” (June 22, 2022), available at https://ssrn.com/abstract=4163102 and Yue Cao, The Welfare Impact of Dollar Stores, https://yuecao.dev/assets/pdf/YueCaoDollarStore.pdf (last visited Aug. 23. 2024).
Evan Moore, Harris Teeter Introduces New Fees that Have Customers Upset. What To Know Before You’re Charged , Charlotte Observer, (Mar. 14, 2024), https://amp.charlotteobserver.com/news/business/article286627340.html
Karen Bennett and Matthew Goldberg, Survey: ATM fees Reach 26-year High While Overdraft Fees Inch Back Up , Bankrate.com (Aug. 21, 2024), https://www.bankrate.com/banking/checking/checking-account-survey/
Oz Shy and Joanna Stavins, Who Is Paying All These Fees? An Empirical Analysis of Bank Account and Credit Card Fees , Fed. Rsrv. Bank of Boston, Working Paper No. 22-18, at Table 2, (Aug. 2022), https://www.bostonfed.org/publications/research-department-working-paper/2022/who-is-paying-all-these-fees-an-empirical-analysis-of-bank-account-and-credit-card-fees .
A semi-vapor electrolysis technology for hydrogen generation from wide water resources.
Cost-effective and scalable green hydrogen production from water electrolysis is crucial to achieve a net-zero emission future. Progress on water electrolysis technologies has long been made towards materials design and device assembly optimization to improve cost effectiveness. However, expensive iridium-based electrocatalyst, pure water feedstock, low current density, and energy efficiency limit state-of-the-art water electrolysis, i.e., alkaline and polymer exchange membrane water electrolyzers based on liquid-water feeding for large-scale implementation. Here we propose a new semi-vapor electrolysis (SVE) system for cost-effective hydrogen generation that adopts low temperature vapor electrolysis at the anode while maintains liquid water circulation at the cathode. The SVE process requires no additional energy input as compared to conventional liquid water electrolysis while enabling the direct use of a wide range of water resources without pre-treatment. In addition, much cheaper and intrinsically more active ruthenium oxide can be used under the conditions of SVE by avoiding dissolution, which achieves an ultrahigh current density of 4.67 A cm-2 at 1.8 V and superior stability under 1.0 A cm-2 operation. Techno-economic assessment suggests significant hydrogen cost reduction due to the improved energy efficiency, reduced material cost and simplified system, and by-product profits, demonstrating the scalability of the as-proposed SVE.
Download citation, permissions.
J. Tang, K. Guo, D. Guan, Y. Hao and Z. Shao, Energy Environ. Sci. , 2024, Accepted Manuscript , DOI: 10.1039/D4EE02722A
To request permission to reproduce material from this article, please go to the Copyright Clearance Center request page .
If you are an author contributing to an RSC publication, you do not need to request permission provided correct acknowledgement is given.
If you are the author of this article, you do not need to request permission to reproduce figures and diagrams provided correct acknowledgement is given. If you want to reproduce the whole article in a third-party publication (excluding your thesis/dissertation for which permission is not required) please go to the Copyright Clearance Center request page .
Read more about how to correctly acknowledge RSC content .
Search articles by author.
This article has not yet been cited.
COMMENTS
However, a review of the literature reveals the lack of original studies on hypothesis generation in clinical research. The authors then summarize their first human participant study exploring data-driven hypothesis generation by clinical researchers in a simulated setting.
Hypothesis generation is a key step in data science projects. Here's a case study on hypotheis generation for data science.
Hypothesis testing is a formal procedure for investigating our ideas about the world using statistics. It is most often used by scientists to test specific predictions, called hypotheses, that arise from theories.
Formulating Hypotheses for Different Study Designs. Generating a testable working hypothesis is the first step towards conducting original research. Such research may prove or disprove the proposed hypothesis. Case reports, case series, online surveys and other observational studies, clinical trials, and narrative reviews help to generate ...
The hypothesis generation methods mentioned above are mainly techniques included in data analysis such as data mining, machine learning, and multivariate analysis.
Hypothesis testing is a common statistical tool used in research and data science to support the certainty of findings. The aim of testing is to answer how probable an apparent effect is detected by chance given a random data sample. This article provides a detailed explanation of the key concepts in Frequentist hypothesis testing using ...
What is Hypothesis Generation? Hypothesis generation involves making informed guesses about various aspects of a business, market, or problem that need further exploration and testing. It's a crucial step while applying the scientific method to business analysis and decision-making.
The novel methods and technologies proposed in Hypothesis Generation and Interpretation are supported by the incorporation of historical perspectives on science and an emphasis on the origin and development of the ideas behind their design principles and patterns.
While hypothesis testing is a highly formalized activity, hypothesis generation remains largely informal. We propose a systematic procedure to generate novel hypotheses about human behavior, which uses the capacity of machine learning algorithms to notice patterns people might not.
A hypothesis is a statement that can be tested by scientific research. If you want to test a relationship between two or more variables, you need to write hypotheses before you start your experiment or data collection.
Hypothesis testing, then, is a statistical means of testing an assumption stated in a hypothesis. While the specific methodology leveraged depends on the nature of the hypothesis and data available, hypothesis testing typically uses sample data to extrapolate insights about a larger population.
Hypothesis Testing vs Hypothesis Generation In the world of Data Science, there are two parts to consider when putting together a hypothesis. Hypothesis Testing is when the team builds a strong hypothesis based on the available dataset. This will help direct the team and plan accordingly throughout the data science project.
As mentioned in this article, Hypothesis generation is an educated "guess" of various factors that are impacting the business problem that needs to be solved using machine learning. In short, you are making wise assumptions as to how certain factors would affect our target variable and in the process that follows, you try to prove and ...
Explore hypothesis testing, a fundamental method in data analysis. Understand how to use it to draw accurate conclusions and make informed decisions.
Hypothesis generation is the formation of guesses as to what the segment of code does; this step can also guide a re- segmentation of the code. Finally, verification is the process of examining the code and associated documentation to determine the consistency of the code with the current hypotheses.
Hypothesis testing is the process that an analyst uses to test a statistical hypothesis. The methodology depends on the nature of the data used and the reason for the analysis.
Hypothesis testing is an important procedure in statistics. Hypothesis testing evaluates two mutually exclusive population statements to determine which statement is most supported by sample data. When we say that the findings are statistically significant, thanks to hypothesis testing.
The Hypothesis Generator is powered by advanced AI models. These models analyze your research question and dataset description, and use their understanding of language, context, and research methodology to generate null and alternative hypotheses that are clear, concise, and testable.
Abstract To correct a common imbalance in methodology courses, focusing almost entirely on hypothesis-testing issues to the neglect of hypothesis-generating issues which are at least as important, 49 creative heuristics are described, divided into 5 categories and 14 subcategories. Each of these heuristics has often been used to generate hypotheses in psychological research, and each is ...
A hypothesis is a tentative statement about the relationship between two or more variables. Explore examples and learn how to format your research hypothesis.
Research begins with a research question and a research hypothesis. But what are the characteristics of a good hypothesis? In this article, we dive into the types of research hypothesis, explain how to write a research hypothesis, offer research hypothesis examples and answer top FAQs on research hypothesis. Read more!
This meta-analysis is a critical hypothesis-generating instrument, capturing effects in the CD19 CAR T cells literature lacking randomized clinical trials and large observational studies. CD19-targeted chimeric antigen receptors (CAR) T cells are one of the most remarkable cellular therapies for managing B cell malignancies. However, long-term ...
Objective: We aimed to collect detailed, hypothesis-generating information about expectations and preferences for cessation digital treatment among older adults who smoke cigarettes. Methods: Semistructured interviews were conducted with adults aged 65+ years currently smoking or who had quit within the past month.
To comprehensively profile complex single-molecule dynamics at the library scale, we integrated high-throughput single-molecule fluorescence resonance energy transfer (smFRET) microscopy (41-44) with Illumina next-generation sequencing (NGS) (45, 46) ().To immobilize a library of FRET constructs on the surface of an Illumina MiSeq flow cell (), we used a 5′ single-stranded DNA overhang and ...
That analysis revealed that legal documents frequently have long definitions inserted in the middle of sentences — a feature known as "center-embedding." Linguists have previously found that this kind of structure can make text much more difficult to understand. ... One was the "copy and edit hypothesis," which suggests that legal ...
According to our latest Preliminary Monthly Electric Generator Inventory, developers and power plant owners added 20.2 gigawatts (GW) of utility-scale electric generating capacity in the United States during the first half of 2024.This new capacity is 3.6 GW (21%) more than the capacity added during the first six months of 2023. Based on the most recently reported data, developers and owners ...
CFPB analysis of the Diary and Survey of Consumer Payment Choice from 2017 to 2022 show that even among consumers with incomes below $50,000, the amount withdrawn at an ATM is more than double the typical cash-back withdrawal amount. Additionally, for the average and median amounts, across all incomes the ATM withdrawal amounts are larger than ...
Electricity generation capacity. To ensure a steady supply of electricity to consumers, operators of the electric power system, or grid, call on electric power plants to produce and supply the right amount of electricity to the grid at every moment to instantaneously meet and balance electricity demand.. In general, power plants do not generate electricity at their full capacities at every ...
Donald Trump's campaign is co-managed by the man who engineered the "swift boating" of John Kerry in 2004, so it should come as no surprise that 20 years later, military service and ...
Here we propose a new semi-vapor electrolysis (SVE) system for cost-effective hydrogen generation that adopts low temperature vapor electrolysis at the anode while maintains liquid water circulation at the cathode. The SVE process requires no additional energy input as compared to conventional liquid water electrolysis while enabling the direct ...