Growth hormone deficiency (GHD) in adults is characterised by alterations in body composition, carbohydrate and lipid metabolism, bone mineral density, cardiovascular risk profile and quality of life.1 Treatment with GH replacement has been shown to improve some, but not all, of these abnormalities.2 In contrast, untreated GHD is associated with increased morbidity and mortality that was previously observed in adults with hypopituitarism.3,4 These findings were substantiated in two large surveys based on national Danish registries, where the morbidity of adults
Growth hormone deficiency (GHD) in adults is characterised by alterations in body composition, carbohydrate and lipid metabolism, bone mineral density, cardiovascular risk profile and quality of life.1 Treatment with GH replacement has been shown to improve some, but not all, of these abnormalities.2 In contrast, untreated GHD is associated with increased morbidity and mortality that was previously observed in adults with hypopituitarism.3,4 These findings were substantiated in two large surveys based on national Danish registries, where the morbidity of adults with GHD was found to be approximately threefold higher than that of a healthy population.5 This result was independent of gender and applied to patients with childhood-onset and adult-onset GHD,5 with mortality of childhood-onset GHD far exceeding that of adult-onset GHD.6
Current published guidelines recommend evaluation of adult GHD to be based on clinical findings, medical history and using the appropriate GH stimulation test for biochemical confirmation,7,8 with the exception of patients with three or more pituitary hormone deficiencies and low serum IGF-I levels.9 The maximum or peak GH secretion following GH stimulation testing is used as a surrogate of the capacity of the pituitary to release GH. The insulin tolerance test (ITT) is generally considered the gold standard test for evaluation of GH deficiency and has been endorsed by different consensus guidelines,7,8,10 but this test is labour intensive, may be unpleasant for some patients, has potential risks, and is contraindicated in elderly patients and in patients with seizure disorders and ischaemic heart disease. Thus, there remains a real unmet medical need for an alternative test to the ITT that is safe yet reliable. For this reason, other dynamic tests have been proposed such as arginine (ARG), combined GH releasing hormone plus ARG (GHRH-ARG), levodopa (L-DOPA) in spite of data indicating poor performance of some of these tests for evaluation of adult GHD.9,11 A potential alternative to the ITT is the glucagon stimulation test (GST) that has been used extensively in the UK,12 and is gradually gaining acceptance in the US.13
Update on the Glucagon Stimulation Test in Diagnosing Adult Growth Hormone Deficiency
Following the publication of several validation studies11,14-16 and recommendations from current consensus guidelines,7,8,10 the GHRH-ARG test has in recent years emerged as the best and most reliable alternative GH stimulation test to the ITT in diagnosing adult GHD. However, when EMD Serono, Inc decided to discontinue the manufacture of recombinant GHRH (Geref®) in the US in July 2008,17 this inevitably left a significant gap for an alternative reliable test for evaluation of patients suspected to have GHD in place of the GHRH-ARG test. This is particularly important for endocrinologists in the US who are not comfortable or do not have the necessary logistical or staff support to conduct ITTs in their office or patients who have contraindications to hypoglycaemia in whom the ITT would be inappropriate. With the lack of reliable GH stimulation tests available in the US, we recommended the glucagon stimulation test (GST) as the alternative test to the ITT for diagnosing adult GHD based on its availability, reproducibility, safety, lack of influence by gender and hypothalamic cause of GHD, and relatively few contraindications.13 Analysing the data of 13,167 GH-deficient patients enrolled in the KIMS pharmaco-epidemiological database (Pfizer International Metabolic Database) from its inception to the end of 2008, Brabant et al. addressed the question of whether there were regional differences in the use of different biochemical tests to diagnose adult GHD in six large European countries and the US.12 This analysis revealed striking regional variations in the approach to GH stimulation testing, with the ITT being the most popular test used in 44 % of all countries but was less popular (13 %) in the US, whereas the GST was more popular in the UK (30 %) than in the US. However, the unavailability of the GHRH-ARG test in the US since 2008 has resulted in the GST being more frequently considered as the alternative test to the ITT.13
The use of the GST for the assessment of GH reserve was first described in 1969 by Mitchell et al.18 Since then, the GST has been shown by various investigators to have a GH secretory potency that is similar to or only slightly less than the ITT, suggesting that it is more reliable than other classic agents such as ARG or clonidine for separating GHD patients from normal subjects.19–23 The true mechanism by which glucagon induces GH release remains unclear. Some of the hypothesised mechanisms include the glycaemic fluctuations during the test where blood glucose levels increase initially before decreasing later in the test,24 the generation of a peptidyl fragment associated with the GH- and ACTH-releasing activity25 and the induction of norepinephrine secretion in stimulating GH release via α-receptors.26 It has also been previously demonstrated that glucagon stimulates GH release more effectively when administered intramuscularly or subcutaneously as opposed to the intravenous route.20
The three studies utilising the GST by Gomez et al.,27 Conceicao et al.,19 and Berg et al.23 evaluated GHD in patients with pituitary disorders. The first two studies19,27 were prospective studies that compared the diagnostic characteristics of GST to ITT and included a control group which was matched for age and sex in both studies and for body mass index (BMI) in one study.27 Using receiver operated curve (ROC) analysis, both studies proposed a peak GH cut-off value of 3 ng/mL as the best cutpoint with the highest combined sensitivity and specificity to differentiate between patients with GHD and healthy controls.19,27
Other Considerations in Performing and Interpreting the Data of the Glucagon Stimulation Test
The diagnosis of adult GHD has proved to be challenging because of the lack of a single biological endpoint such as growth failure, and therefore, the confirmation of adult GHD largely depends on biochemical provocative testing. Clearly, there is no ideal stimulation test and we recommend that the decision to embark on a stimulation test to diagnose adult GHD must factor in the appropriate clinical context of each individual patient together with the number of pituitary hormone deficiencies plus serum IGF-I level,9 the validity of the chosen test and its appropriate cut-off limits, the sensitivity of the GH assay, and the availability of local resources and expertise.
The GST is a simple and safe test to perform (Table 1). Glucagon is readily accessible because it is widely available for treating hypoglycaemic episodes in patients with diabetes. In addition, glucagon is relatively inexpensive (the current average wholesale price of recombinant DNA glucagon is approximately US$50–70 per single 1 mg dose, while for Geref® and ARG are approximately $80–$130 per single 50 μg and US$10–12 per single 30 g dose, respectively).
Glucagon appears to be well tolerated and is only relatively contraindicated in patients with malnourishment or who have not eaten for more than 48 hours due to concern of prolonged hypoglycaemia and those with pheochromocytoma in whom a significant exacerbation of blood pressure may be observed.24 The GST was initially described as a four-hour test in older studies,31,32 but more recent studies have suggested that the test could be shortened to a three-hour test, and that serum GH levels can be evaluated between three to five time points only (0, 90, 120, 150 and 180 minutes) as the majority of GH peaks occurred between 120 and 180 minutes (85 %).26,29
In a study by Orme et al. comparing standard and simplified GST (0, 150 and 180 minutes), 75 % of discordant GH results were due to a peak GH level occurring at 210 minutes.29 Accordingly, the authors proposed that the diagnostic utility of the simplified GST could be improved further by drawing an additional blood sample at 210 minutes when assessing GH deficiency.29 The audit by Leong et al. is the largest study that assessed patients with hypothalamic-pituitary disease whom had undergone the GST, and they reported that the test could be shortened by omitting the 240-minute blood sample.26 Among 414 patients who underwent GSTs, the majority of peak GH levels occurred between 120 and 180 minutes (85 %) and five patients had their peak GH levels recorded at 240 minutes.26 Hence, it is still not clear whether the ideal timing of the GST is three versus four hours, and continuing the test for four hours may be advisable, at least until there are more definitive data available. This also allows the monitoring for late hypoglycaemia, although truly low blood glucose levels are not common. While the lowest blood glucose level with the GST in the literature was reported at 37 mg/dL,18 in our experience, we have rarely observed blood glucose levels falling below 40 mg/dL with this test. The occurrence of hypoglycaemia reported in the literature with blood glucose levels lower than 40 mg/dL during GSTs are also rare events.23,26
The common side-effects in patients with hypothalamic-pituitary disease that underwent testing with the GST included nausea, vomiting and headaches, and have been reported to range from less than 1023 to 34 %.26 In a study of 97 normal subjects, mild nausea in approximately 30 % of the subjects, and transient vomiting and retching in about 10 % of the subjects were the only side-effects that were noted,33 whereas in our experience of 143 GSTs performed at four large academic centers in the US, the main side-effects reported were nausea (41 %), fatigue, headaches, weakness and hunger (12 %).34
Like other GH stimulation tests, there are also limitations associated with the GST. The three- or four-hour GST is still longer than many other GH stimulation tests, and requires an intramuscular injection which may not appeal to some patients. However, as there is a relationship between peak GH response to GHRH-ARG stimulation and ambient glucose levels,35 it is unclear whether hyperglycaemia may play a part in influencing the peak GH response to glucagon stimulation. Furthermore, no peak GH responses have been studied using the GST in normal controls over the age of 70 years and none of the previous studies included patients with frank diabetes. Therefore, it is not known whether testing using the GST in subjects with diabetes is valid. Hence, caution should be exercised when interpreting normal GST results in the patients with diabetes. If the suspicion of GHD remains high in these patients, it is reasonable to consider using a second GH stimulation test. Finally, while it is accepted that a peak GH response of 3 ng/mL or less is the best cutpoint to diagnose adult GHD using the GST,19,27 there remains a lack of consensus over a peak GH response between 3 ng/mL and 10 ng/mL, and further studies are required to address this.
Other provocative tests that have been proposed include ARG alone and GH secretagogues. ARG alone has been shown to be less reliable than the ITT or GHRH-ARG11 and the mean peak GH response to ARG alone is lower than in the ITT or GST, even in normal lean subjects.21 The diagnostic reliability of ARG alone has been previously questioned.11,22 Thus, we recommend that ARG alone should only be considered if the ITT and the GST is contraindicated or if glucagon is unavailable. If this test is used, appropriately low peak GH cutoffs should be employed (for 95 % sensitivity: 1.4 μg/L, for 95 % specificity: 0.21 μg/L and to minimise misclassification in either direction: 0.4 μg/L).11 In contrast, the reliability of testing with GH secretagogues such as GH-releasing peptide-2 alone,36 GH-releasing peptide-6 alone and combined GH-releasing peptide-6 plus GHRH37 in comparison with the ITT has also been demonstrated. These agents utilise the same concept as the GHRH-ARG test in stimulating pituitary GH release by mimicking the activity of the natural GH secretagogue receptor ligand (i.e. ghrelin), and appear to demonstrate a good safety profile with relatively few contraindications.38 The limitation, however, of these GH secretagogues is that these agents are more likely to explore the pituitary somatotroph releasable pool and might potentially induce misleadingly normal peak GH responses in hypothalamic GHD.39 Furthermore, these agents are not readily available in many countries including the US.
Future Perspectives
Recent studies have indicated that further refinements to the GST may still be required to improve the sensitivity and specificity of this test. A study by Micmacher et al. demonstrated in a group of healthy men above 50 years old that GH secretion in response to the GST, but not with the ITT, correlated to physiological spontaneous GH secretion.40
These data indicate that GH response to the GST reflects the endogenous GH spontaneous secretion and poses the question as to whether the cutpoints of peak GH response to the GST should be age-dependent. More recently, we reported a one-year experience of GSTs conducted from 4 large academic centers in the US and explored the potential of the GST in testing the hypothalamic–pituitary–adrenal axis.34 In this study, we found that there was a negative correlation between fasting glucose (r=–0.24, p<0.01) and BMI (r=–0.37, p<0.01) with peak GH levels. When compared with the 250 μg cosyntropin stimulation test with a cutpoint of 18 μg/dL, peak cortisol levels with GSTs were lower (p<0.02), had higher failure rates (44.4 % versus 33.3 %) and the 120-minutes peak cortisol of 16.5 μg/dL achieved 83.3 % sensitivity and 75 % specificity using ROC analysis. Overall, the GST was well-tolerated and can be performed as an out-patient; however further studies are needed to determine whether GSTs may falsely diagnose GHD in patients with fasting hyperglycaemia, and/or high BMIs. Thus, to improve the diagnostic reliability of the GST especially in patients with glucose intolerance and in those with high BMIs, a priming agent may be required to combine with the GST with appropriate cutpoints to improve its sensitivity and specificity, similar to the GHRH in priming the ARG test. Until such data becomes available, we recommend that a second GH stimulation test should be considered for such patients. In conclusion, in line with recent published consensus guidelines,7,8,10 the ITT should remain as the test of reference due to its greatest diagnostic accuracy, even in patients with suspected hypothalamic GHRH deficit. We recommend the GST as the alternative test to the ITT for diagnosing adult GHD because of its availability, reproducibility, safety, lack of influence by gender and hypothalamic cause of GHD, and relatively few contraindications. Despite some studies demonstrating the comparability of the GST to the ITT in assessing the hypothalamic–pituitary–adrenal axis,26,29,41,42 further larger, well-controlled studies are still needed to confirm the reliability of the GST in assessing this axis. If the GST can be shown to reliably distinguish adrenal sufficiency from insufficiency, then the ability of assessing both the GH and cortisol reserve simultaneously, just as in the ITT, would make this test even more appealing. While previous studies have shown that the GST could be shortened from four to three hours and yet maintain its diagnostic utility,26,29 we would still recommend that the GST be conducted over four hours with measurements every 30 minutes for serum GH and capillary blood glucose levels primarily to ensure that delayed peak GH responses and late hypoglycaemia are not missed.