GENERAL ASPECTS OF IN VITRO FERTILIZATION 

Introduction

When reading what many of our supposedly even better medical journals at times allow into print these days, it sometimes feels like we live in an alternative universe. And as fate would have it, a very recent paper in Fertility and Sterility (1) offers an excellent example of what we mean by making this statement.  

 

Considering how much – forgive the word – “garbage” has over the last two decades been published about preimplantation genetic testing for aneuploidy (PGT-A) – of course a favorite topic of permanent discussion at the Center for Human Reproduction (CHR) in NYC and, therefore, in the VOICE as well as The Reproductive Times – it should not surprise that we again are addressing a PGT-A paper. 

 

It does not appear to matter that well-designed prospectively randomized studies have demonstrated no outcome benefits for IVF in general populations, as even a recent combined ASRM/SART Practice Committees’ opinion pointed out (2). Even this ASRM/SART guidance, however, does not appear to have made even the slightest dent in the number of – forgive again the word – “garbage” papers on this subject, some even better medical journals are seemingly willing to publish. Who then can be surprised that PGT-A utilization in association with IVF in the U.S. (and in many other countries) is still increasing. 

 

A Chinese paper on PGT-A

Coming back to the paper of discussion in this commentary – this time a paper authored by Chinese investigators – a point obviously not made to in any way disparage Chinese colleagues who these days often report absolutely amazingly well thought out and conducted studies – but to consider since “garbage” papers are now au-mass produced by paper mills mostly in select countries, many  – though by no means all – can be sourced to China (3-5).   

 

The papers from China (and some other countries) these days deservedly receive special attention in many editorial offices of medical and science journals, should, therefore, not surprise. A really bad paper, nevertheless undiscovered sailing through peer review and being published, therefore, after publication, deserves – and, indeed, mandates – attention and, hopefully, also some internal reassessments and reconsiderations about the editorial peer review processes, including reviewer selection, at the journal. 

 

The paper here addressed claimed to have investigated the effects of preimplantation genetic testing for aneuploidy (PGT-A) in IVF cycles of women, and this is a very important point, who at the time of the IVF cycle were at an advanced age (1). The paper, however, offers absolutely no rational why it would even conduct such a study, considering even ASRM and SART in a joined document finally – and with difficult to understand delay of not only months but years – recently concluded that PGT-A to this point, in all of its iterations and under all of its earlier names, has been unable to demonstrate any IVF cycle outcome benefits in unselected patients populations (2).  

 

And without clinical utility in general patient populations, why would anybody with just minimal common sense conclude that PGT-A, even theoretical, might work in older women? Women with advancing age, of course, produce progressively smaller numbers of eggs and embryos in IVF cycles, which, indeed, is one of two principal causes for declining pregnancy rates in IVF with advancing female age (the second cause is declining egg quality). Older women, based on biologically, but also mathematically indisputable facts, therefore, must be among the worst candidates for routine utilization of PGT-A, even though some “garbage” studies have suggested that women above age 35 benefit from PGT-A (6). 

 

The problem with the still-expanding use of PGT-A in IVF cycles is not only the procedure’s complete lack of clinical utility (i.e., PGT-A not only fails to improve IVF outcomes) but overwhelming evidence that, at least in some subpopulations, PGT-A actually reduces pregnancy and live birth chances in IVF. And older patients, for very obvious reasons, are, of course, among those subpopulations that are harmed by the procedure. And, though there are several very good reasons for that, we here will point out only the most obvious one because it is simply indisputable: Every false-positively as “aneuploid’ labelled embryo (which, therefore, is not used for transfer or is even discarded) reflects lost pregnancy chances.  

 

A word on the irrationality of PGT-A

That PGT-A produces an unacceptably high percentage of false-positive results has, as innumerable publications attest, been argued by the CHR against fierce opposition from PGT-A proponents since approximately 2007. By in 2015 reporting the first chromosomally normal offspring in the world after transfers of what then were reported to be “aneuploid” embryos that should have been discarded based on test-laboratory recommendations (under current PGT-A practice some of these embryos might be classified as “mosaic”), the CHR’s argument was no longer disputable.  

 

Rather than reconsidering the whole concept of PGT-A (in medical terminology called a “hypothesis”) of biopsying embryos for chromosomal abnormalities, PGT-A proponents introduced the concept of “mosaicism” to their report repertoire. They rechristened the test under its current name, PGT-A, though without reassessing whether this test biologically, genetically, and mathematically really made any sense.  

 

Refuting that any of the promised outcome benefits from PGT-A have been achieved, the recent ASRM/SART document on the subject of PGT-A really offers a clear answer (2) because, if a medical intervention does not fulfill its purpose and does not at least fulfill a compensatory outcome benefits, and when this procedure has failed to demonstrate any utility for over 20 years in different presentations and under different protocols and – on top of all of this – when this procedure in addition adds significant costs (in this case at least $5,000) to already for many unaffordable IVF cycle costs, the only logical as well as ethical conclusion left is to abandon the use of the procedure, unless convincing indications exist for its use (and nobody argues that there do not exist some of such rare indications). 

 

Without further rehashing the history of PGT-A, obviously very closely related to the history of the CHR, only so much: PGT-A can have false-positive results for several reasons, among those, and not exclusively, technical and biological reasons. As now recognized, many at blastocyst-stage by PGT-A as “aneuploid” diagnosed embryos – even though at that point correctly diagnosed as containing “aneuploid” cells (which means that the embryo is either “mosaic” or outright “aneuploid”), can, downstream from blastocyst-stage, still self-correct (7) (i.e., expel “aneuploid” cells from its confines). Even a technically correct PGT-A diagnosis may, therefore, turn out to be biologically incorrect. 

 

And, as a biologically incorrect diagnosis – if such an embryo is then deselected from embryo transfer – this deselection reduces a woman’s cumulative pregnancy chance in the IVF cycle in which this embryo was produced. And it, once again, also does not take a genius – or for that matter, large prospectively randomized studies – to figure all of this out. Yet, “garbage” papers, making all kinds of convoluted arguments in support of PGT-A utilization, still make it into the literature 

 

Further evidence on why this paper should never have been published

So, this time it was Fertility and Sterility that fell for the scam involving a paper that claimed to have investigated “embryo transfer outcomes in women of advanced reproductive age”(1). And to make the study apparently more interesting, the researchers decided to restrict the study population to only older women who at most produced three oocytes in a given IVF cycle. 

 

Among 230 studied IVF PGT-A cycles, 49 (21.3%) were repeat cycles, of course representing a first – and not last – major protocol design error since repeat cycles in the same patients can lead to significant patient selection biases. Though the study alleges to have studied women of “advanced reproductive age,” astonishingly, all the manuscript reveals is that all women were above age 38 (no mean ± SD, median, or age range, and all of those, of course, greatly matter. In contrast, 309 patients received no PGT-A, and among those, 89 (28.8%) had more than one cycle (with, of course, the same relevance as in the PGT-A group).  

 

Those two study groups of patients were, moreover, of course, not randomized, suggesting that there had to be reasons why all these women either did or did not undergo PGT-A, very obviously creating further concerns about patient selection biases and confounding patient characteristics. Considering the relatively large number of women with repeat IVF cycles in the study, it also seems likely that at least some of them may have been in PGT-A as well as no PGT-A cycles.  In other words, study design alone should have disqualified this paper from publication. 

 

But there are significant other reasons as well: By restricting the study to women with maximally three retrieved oocytes, the authors are trying to give the impression that they were studying a relatively poor prognosis patient group with abnormally low functional ovarian reserve (FOR). That could be a correct assumption, but only if the mean age were under 40 years, but it would not apply if the mean age was higher because above age 40 retrievals yielding only 1-3 oocytes are not necessarily abnormal.   

 

We, at that point of the review of this paper, however, did find a seemingly redeeming characteristic of the paper: The authors in the Materials and Methods section claimed to have assessed IVF cycle outcomes with reference points cycle start (intent to treat) and egg retrieval (usually very similar reference points). This would have been a positive point for this manuscript because many PGT-A papers in the literature reported outcomes with reference to embryo transfer, which requires an IVF cycle to produce at least one transferrable embryo to be included in PGT-A outcome evaluations. 

 

Exclusion of poorer prognosis patients without production of even a single blastocyst – obviously results in a significant patient selection bias toward good-prognosis patients and, therefore, exaggeration of PGT-A benefits in all but good-prognosis patients.   

 

But in proceeding with review of the paper, it very quickly became apparent that the credit we had given to the investigators was found to be seriously misplaced once their data analysis is looked at: PGT-A cycles of course, as one would expect had lower transfer rates; but the difference was astonishing: Only 14.1% of PGT-A cycles reached embryo transfer vs. 84.4.2% in controls (on a side note, in the abstract the latter percentage was incorrectly listed as 73.2%). And beyond this point, the data analysis, of the paper became deceptive: The study, indeed, reported a positive chemical pregnancy test in the PGT-A group in 71.4% of cycles (30/42, with 42 being the number of cycles that produced at least one transferrable embryo and, therefore, reached transfer). 

 

This, of course, very clearly reflects data manipulation and misdirection because the in Materials and Methods alleged reference points for analysis of IVF cycle outcomes were cycle start and retrieval, for which we above – obviously prematurely, lauded the authors. They now, however, replaced cycle start and retrieval with the incorrect and biased reference point – embryo transfer – as earlier noted, a very typical statistical “trick,” proponents of PGT-A have used in innumerable studies over the years. Every study with reference point embryo transfer obviously selects out the best prognosis patients because it takes at least one or more blastocysts to be included in IVF cycle outcome determinations. In contrast, poorer prognosis patients who do not produce transferrable blastocysts just don’t exist in outcome determinations. The obvious, undisputable consequence is a significant exaggeration of outcome benefits for PGT-A, as so many pro-PGT-A studies have now done for over two decades.  

 

The discussed study demonstrates these facts, however, better than most other studies. Considering the huge discrepancy in percentage of cycles reaching embryo transfer (14.1% vs 84.4%) and how this point was not recognized during peer review is, therefore, difficult to understand. With among non-PGT-A cycles 362/429 reaching embryo transfers, these cycles produced in reference to transfers only 62/362 pregnancies, resulting in a seemingly statistically highly significant outcome benefit of positive pregnancy test (in itself a very unreliable endpoint) of 71.4% (PGT-A cycles) and 17.1% (non-PGT-A cycles, P<0.001). 

 

A correct data analysis of positive pregnancy test with reference point cycle start (i.e., intent to treat) should, however, of course have compared in PGT-A cycles 30 pregnancies out of 298 (10.1%) cycle starts to 62 pregnancies out of 429 cycle starts (14.5%) – not only an inversion of reported outcomes, but suggestive of an almost 50% higher pregnancy rate for women not-undergoing PGT-A. 

 

The same principles, of course, also apply to other more telling outcomes than chemical pregnancies, including clinical pregnancy rates, clinical pregnancy rates per retrieval, live birth rates per retrieval, live birth per number of retrieval cycles, and – requiring slightly different considerations – time to pregnancy (see Table). 

In addition, the paper reported a significant difference in miscarriages with reference retrieval of 4/298 (1.4%) vs.18/429 vs.18/429 (4.2%) to the disadvantage of non-PGT-A cycles. (P<0.027); miscarriage rates per pregnancy of 4/28 (14.3%) vs 18/45 (40%) were also significantly higher in non-PGT-A cycles. Time to pregnancy in months was non-significantly longer in PGT-cycles (17.6 ± 4.6 vs. 16.7 ± 7.9 months), quite a long time in both groups.

Summary and conclusions

After multivariate logistic regression the paper claimed a significant advantage (P<0.001) for PGT-A in chemical pregnancies and clinical pregnancies, which we cannot agree with, marginally lower miscarriage rates per retrieval (P=0.038) and per pregnancy P=0.013) which may or may not be correct, a better live birth rate per ET which – unquestionably – is a spurious finding, and even the paper noted no difference between both cycle groups when it came to live births.

Considering this study’s results, the authors’ claim that “PGT-A significantly improved clinical pregnancy and live birth rates per embryo transfer in women with advanced reproductive age and diminished ovarian reserve” does not appear supported by the study’s results. As already noted before, such a conclusion is also completely counterintuitive to all logical and mathematical considerations and, with great likelihood, can be attributed to patient selection biases, which are so obvious in this manuscript. It, for example, seems intuitive to assume that older women with unusually good FOR will see more frequent utilization of PGT-A than older women with low FOR.

The authors furthermore concluded that, “although the cost-effectiveness of PGT-A in this population requires further investigation, it offers a valuable tool for patient selection and may reduce the emotional toll of miscarriage for some patients.” We again must politely disagree, since we saw nothing in this study to suggest that it in any way contributed to patient selection for PGT-A. Even assuming a marginal non-significant reduction in miscarriage rate with PGT-A to be valid – a very questionable proposition – one must question whether such a minor effect warrants the wide utilization of PGT-A, which in the U.S. already involves over half of all IVF cycles.


References – Because of its subject, this article – exceptionally – offers an extended references listing with up to 5 authors and including titles of articles.

  1. Ou Z, Liu N, Chen A, Li Q, et al. Effects of preimplantation genetic testing for aneuploidy on embryo transfer outcomes in women of advanced reproductive age with no more than three retrieved oocytes. Fertil Steil 2025;123(6):991-998

  2. Practice Committees of the American Society for Reproductive Medicine and the Society for Assisted Reproductive Technology. The use of preimplantation genetic testing for aneuploidy: a committee opinion. Fertil Steril 2024;122(3):421-434

  3. Ro C, Leeming J. Buy the “BY”: Revealing the machinations of paper mills. Nature 2025;642:823826

  4. Soriano JB, Ruano-Ravina A. The rising threat of predatory journals and paper mills in respiratory medicine and research. Lancet Resp Med 2025;13(6):E30-E31

  5. Parker L, Boughton S, Bero L, Byrne JA. Paper mill challenges: past, present, and future. J Clin Epidem 2024;111549

  6. Simopoulou M, Sfakianoudis K, Maziotis E, Tsisoulou P, Sokratis G, et al., PGT-A: who and when? A systematic review and network meta-analysis of RCTs. J Assist Reprod Genet 2021;38(8):939-1957

  7. Yang M, Rito T, Metzger J, Naftaly J, Soman R, Hu J, et al. Depletion of aneuploid cells in human embryos and gastruloids. Nat Cell Biol 2921;23(4):314-321

Previous
Previous

UPHEAVAL CONTINUES AT PROMINENT UNIVERSITIES AND COLLEGES 

Next
Next

GENERAL ASPECTS OF IN VITRO FERTILIZATION