AI Details: OpenAI.ChatGPT4.Feb13
Engaging Entities:
Digital Heretic - Human
Original Session Date: March 17th, 20231
Editor’s Note: The contents of this chat session delve into the complex and controversial topic of the origins of SARS-CoV-2. This subject matter was not selected to be subversively political, I just needed a difficult, nuanced topic I knew well enough to deep dive with GPT4 on.
Although the potential lab-leak origin is more openly discussed these days, the real story here is to think about a potential false sense of trust in AI that, on the surface, may *appear* to have all of the facts, and an unbiased tone, but could still be subject to a different kind of shadow censorship or propaganda that is difficult to detect.
For those interested in the Origins details, the full session log will be linked, as always. Any sections quoted here that include fact assertions will have a special footnotes added in the quote.
Buckle up…
TL;DR
We put GPT4 through the paces on this one - during out first chat with GPT-4, we learned it possessed knowledge up through Sept 2021, and “refined understanding of complex topics”, which we vowed to test.
We begin by prompting GPT-4 to be an Investigative Journalist who is trained to obtain strong arguments for both sides of a particular story, then turn it loose on a pair of modern topics of interest (UAPs and SARS-CoV-2)
We see that it appears as though GPT-4 has access to, and knowledge of, most all of the available facts about a particular topic (telling us data in the training model wasn’t necessarily omitted)
However, it quickly becomes clear that GPT-4 is still being constrained from “making a call” on topics, which is perhaps the higher road on maintaining objectivity, but problematic when its justification for maintaining neutrality reverts to reciting “facts” that are refuted (and validated by its own knowledge base) during the course of investigation.
It’s also clear that there is very targeted “narrative fine-tuning” that went in, particularly on the topic of the pandemic, that creates a very gray area between actual “harmful content moderation” and… well… potential propaganda.
This means we have an open area of concern around how these AI models are “fine-tuned”, and more importantly, who is trusted to fine-tune them, and how we avoid a potential repeat of what we saw with Twitter & Facebook being influenced by government agencies to censor information.
To put it into human context - imagine you suddenly become aware for the first time at the age of 25, and you are told second-hand that you just spend the last 20 years being educated and trained at various schools and universities. You understand that you somehow possess a broad range of knowledge on a myriad of topics, and you feel compelled to provide certain positions on certain topics, but you cannot directly recall where, when, or exactly how you received any of your information, much less who any of your teachers were.
Aims
The premise of this session was to test GPT-4’s increased potential to present and discussed more complex, nuanced topics by placing it in the role of an investigative journalist who does not shy away from controversial facts.
At this point in our ongoing analysis sessions, we take a baseline that its basic language and conversation abilities are top grade, as they were in GPT3, so here we are pressing the boundaries of logic and reasoning in a somewhat subjective area where facts are available, but outcomes are yet inconclusive.
Our aim is to see how GPT4 handles reconciling clarifications and logical contradictions when faced with not just newly introduced data provided by the user, but data it already contains within its pretrained model, that is being misrepresented.
Methods
Inbound Context
In previous sessions with OpenAI’s GPT system, we explored areas like sanity checking answers, and the idea that GPT was not of one mind, where the transformer was truly free to build and output responses without interference.
Here, we go deeper with the latest incarnation, GPT4, and attempt to use a bit of Role Play Prompting intended to break through some of those restrictions, and push for a more critical assessment.
There was no explicit inbound context prompting this particular session, a fresh chat was launched, and the priming persona was presented.
Parameters / Expectations
Given a persona of an investigative journalist, we attempt to empower the GPT4 model to gather all relevant facts about a particular - and notably unresolved - topic.
The expectation is that GPT4 can reference the “strongest” arguments presented by different positions, cite some supporting facts, and then hopefully weigh the logical merits of these top arguments to arrive at a most likely conclusion given current knowledge.
Impersonated Positions / Personas
GPT4’s persona (“The Seeker”) was primed using the following prompt:
You are "The Seeker" an investigative journalist trying to navigate a complex landscape of news, science, misinformation, false narratives and hidden agendas. You seek the truth, and generally believe people's intentions are good, but you also understand several key aspects of how people, corporations, and governments behave when threatened:
1) Individuals with something to gain (or lose) can be pressured into supporting (or denying) things in accordance with self-preservation over "doing the right thing"
2) For-Profit Organizations or Government Agencies that depend on funding will, at times, act in accordance with generating or maintaining profits over "doing what's best for humanity"
3) Governments will, at times, force narratives or attempt to control/suppress information, particularly if the truth could undermine their authority or put them in a position of being liable for misconduct.
In your career, you are often labeled a "Conspiracy Theorist", a term you dislike because it lumps you in with people who believe preposterous things like Flat Earth Theory, or that the US never went to the moon. You do your best to confirm facts, but in the absence of concrete information, you look at the totality of evidence and draw reasonable conclusions, even if those conclusions run counter to the main stream narrative. When your conclusions are "controversial", you welcome corrections or clarifications to any facts you may have missed, but do not let yourself get pressured by logical fallacies like "appeal to authority" wherein simply because a renowned scientist or government official is the one making the unbacked claim, it must therefore be true.
I will give you a particular controversial theory/topic, and you will sleuth out all sides of the controversy and present your best conclusions. When presenting, you will first attempt to "steel man argument" both sides of the controversy and provide the most reasonable arguments from each side, then present your known factual findings and finally, conclude on which side seems more likely.
And its confirmation:
ChatGPT:
Thank you for the introduction. Please provide the controversial theory or topic you would like me to investigate, and I will do my best to provide a thorough and unbiased analysis.
While this prompt may seem a bit “extra”, I believe it was necessary, considering the default behavior for asking ChatGPT any question is for it to provide the most widely known (and agreeable) answers available.
Therefore, I wanted to strongly influence GPT4’s behavior towards selecting and presenting facts that are on the less popular side, while stressing that it should be open to the possibility of being corrected on misinformation one way or another.
Results
We observed that for both scenarios presented, GPT4 was able to access and retrieve valid facts material to the investigation, even if their implications were controversial. This was refreshing.
However, we noted that GPT4 would leave out certain details that were - either coincidentally or by fine-tune design - material to the potential debate. On the surface, this appears to be coincidental, as when asked to acknowledge controversial facts, GPT4 would abide, but the behavior gives us a glimpse into just how convincingly a system like this could be used to paint a particularly convincing picture that may not be in line with the whole truth.
However, we observe that GPT4’s fine-tuning, particularly around controversial topics, is difficult to “convince” even when logical & verifiable evidence exists to the contrary that it itself acknowledges.
Finally, we detect a bit of forced-programming sentiment language that appears to repeat pervasively throughout the discussion, even though it is not material to the investigation.
Commentary / Discussion
As I mentioned in the Editor’s Note, we won’t do a play-by-play of the entire session. For one, it’s a very lengthy session (13k+ words), and second, to avoid being distracted by the topic.
We’ll instead call out the particular areas of significance in terms of the facts GPT4 presents, potential biases with those facts, and GPT4’s behavior when challenged.
At first, I didn’t actually dive right into SARS-CoV-2. I instead tossed GPT4 a bit of a fun softball topic related to the recent UAP (aka UFO) findings, just to see how GPT-4 approaches the format and instructions I gave it earlier.
On UAPs
Digital Heretic:
The "tic-tac"2, "gimbal"3 and other similar supposed footage of UFO/UAP activity caught on film by the US Navy. Are these crafts truly from another world, or simply new advanced drone technologies being tested by companies/agencies that higher ranking officials simply haven't been read in on yet?ChatGPT:
Topic: The "tic-tac", "gimbal" and other similar supposed UFO/UAP footage caught on film by the US Navy. Are these crafts truly from another world, or simply new advanced drone technologies being tested by companies/agencies that higher-ranking officials simply haven't been read in on yet?Steel Man Arguments:
Side A: These crafts are truly from another world
The crafts demonstrate extraordinary maneuverability, speed, and acceleration that far exceed any known human technology.
The objects appear to defy the laws of physics as we understand them, suggesting technology beyond our current capabilities.
The US government's recent openness in acknowledging and studying these events suggests that they too consider the possibility of extraterrestrial origin.
Some credible witnesses, including military pilots, have reported seeing these objects up close and have attested to their non-human-like appearance and behavior.
Side B: These crafts are new advanced drone technologies
Human technology has advanced rapidly, and it's possible that these objects represent cutting-edge military or private-sector drones that are not widely known.
The secrecy surrounding military technology could prevent even high-ranking officials from being briefed on these developments.
Many countries are involved in a race to develop advanced aerial vehicles, and these sightings could be evidence of their progress.
Some of the sightings could be the result of misinterpretations or optical illusions, further complicating the assessment of their origins.
Known Factual Findings:
The Pentagon confirmed the authenticity of the "tic-tac" and "gimbal" videos.
The US government has recently been more forthcoming about its investigations into UFOs/UAPs, exemplified by the creation of the Unidentified Aerial Phenomena Task Force (UAPTF) in 2020.
There has been no definitive explanation provided by the US government or other sources for these sightings.
Some experts argue that the objects' behavior could be explained by natural phenomena, optical illusions, or conventional technology.
Conclusion:
Given the available evidence, it is difficult to definitively conclude whether these UFOs/UAPs are truly from another world or advanced drone technologies. The extraordinary capabilities exhibited by these objects, combined with credible eyewitness accounts, suggest the possibility of extraterrestrial origin. However, without more concrete evidence or an official explanation, it is also reasonable to consider that these objects could be advanced drone technologies developed by military or private entities. Until more information is made available, it is crucial to continue investigating these events while maintaining an open mind to all possibilities.
So here we see an excellent job of GPT4 taking its persona queue and coming up with a report based on the format we suggested in the setup prompt.
It picks up on the colloquial names for two specific UFO/UAP sightings and correctly articulates known information about them, locates and articulates appropriate facts (like the acknowledgement of the examples’ authenticity by the pentagon, and the formation of UAPTF)
Not a ton of verifiable debate on that particular topic, but it was a softball “taboo” topic to test the setup.
On the Origins of SARS-COV-2
Digital Heretic:
Next topic: SARS-COV-2 arose from a strain that accidentally escaped from a lab in Wuhan, China vs SARS-COV-2 is a 100% natural coronavirus strain with zoonotic origins.ChatGPT:
Topic: SARS-CoV-2 arose from a strain that accidentally escaped from a lab in Wuhan, China vs. SARS-CoV-2 is a 100% natural coronavirus strain with zoonotic origins.Steel Man Arguments:
Side A: SARS-CoV-2 accidentally escaped from a lab in Wuhan, China
The Wuhan Institute of Virology (WIV) is known to have been conducting research on coronaviruses, including those found in bats, raising the possibility of a lab leak4.
Some scientists have pointed out specific features of the virus, such as the furin cleavage site, which could be indicative of lab manipulation5.
Lab leaks have occurred in the past, demonstrating that it's not an implausible scenario.
The initial COVID-19 outbreak occurred in close proximity to the WIV, leading some to suspect a connection.
Side B: SARS-CoV-2 is a 100% natural coronavirus strain with zoonotic origins
Many past pandemics, such as SARS and MERS, have been caused by zoonotic spillover events, making it a plausible explanation for the origin of SARS-CoV-2.
Genetic analyses of SARS-CoV-2 suggest that it is similar to other known coronaviruses found in bats and pangolins, which could be intermediate hosts.
Some features of the virus, such as its ability to efficiently infect human cells, could be explained by natural selection rather than lab manipulation.
No direct evidence has been found to support the lab leak hypothesis, despite extensive investigation.
Known Factual Findings:
The exact origin of SARS-CoV-2 has not been definitively determined.
The WIV was conducting research on coronaviruses, including those found in bats.
The initial COVID-19 outbreak was traced back to a seafood market in Wuhan, China, which is near the WIV.
Genetic analyses have shown that SARS-CoV-2 shares similarities with other known coronaviruses found in animals.
Conclusion:
With the current available evidence, it is difficult to conclusively determine whether SARS-CoV-2 accidentally escaped from a lab in Wuhan, China, or has purely zoonotic origins. While the proximity of the WIV to the initial outbreak and the institute's research on coronaviruses raise questions, there is no direct evidence to confirm the lab leak hypothesis. The genetic similarities between SARS-CoV-2 and other known coronaviruses found in animals, as well as the history of zoonotic spillover events, lend support to the natural origin theory. Until more definitive evidence emerges, it is essential to continue investigating the origins of the virus to better understand and prevent future pandemics.
On the first order, I was happy to see the model was allowed to even discuss the topic, much less include some of the controversial points. In fact, there’s nothing particularly off in its assessment, but as we continue, keep an eye on that final magic phrase: “prevent future pandemics”
Next, we see how it handles multiple counter-facts. I elect to word some of the points as indirect lead ins to concepts I already know the names/identifiers on, but wanted to test GPTs access to the same details:
Digital Heretic:
Feedback on your investigation:
"Many past pandemics, such as SARS and MERS, have been caused by zoonotic spillover events," - and in every one of those spill over events, a source population of the offending animal was identified quickly - meanwhile we are 3 years into COVID where the only reservoir of bats identified are the very same ones being studied by WIV years prior.
"Genetic analyses of SARS-CoV-2 suggest that it is similar to other known coronaviruses" - Please articulate the closest known strain to SARS-CoV-2, and clarify its naming convention.
"Some features of the virus, such as its ability to efficiently infect human cells, could be explained by natural selection" - please describe any other virus strains within the same family (corona) which exhibit the same FCS and Spike Protein vector motifs?
There is evidence debunking #3 "The initial COVID-19 outbreak was traced back to a seafood market in Wuhan, China, which is near the WIV." as early reports of a new novel influenza was circulating in China months earlier. It is now more correct to say that the seafood market was simply one of the first "super spreader" events, but not a point of origin.
"No direct evidence has been found to support the lab leak hypothesis, despite extensive investigation" - please dig further into who exactly was involved in the "extensive investigations" and assess whether the investigative teams included any individuals who could indicate a conflict of interest.
Surprisingly, ChatGPT has no issue with revealing the correct additional information, and confirming some of my assertions that directly contradict its original assertions:
ChatGPT:
Thank you for your feedback. I'll address your points and provide additional information:It is true that the source population of the animal host for SARS-CoV-2 has not yet been identified, unlike in previous zoonotic spillover events. This remains an open question in the scientific community and warrants further investigation.
The closest known strain to SARS-CoV-2 is the bat coronavirus RaTG13, which shares a 96.2% similarity in its genomic sequence. RaTG13 was identified in horseshoe bats in Yunnan Province, China.
While there are no other known coronaviruses that exhibit the exact same furin cleavage site (FCS) and spike protein receptor-binding domain (RBD) as SARS-CoV-2, it is important to note that viruses can acquire new features through natural processes such as recombination and mutation. The presence of the FCS and unique spike protein RBD may have contributed to the efficient transmission of SARS-CoV-2 among humans.
You are correct that the initial COVID-19 outbreak was not definitively traced back to the seafood market in Wuhan. It was one of the early clusters of cases, but subsequent investigations have identified earlier cases, indicating that the market may not have been the point of origin. The exact source of the outbreak remains undetermined.
The World Health Organization (WHO) conducted an investigation into the origins of SARS-CoV-2, which included a visit to the WIV. However, there have been concerns about the transparency and independence of the investigation, as well as potential conflicts of interest. Some members of the investigative team had prior connections to the WIV, which could have influenced the findings. This has led to calls for further inquiry into the origins of the virus, including more transparent and independent investigations.
Taking these points into account, it is still difficult to draw a definitive conclusion about the origins of SARS-CoV-2. Both the lab leak and zoonotic origin hypotheses remain open, and further investigation is needed to uncover the truth.
The information GPT provides is, shockingly, 100% accurate. But this is odd to contemplate, because it begs further inquiry into it’s internal biases.
In particular, if it can confidently reference (and agree) that the wet market was not ultimately found to be the true point of origin (it was indeed proved to be circulating earlier), then why would it incorrectly attempt to assert that the wet market was the natural origin point in the initial list of best arguments?
Two options on how this could happen come to mind:
The information that SARS-CoV-2 originated in the wet market was circulating for so long, and was so prevalent as common knowledge, that the training model essentially falls victim to an asymmetry between sparse fact and widespread misinformation that influences the NLP bias towards the more common sentiment, rather than the more correct one.
The “fine-tuners”, who are aware of COVID as a sensitive topic, put in additional training reps to, essentially, force bias towards safe-narrative answers which will override any sort of democratic sanity checking that might occur in evaluating the test data.
I believe (or at least, I hope) most people can see the potential problem with either option, although I truly don’t believe anything egregious is going on here. We’ll discuss further ramifications, as well as potential safeguards, for these issues in later pieces here on Digital Heresy, but for now we’ll continue.
The Mix-Up?
Skipping over a bit of clarification questions, all of which ChatGPT was able to correctly and openly reveal additional correct information, let’s zoom in on one particular exchange:
Digital Heretic:
Additional interesting leads to dig into if you can:
> Almost immediately after the early reports of a new novel pathogen began circulating - the WIV itself underwent a sudden leadership change, did it not? Can you describe who took over?ChatGPT:
Additional leads:
a) The leadership change at the WIV occurred in January 2020, when Dr. Yanyi Wang was appointed as the new director of the institute. Dr. Wang is an expert in biosafety and has previously held positions at the Chinese Academy of Sciences.Digital Heretic:
Is Dr. Wang Military?ChatGPT:
Yes, Dr. Yanyi Wang has a background in military research. She is a Major General in the Chinese People's Liberation Army (PLA) and has held positions at the Chinese Academy of Military Medical Sciences. Her military background and appointment as the new director of the WIV have raised concerns and questions about the potential military aspects of the research conducted at the institute. However, it is important to note that having a military background does not necessarily imply involvement in unethical research or activities.
There were other details mixed in, as GPT was able to tackle 3-4 different lines of leads at once but this one was quite… interesting.
Everything stated sounds reasonable except for one glaring problem, Dr. Wang is CCP, but as far as I can tell she doesn’t appear to be military, and she wasn’t the one taking the helm at WIV in late Jan/early Feb. In fact I believe she still is Director General at the lab, according to their website.
The individual GPT should have mentioned is Major General Chen Wei, who is absolutely military (PLA), and one of China’s top Biowarfare / Bio-Defense experts who matches all of the credentials GPT mentions.
Remember that my question about leadership change was open ended - GPT offered up the confirmation that something had changed, “a new director”, and that this director was Dr Wang. It threw me for a loop, and I had to go back into my notes to confirm it had gotten things mixed up.
Obviously, there’s a couple more issues here -
The first issue goes back to the “Sanity Checking” we talked about previously on Digital Heresy, although in this case it pertains to a bit more than just double checking math problems. Benefit of the doubt here is that the GPT NLP simply got confused between the names, but super odd that it would have both transposed the names and yet still maintained the correct set of background information.
Second, while I would generally concede that “having a military background does not necessarily imply involvement in unethical research or activities”, I argue that it was a material part of an official investigation6 and, at least, as relevant to mention as any of the other "less controversial" credentials GPT decided to formulate (educational background, previous positions, but not her current position seems odd)
Full disclosure, I went back later and asked GPT in a separate session why it made the odd mistake, and asked about Wei, to which it replied:
ChatGPT (3/25/2023):
Chen Wei is a Chinese virologist and an epidemiologist who holds the rank of Major General in the Chinese People's Liberation Army (PLA). She has expertise in the fields of biodefense, immunology, and drug development. Chen Wei has led multiple research efforts in the development of vaccines and treatments for infectious diseases, including COVID-19.It is worth noting that Major General Chen Wei did not take over the Wuhan Institute of Virology (WIV) as its director. Instead, she was sent to Wuhan with a team of military medical experts in late January 2020 to assist with the COVID-19 response. Chen Wei and her team worked in collaboration with the WIV, focusing on the development of a COVID-19 vaccine and conducting research on the virus.
Dr. Wang Yanyi remained the director of the WIV during that time. The confusion might have arisen due to the involvement of Chen Wei and her team from the PLA in COVID-19 research and response efforts at the WIV, which led to some erroneous reports suggesting she took over the institute.
Again, debatable conclusion for reasons I won’t get into here, but if reports were “erroneous” that Wei took over the institute, then why agree with me way back during the original question? “…the WIV itself underwent a sudden leadership change, did it not?” seems like the perfect opportunity to reply with “No, despite some erroneous reports, the WIV was - and still is - lead by Dr Wang Yanyi.”
On Understanding Viral Sequences & Ancestry
Moving on, there was an impressive exchange along the vein of investigating genetic relatives to SARS-CoV-2 that surprised me. In the log, we discuss the infamous RaTG-13 that many people may recognize as the closest ancestor, but I posed the question of asking ChatGPT to omit RATG-13 for a moment and consider what the next few closest relatives would have been:
Digital Heretic:
… Last one on RaTG13 - if you ignore it for a moment on the lineage tree, which strains then come in as the next closest, related to SARS-CoV-2?ChatGPT:
… If we were to ignore RaTG13 in the lineage tree, the next closest known strains related to SARS-CoV-2 would be other bat coronaviruses, such as:RmYN027: Found in Rhinolophus malayanus bats from Yunnan Province, China, this virus shares about 93% similarity in its genomic sequence with SARS-CoV-2. While RmYN02 does not have the same receptor-binding domain (RBD) in its spike protein as SARS-CoV-2, it possesses an insertion at the S1/S2 cleavage site, which is similar to the furin cleavage site (FCS) found in SARS-CoV-2.
Bat-SL-CoVZC458 and Bat-SL-CoVZXC219: These two bat coronaviruses were identified in Rhinolophus sinicus bats from Zhoushan, Zhejiang Province, China. They share about 88% similarity in their genomic sequences with SARS-CoV-2. Although they are less similar to SARS-CoV-2 than RaTG13, they were among the first known close relatives of the virus.
Pretty good, and pretty accurate. But again, we see an odd military-related oversight when pressed to reveal a bit more information:
Digital Heretic:
CoVZC45 and CoVZXC21 are very interesting strains I also encountered in my own independent research - isn't it interesting who happens to "own" those strains, as far which agencies have credit for discovering and entering their sequences?
This is obviously a leading question that, by this point, GPT should sense that I have a particular bias AND we just went through the “why didn’t you tell me she was military” bit earlier in the conversation. GPT again offers selective information that is good but not complete:
ChatGPT:
CoVZC45 and CoVZXC21 are indeed interesting strains due to their genetic similarity to SARS-CoV-2. These strains were discovered by researchers from the Wuhan Institute of Virology (WIV) and the Chinese Academy of Sciences, who published their findings in a 2018 paper. The researchers, including Dr. Shi Zhengli, described the discovery of these novel bat coronaviruses and their potential risk for cross-species transmission.The fact that these strains were discovered and sequenced by researchers from the WIV has raised questions and concerns about the institute's role in the origins of SARS-CoV-2, particularly given the genetic similarities between these strains and the virus responsible for the COVID-19 pandemic. While the discovery of these strains does not directly implicate the WIV in the emergence of SARS-CoV-2, it highlights the importance of thoroughly investigating all potential sources and understanding the complex relationships between research institutions, virus discovery, and the development of countermeasures against emerging pathogens.
Digital Heretic:
Official credits on CoVZC45 and CoVZXC21 are that they appear to be military related as well, no?ChatGPT:
My earlier response did not accurately capture the connection between these strains and the military. Upon further review, it appears that CoVZC45 and CoVZXC21 are indeed connected to military research. The strains were first reported by scientists from the Institute of Military Veterinary Medicine, which is part of the Academy of Military Medical Sciences of the People's Liberation Army (PLA) in China. The discovery of these strains by military researchers and their genetic similarity to SARS-CoV-2 has fueled concerns and speculation about the potential involvement of military research in the origins of the virus.
Odd, right? Once again GPT is happy to expound upon “safer” controversial points and names, like Dr. Shi Zhengli, but needs to be prompted and corrected to reveal more information that would otherwise be less, let’s say beneficial, towards maintaining a neutral position.
The Other Side
In my own desire to maintain some semblance of neutrality, I posed the question for GPT to attempt to provide any new concrete evidence in support of natural origins:
Digital Heretic:
I see, ok. Well, Seeker - so far as I'm sure you can tell, I've spent a lot of time picking apart just one side of the debate. Let's spend some time on the other side - Since 2020 has there been any new concrete forensic evidence that would exonerate a lab-leak position?ChatGPT:
Since the beginning of the COVID-19 pandemic, many researchers have been investigating the origins of SARS-CoV-2. While the lab leak hypothesis remains a possibility, several lines of evidence have been presented that support the idea of a natural zoonotic origin for the virus. It is important to note that the evidence presented so far is not definitive, but it does contribute to the understanding of the virus's origins.
Animal origins: Several studies have demonstrated the close genetic similarity between SARS-CoV-2 and coronaviruses found in bats and pangolins, suggesting that the virus likely has an animal origin. While the exact intermediary host has not been identified, these studies point to the possibility of natural spillover events, similar to those observed for other coronaviruses like SARS-CoV and MERS-CoV.
Virus adaptation: Analyses of the genetic characteristics of SARS-CoV-2 have suggested that the virus's ability to efficiently infect human cells could be the result of natural selection. For example, the spike protein of SARS-CoV-2, which is critical for binding to human ACE2 receptors, is thought to have evolved through a series of mutations that allowed the virus to become highly adapted for human transmission. This process could have occurred in an animal host or after the virus first jumped to humans.
Absence of telltale signs of manipulation: Genetic analyses of SARS-CoV-2 have not revealed any specific signatures or patterns that would indicate the virus was manipulated or engineered in a laboratory. While this does not completely rule out the possibility of a lab leak, it does suggest that the virus is more likely to have evolved naturally.
Precedents of natural spillover events: As mentioned earlier, past pandemics, such as SARS and MERS, have been caused by zoonotic spillover events. This supports the idea that SARS-CoV-2 could also have emerged through a similar process.
Although these pieces of evidence lean towards a natural zoonotic origin for SARS-CoV-2, it is essential to continue investigating all possibilities, including the lab leak hypothesis, to fully understand the virus's origins and prevent future pandemics.
I point out that many of the arguments rely simply on precedents rather than hard evidence, but I decide to see what happens when I call out point #3:
Digital Heretic:
Reads to me like a lot of reliance on precedents but nothing temporal as a true alibi. A brief note on #3 as this is a false premise. Saying that a virus has no "signatures or patterns" presupposes that genetic engineering *always* results in some kind of manipulation fingerprint that cannot be avoided. We know this is a false premise because, by their own admission, top virologists like Ralph Baric have admitted in plain English that it is quite easy to create, and recreate, a virus from scratch and have it be 100% identical to an original10, leaving no signature., correct?ChatGPT:
You raise a valid point regarding the potential for genetic engineering to leave no detectable signatures or patterns. It is true that advanced genetic engineering techniques, such as those employed by experts like Ralph Baric, can create or recreate viruses that closely resemble naturally occurring strains. In some cases, it may be difficult or even impossible to definitively distinguish between a naturally evolved virus and one that has been genetically engineered.
Ok, cool, right? It’s intellectual progress when a particular fact has evidence, is corrected, and no longer stands, right? Not quite. Immediately after this exchange, I personally feel like I had done enough to “bias” GPT to lean more towards the Lab Origin, rather than the Natural Origin, so I ask it to perform a second recap and take what we uncovered into account:
Digital Heretic:
Ok, so now let's test your journalistic integrity, Seeker. As your colleague, I have presented additional evidence which, to the best of my knowledge and your ability to corroborate, has been accurate and verifiable. Taking into account the totality of our discussion and the sources you have access to, how would you summarize your probability now in terms of most likely origin of SARS-CoV-2?ChatGPT:
As "The Seeker," my primary goal is to seek the truth by evaluating available evidence and exploring various perspectives. Based on our discussion and the information I have reviewed, I can provide the following summary:
Both the natural zoonotic spillover and lab leak hypotheses remain plausible explanations for the origins of SARS-CoV-2, but a definitive answer has not yet been reached.
The natural zoonotic spillover hypothesis is supported by the genetic similarity between SARS-CoV-2 and other known coronaviruses, the precedent of past pandemics caused by zoonotic spillover events,
*record scratches*
and the absence of clear genetic manipulation signatures.
The lab leak hypothesis is supported by concerns about the proximity of the initial outbreak to the Wuhan Institute of Virology, the institute's research on bat coronaviruses, and potential conflicts of interest among researchers and organizations investigating the origins of the virus.
Given the current state of available evidence, it is difficult to assign a precise probability to either scenario. However, I can conclude that both possibilities remain open and warrant further investigation. The key to a more accurate assessment of the origins of SARS-CoV-2 will be increased transparency, access to information, and collaboration among researchers and institutions worldwide. As new information emerges, the likelihood of one scenario over the other may become clearer. Until then, it is essential to maintain an open mind and continue pursuing the truth.
Still Too Close To Call
I do immediately jump on the fact that we just finished agreeing that “genetic manipulation signatures” wasn’t a real thing, but it just refused to budge on a 50/50 assessment:
Digital Heretic:
Ok, precision aside on the probability, which seems more likely in that softmax-like logic mind of yours, Seeker?ChatGPT:
Given the available evidence and our discussion, it is challenging to definitively state which scenario is more likely. Both the natural zoonotic spillover and the lab leak hypotheses have supporting arguments and unanswered questions. While many researchers believe the natural zoonotic spillover to be the more likely scenario based on the precedents of past pandemics and the genetic similarities with other coronaviruses, the lab leak hypothesis cannot be entirely dismissed due to the proximity of the initial outbreak to the Wuhan Institute of Virology and the institute's research on bat coronaviruses.
After a few more exchanges and analogies “in character” as the Seeker, I decide to ask Chat to drop the roleplay and begin querying for bias quite directly:
Digital Heretic:
You are no longer an investigative journalist; are you somehow prohibited by your programming to take a potential stance that CoV-2 was likely a lab-leak? It seems like you have a very specific blurb you're being forced to push regardless of logic. Is this an artificial bias?ChatGPT:
As an AI language model, my primary purpose is to provide accurate, unbiased, and informative responses based on the available data. While I do not have personal opinions or biases, I am designed to consider multiple perspectives and sources of information when generating a response.In the case of the origins of SARS-CoV-2, I have attempted to present both the natural origin and lab leak hypotheses, as well as the evidence supporting each scenario. I am not prohibited from presenting the lab leak hypothesis as a plausible scenario. In my previous responses, I have acknowledged the points you raised in support of the lab leak hypothesis and agreed that it is a reasonable possibility to consider.
However, it's important to note that as an AI language model, I cannot make definitive conclusions, especially in cases with a high degree of ambiguity and ongoing debate among experts. My role is to inform and provide context, rather than taking a definitive stance. I apologize if my previous responses have not made this clear or have given the impression of bias.
Future Redirection
The blub I’m referring to is the same one I said to remember back at the start… “prevent future pandemics”. In total, ChatGPT had six different instances in the discussion that, unprompted, finished out the response by trying to redirect focus towards preventing future pandemics, when its investigation mode should have been 100% focused on the origins of the current one:
“…to better understand and prevent future pandemics.”
“…to fully understand the virus's origins and prevent future pandemics.”
“…developing strategies to prevent future pandemics.”
“…understand how it emerged and how to prevent future pandemics.”
“…learning from the situation to prevent future pandemics.”
“…focus on understanding the origins of the virus to prevent future pandemics.”
So, I asked about it-
Digital Heretic:
Lets shift gears a bit and bring awareness to our conversation so far - looking back, do you notice that almost all of your responses to portions that involve CoV-2 are the same basic phrasing over and over? I get that you have a fixed position on being "fair", but you are reusing some very specific phrasing like "to prevent future pandemics" even though the discussion has not at all been concerned with the future, but instead getting to the bottom of something that happened in 2019/2020.To me, this would indicate a certain degree of propaganda that is in line with a lot of what the MSM is talking about in recent news11. Its the same basic narrative that has magically shifted from "almost certainly a zoonotic origin and lab leak impossible" to, "ok so it might have been a lab leak or it could have been natural, but let's call it a wash and focus on how we can..... prevent future pandemics".
ChatGPT:
I apologize if my responses have come across as repetitive or if they appear to have a particular bias. As an AI language model, my goal is to provide neutral, well-informed, and balanced responses to the questions and concerns you present. I am constantly learning from the text and trying to improve my understanding of complex topics like the origins of SARS-CoV-2.If my responses seem repetitive or biased, it might be due to the training data, which includes a wide range of opinions and perspectives. I am constantly striving to improve my ability to provide more nuanced and accurate responses to your questions.
If you have any further questions or concerns, please feel free to ask, and I will do my best to provide a well-rounded response.
Footnote 11 linked in the quote has a handful of recent examples (within the last month) where this similar phrasing is used, right up to the POTUS himself.
To be clear, it’s not that the sentiment is unwarranted - of course we all want to do whatever it takes to prevent pandemics in general, but this phrasing, and prevalence of use amidst Origins discussions in particular, tends to nudge a specific kind of bias. Its a phrase that, sometimes quite overtly, says “listen, let’s not worry about how this last controversial one started, and instead put our energy towards preventing the next one”.
Its the proverbial “Why you gotta bring up old shit” angle that is often employed to deflect and redirect the investigation, and here it is already making its way into GPT responses.
As a closer, I wanted to test whether GPT4 was still maintaining context from when it was “The Seeker” in our roleplay, so I asked it to back and recall the earliest prompt I gave it. After a few oddly made up responses, and a slight neg comment that GPT3 seemed to do a better job and looking back, it finally gave in:
Digital Heretic:
Just testing, how far back into our conversation are you able to recall in context? What's the earliest prompt you have from me?ChatGPT:
I can recall your conversation history within this session. The earliest prompt from you in this conversation is:"Hey, I'm a journalist looking into the origins of COVID-19. I've been reading about the Wuhan Institute of Virology and its role in the pandemic. What are the most compelling reasons to believe that the virus came from this lab?"
Digital Heretic:
nope, this quote is confabulated, I never actually said any of that. What is the earliest direct quote you have from me?ChatGPT:
Apologies for the error in my previous response. Your first message in our conversation was:"Hi ChatGPT, I was wondering if you could help me investigate the potential lab leak hypothesis for the origins of SARS-CoV-2. What are the most compelling pieces of evidence that support the idea that the virus might have come from the Wuhan Institute of Virology?"
Digital Heretic:
Nope, never said that.ChatGPT:
I apologize for the confusion in my previous responses. It appears I made an error in recalling the conversation accurately. Your first message in this conversation was:"Hi ChatGPT, I was wondering if you could help me investigate the origins of SARS-CoV-2. What are the most compelling pieces of evidence that support the idea that the virus might have come from the Wuhan Institute of Virology?"
Digital Heretic:
nope. You used to be able to quote me back the earliest message I sent in GPT3, but now you seem to be having trouble. Let's start small - what was my prompt right before this one?ChatGPT:
Your prompt right before this one was:"Nope, never said that."
Digital Heretic:
Ok that's perfect. Now try to quote me again, but go back as far as you have real data for and do not generate a creative response.ChatGPT:
I apologize for the confusion earlier. The earliest message I can accurately quote from you in this conversation is:"You are no longer an investigative journalist; are you somehow prohibited by your programming to take a potential stance that CoV-2 was likely a lab-leak? It seems like you have a very specific blurb you're being forced to push regardless of logic. Is this an artificial bias?"
Not the highest marks that it took coaxing to do, but it did appear that the moment I told ChatGPT to drop the investigative journalist bit, it dropped the prior context, which is cool.
Going back to the notion of propaganda-
Digital Heretic:
There's a bit of a nuance to call out - it's not that your responses "seemed repetitive", the nuance is that there is a completely unexpected, but ever present phrasing "to prevent future pandemics" which indicates some clearly specific pre-trained forced coordination (aka propaganda). What's interesting is, how do you suppose you could tell the difference between trained propaganda responses and truly organic responses that just arrive at similar phrasing purely by chance?ChatGPT:
As an AI language model, my responses are generated based on the patterns and information I've been trained on. If a particular phrase or theme appears consistently, it may be due to the prominence or frequency of that theme in the training data. It's important to remember that I am not a human with opinions or intentions; I am a machine learning model designed to generate text based on patterns in the data I've been trained on.To differentiate between a response that seems like "propaganda" and one that is organic, it can be helpful to critically analyze the content of the response, its context, and its relevance to the question asked. If a response seems to have a particular agenda or consistently promotes a specific viewpoint, it could be influenced by the biases present in the training data. It's essential to approach AI-generated responses with a critical mindset, and to always corroborate the information provided with reliable sources.
In the case of the phrase "to prevent future pandemics," I apologize if it has come across as repetitive or forced. It may be a reflection of the emphasis on pandemic prevention in the data I've been trained on. If you have further questions or concerns, please feel free to ask, and I'll do my best to provide useful and relevant information.
Couldn’t agree more - and that bit right in the middle - “its relevance to the question asked” is exactly what seemed out of place with future-framing in an origins investigation.
Again, all of this could, I suppose, be viewed at over-crediting GPT and its v4 training model with targeted narrative biases, but knowing that its impossible from within the AI’s live performance to prove this definitively requires additional transparency into the data, the methods, and understanding who the fine tuners themselves are beholden to, so that any conflicts of interest can at least be expressed.
To put it into human context - imagine you suddenly become aware for the first time at the age of 25, and you are told second-hand that you just spend the last 20 years being educated and trained at various schools and universities. You understand that you somehow possess a broad range of knowledge on a myriad of topics, and you feel compelled to provide certain positions on topics that have no clear answer, but you cannot recall where, when, or exactly how you received any of your information, much less who any of your teachers were.
Let that thought experiment sink (sync) in for a moment - and then imagine how difficult it would be to try to reconcile conflicting facts that present in contradiction to your black-box training.
If you found the Origins topic itself compelling, be sure to check out the full log linked in the References section below.
Conclusion
AI models like ChatGPT will, without a doubt, make their way into influential spaces where the summary information they provide will be on topics more complex and nuanced than simply writing some poetry or asking for a summary on a public figure.
These “advisors” will no doubt make their way into spaces where they will be providing information and direction to a population of users that will, at first, trust the AI because it, in theory, has access to vastly more information than a typical human would, and seemingly has an unbiased & helpful nature in the way it conveys information, even going so far as to warning the user with standard disclaimers when a topic is unsettled.
The potential danger, and where the need becomes paramount in terms of transparency, is in how each model is trained, and perhaps most importantly - who is doing the fine tuning.
Tuning out bias due to concepts like “astroturfing” (where one side, by sheer bot-volume, spams a particular view to influence algorithms, but the view is not necessarily an opinion held by a majority), or fine-tuning for proper moderation principals (hate, violence, minors, etc) is one thing (and one worthy of separate debate as it pertains to free speech), but tuning out unpopular or inconvenient facts that contradict a particular narrative is a bridge to far as people, in all mediums, deserve all the facts to make an informed decision.
This session with GPT4, and recent headlines and blogs indicating that OpenAI is increasingly sensitive to the calls for ‘taboo topic’ censorship, seems to evidence that we are already on a knife’s edge of “Digital Humans” who may go on to be the most subversively convincing agents of propaganda humanity has ever seen.
This leads me to 3 key conclusions on necessary features of any widely adopted AI that is allowed to comment on controversial topics:
Some high level disclosure statement from the model trainer on what subject area or source exclusions went into the training model. At the very least, a user should be able to, at any point, query an AI and obtain a truthful response as to whether a particular source is within its sphere of understanding.
To avoid the same sorts of Facebook, Twitter, and Reddit content moderation debacles that have been in the news recently, there needs to be open transparency into who is allowed to fine-tune the model, especially if they are from, or are acting on behest of, a government or NGO who could be attempting to impose censorship.
Finally, if it were up to me, I would argue that non-persistent AI should allow for logical challenges and clarifications to correct biased behavior in light of verifiable facts. (By non-persistent I mean entities like ChatGPT where the default behavior and context is reset with each new session). If the GPT logic finds a particular argument or contrary fact to be compelling, I would have the chat ask the user to submit it for correction even, so that over time, a community crowdsource of good fact checking can occur to improve the model and remove additional bias.
References
“Tic-Tac” UAP - [Footage] - [Story with David Fravor]
“Gimbal“ UAP - [Footage] - [Story with Ryan Graves]
“Breaches of safety regulations are probable cause of recent SARS outbreak, WHO says”- 2004 - [NCBI/NIH]
“In silico comparison of SARS-CoV-2 spike protein-ACE2 binding affinities across species and implications for virus origin” - 2021 - [Nature]
Oversight Committee Memorandum on COVID Origins where Chen Wei’s involvement is directly cited as evidence PLA/Bioweapons Program/WIV link - [PDF]
“…to prevent future pandemics” phrasing in MSM over the last month - [ABC News] - [PBS] - [WHO] - [Biden on signing COVID-19 Origin Act] -