AI
California’s AI Court Clerk Reaches Into Criminal Cases
The Los Angeles County Superior Court signed a $314,000 contract with AI company Learned Hand in February, granting the startup access to the country’s largest trial court. Riverside County signed its own $10,000 agreement the same month. Both courts launched pilots without telling litigants their cases could be touched by the system.
Documents obtained by the nonprofit newsroom CalMatters show both contracts extend further than the courts have described publicly, authorizing AI assistance in criminal cases where defendants face incarceration or the reversal of a conviction. The distance between what the contracts permit and what the judges assigned to criminal courtrooms were told is the problem officials are now managing quietly.
Two Courts, Two Contracts, One Missing Disclosure Rule
The California Judicial Council required superior courts to adopt generative AI use policies before deploying such technology, and both pilots launched under those mandates. But the policies contain a significant gap: disclosure is required only if a motion, decision, or other document is written entirely by generative AI. Assisted drafting, research memos, and tentative ruling suggestions that Learned Hand actually produces fall below that threshold. A judge can receive an AI-generated research memo and a suggested tentative ruling, revise a paragraph, and sign it without any disclosure appearing in the public record.
Both courts declined to say whether plaintiffs are aware the tool is being tested on their cases. A spokesperson for Los Angeles County Superior Court said testing is conducted on motions that have already been decided, but the contract permits testing on live cases. The roughly 31-to-1 gap between the two contract values reflects different ambitions: Riverside County is explicitly exploring the technology, while the LA contract carries a detailed roadmap for expansion into criminal, family, and probate divisions.
| Feature | Los Angeles County Superior Court | Riverside County Superior Court |
|---|---|---|
| Contract value | ~$314,000 | $10,000 |
| Pilot launched | February 2026 | February 2026 |
| Current active users | 6 judges and research attorneys | 7 civil/probate attorneys |
| Current scope | Civil cases | Civil and probate |
| Criminal expansion in contract | Yes, explicitly roadmapped | Discussed in internal emails |
| Defendant notification required | No | No |
Inside Learned Hand’s Architecture
Learned Hand combines language models from Anthropic, OpenAI, and Google to function as what the company calls an AI clerk. The tool can draft orders, produce research memos, and suggest tentative rulings. Shlomo Klapper, the company’s founder and chief executive officer, previously clerked for a federal appeals court and worked at Palantir, a data analytics and surveillance technology company. He has framed the product as a necessary response to court backlogs that hiring alone cannot close.
“I think the only solution is to give every single judge and staff attorney their own AI clerk,” Klapper told CalMatters. He acknowledged the tool may carry bias but argued testability is the relevant standard. “I’m not saying my machine isn’t even biased,” he said. “I’m saying we can test it and people have tested it. And that is the benefit over humans.” The company says it tests for bias and accuracy but has not published those results.
Roughly a dozen of the 51 superior courts that responded to CalMatters’ public records requests said they use AI tools from LexisNexis, Thomson Reuters, and Microsoft’s Copilot. The California Judicial Council’s court technology framework required each superior court to create a generative AI policy before deploying such tools – but those policies set the disclosure threshold that currently exempts assisted drafting from notification requirements. Learned Hand goes further than those off-the-shelf products, building its functions specifically around judicial drafting and offering courts an explicit expansion path into every case type, negotiated into each contract before any judge sees the tool run.
The Racial Justice Act Collision
California’s Racial Justice Act, passed by the legislature in 2020 as Assembly Bill 2542, gives incarcerated people the right to challenge a conviction or sentence they believe was infected by racial bias. Petitions arrive directly from prison. A valid petition is unlike any other criminal filing in the state system, requiring evidence that racial bias was a factor in the choice of charges, the plea, the verdict, or the sentence. The evidence is often woven into jury selection records, prosecutorial communications, and language patterns that legal specialists say require careful human interpretation.
The March Luncheon and Tentative AI Decisions
In March, Superior Court Judges Yvette Verastegui and Olivia Rosales, who lead the criminal branch of the Los Angeles court, presented to colleagues at a courthouse luncheon and said the tool could one day help process Racial Justice Act petitions. They described a workflow in which an AI-generated tentative decision would reach judges after a petition arrived from prison, either advancing or denying the case before a human judge made a final call on the record.
At least one judge in attendance was alarmed. “The concern, of course, that I have is that the courts will utilize that as a reference point and then get stuck to that initial analysis,” that judge told CalMatters on condition of anonymity under judicial conduct rules. “It essentially reimposes a one-size-fits-all style of justice.” A second judge who attended said independently they would not trust the tool to summarize any Racial Justice Act petition.
Defenders Call It Bordering on Unethical
Elizabeth Lashley-Haynes, a deputy public defender at the Los Angeles County Public Defender’s Office who specializes in Racial Justice Act cases, called the application “highly problematic and bordering on unethical.” The petitions turn on contextual racial language she says no current AI can reliably parse. “Words that are used in these cases that have racial undertones or racial meanings are way beyond the realm of anything that artificial intelligence could do,” she told CalMatters.
Klapper and LA Superior Court Executive Officer David Slayton both denied the court has any current plans for this use. A court spokesperson confirmed the contract permits it regardless, describing the possibility as one that “has not commenced in any way.”
A Track Record That Gives Critics Ammunition
The case against AI in criminal proceedings is not theoretical. Large language models have a documented history of race and gender bias. An analysis of predictive policing technology used by the Los Angeles Police Department found racial disparities in its outputs, and a separate analysis of the COMPAS risk-scoring algorithm, widely used in criminal sentencing decisions, found it more likely to classify Black defendants as future crime risks than white defendants with similar records. AI systems can replicate or amplify those patterns from their training data.
- Nearly 90 California state or federal court cases have involved AI-related mistakes since August 2024, per researcher Damien Charlotin’s global tracking database
- $10,000 fine was issued to a Los Angeles-based attorney for citing cases that do not exist, described at the time as a historic sanction
- 4 Nevada County prosecution cases contained AI-linked errors, the Sacramento Bee reported this month
- 2 federal judges withdrew AI-tainted opinions in 2025 after admitting their offices used generative AI tools to draft rulings
Those federal judges, Henry T. Wingate of the U.S. District Court for the Southern District of Mississippi and Julien Xavier Neals of the U.S. District Court for the District of New Jersey, were pressed by Senate Judiciary Committee Chairman Chuck Grassley of Iowa to explain their withdrawn opinions. Both confirmed AI involvement: Wingate’s law clerk had used the tool Perplexity; Neals’s intern had used ChatGPT. Both opinions contained fabricated case citations and invented quotes attributed to litigants. Sterne Kessler’s 2025 review of AI hallucination sanctions across US courts found no consistent federal standard for addressing AI errors in judicial proceedings.
A 2025 MIT study found that language models from major companies can reduce critical thinking and brain activity in users, a compounding concern when judges receive AI-drafted memos they did not generate. UCLA Law School professors have predicted that more judges will make AI-linked errors as adoption spreads into courts at every level.
What the LA Contract Permits in Criminal Court
Los Angeles County District Attorney Nathan Hochman reviewed the contract and flagged two criminal motions it explicitly covers. The first is the motion to suppress, the pretrial hearing that determines what evidence prosecutors may present at trial. The second is the motion for post-conviction relief, filed by people already sentenced who seek to overturn their conviction or reduce their sentence. Hochman called these “two incredibly important motions in the criminal justice system.”
“When you’re dealing with someone’s liberty – as opposed to in the civil setting, which is everything other than liberty – the stakes couldn’t be higher,” he told CalMatters. “I don’t want to take the chance, particularly in a criminal case, that AI happens to get it wrong. And now someone’s constitutional rights have been infringed. Someone has gone to prison who shouldn’t have.” Internal emails obtained by CalMatters show Klapper pitching Riverside County executives on applying the tool to “things with the largest paper records” in criminal cases, including death penalty habeas corpus petitions and parole revocation hearings.
- Motion to suppress – LA contract, criminal division, explicitly covered
- Motion for post-conviction relief – LA contract, criminal division, explicitly covered
- Racial Justice Act petitions – permitted by contract language, not yet commenced
- Death penalty habeas corpus petitions – raised in Riverside internal emails
- Parole revocation hearings – raised in Riverside internal emails
An Oversight Framework Still Being Built
David Slayton, executive officer of the Los Angeles County Superior Court, confirmed the tool will be evaluated quarterly, though he offered no methodology specifics when initially asked. A court spokesperson later provided a standard: the tool is measured against expectations applied to human law clerks, including accurate legal research, sound analysis, and neutral writing. No independent auditor or third-party bias reviewer is specified in the criteria or in the contract.
Judge Samantha Jessner, who chairs the court’s Judicial Technology Advisory Committee, told CalMatters she became aware only recently that the tool could be extended beyond the civil division. Judges are excluded from contract negotiations due to ethical constraints on their role. Jessner said the court has a duty to explore AI’s potential, though she did not address how defendants whose motions are processed during the evaluation period would learn of the tool’s involvement, given the disclosure rules currently in place.
In Riverside, Jason Galkin, chief executive officer of the Riverside County Superior Court, kept his description measured. “We don’t even know if expansion is likely so there is no set criteria for what expansion might look like,” he said. His court’s seven users remain in civil and probate matters for now. Hochman’s line was more direct. “The only thing I’m 100% sure of is that AI didn’t go to law school,” he told CalMatters. “It’s the analysis of the case itself, coupled with the conclusions that will be reached, that I’m very hesitant to trust AI at this point.”
Quarterly evaluations for the Los Angeles civil pilot are underway. If the review satisfies court leadership, the contract permits expansion into criminal and family divisions, with no new notification requirement for defendants and no publication of bias test results required before that step. If it does not satisfy leadership, the criminal and Racial Justice Act provisions stay in the contract, dormant but intact, available for the next version of the conversation.
AI
Stak Energy Plans $500M AI Data Center on Alaska’s North Slope
Stak Energy, an Alaska-based startup, has filed for state approval to build a $500 million AI data center on the North Slope, covering more than a square mile of tundra off the Dalton Highway and targeting up to 3 gigawatts of power drawn entirely from natural gas that currently sits stranded beneath Arctic oil fields. On May 12, the Alaska Department of Natural Resources issued a preliminary finding that the project is in the state’s “best interest,” opening a public comment period through June 15 before any land lease can proceed.
The pitch rests on geography: every friction point strangling AI data center buildouts in the Lower 48, from spiking electricity prices to water shortages and community opposition, largely disappears at 70 degrees north latitude. Whether Stak can translate that geographic logic into a funded, turbine-equipped, customer-signed facility by 2028, its stated target for first operations, is the question it has yet to answer publicly.
Project Aaka and the Square-Mile Scope
Stak calls the development Project Aaka, named for the Iñupiaq word for “grandmother.” At full build-out it would place multiple buildings across approximately one square mile of land some 25 miles south of the North Slope’s major Prudhoe Bay infrastructure, connected to petroleum fields by a newly constructed natural gas pipeline. State documents show the power plant feeding those buildings could consume more than twice the natural gas that all of urban Alaska burns annually for electricity generation and combined home and commercial heating.
At a maximum of 3 gigawatts, the project would sit in the same general tier as the largest AI campuses currently under development in the Lower 48. Meta’s Hyperion campus in Louisiana is targeting 7.46 gigawatts; Microsoft and Chevron are building a five-gigawatt gas plant in West Texas; Google is partnering with Crusoe Energy on a 933-megawatt facility in North Texas. Stak’s ceiling is more modest, but its fully off-grid, self-powering design is structurally different from all of them.
| Feature | Stak Energy (North Slope) | Typical Lower 48 Hyperscale |
|---|---|---|
| Power capacity | Up to 3 GW | 0.5 to 7+ GW |
| Power source | Stranded natural gas, fully off-grid | Grid-tied, mixed sources |
| Cooling method | Arctic ambient air | Water cooling towers |
| Estimated water use | 90% below industry norm | Standard industrial volume |
| Average site temperature | 12°F annually | 40°F to 65°F |
| Local ratepayer grid impact | None | Raises local electricity rates |
Stak officials declined to respond to specific questions about the proposal. The company’s prepared statement called the lease filing “an important milestone for anchoring Alaska as America’s at-scale energy solution” and described the company as committed to “responsible development” of Alaska’s economy. No anchor tenants or committed investors are named in any public document filed with the state.
Three Reasons Alaska Beats Virginia
Northern Virginia holds most of the world’s existing data center capacity, but the region’s grid is straining under the load, its water tables are under pressure, and community opposition to new campuses grows louder at planning meetings across the state. Stak’s geographic argument comes down to three physical advantages that no warmer-latitude market can replicate:
- Stranded natural gas with no competing demand. North Slope oil fields hold large reserves of associated gas that petroleum companies have historically left in place because no pipeline connects them to buyers. Stak would face virtually no price competition for that supply, at least at project launch.
- Arctic air cooling. With an average annual temperature of 12°F, the campus would use outside air to cool its servers rather than relying on water towers. The lease application says that approach cuts water consumption by 90% or more compared with industry norms, removing a key source of local opposition that has plagued data center projects elsewhere.
- Complete grid independence. The facility would not connect to Alaska’s main power grid, creating no new load for Anchorage or Fairbanks ratepayers. Data centers in Virginia, Texas, and Oregon have faced backlash precisely for driving up household electricity bills.
Antony Scott, a former commercial petroleum analyst for the Alaska Department of Natural Resources, told Northern Journal that the off-grid structure clears a significant political obstacle. “The issue of data centers and the effect on normal humanity’s electricity bills is causing real angst,” Scott said. On the North Slope, he added, “we avoid all of that. You can just step into this friendly environment.”
The Company and Its Politically Connected Hires
Sparrow Mahoney, Stak Energy’s founder and chief executive, grew up in Alaska and attended Wasilla High School. She launched the company roughly a year ago with a far narrower goal: convert stranded North Slope gas into electricity and use it to mine Bitcoin. Prior involvement in the Iditarod’s push into cryptocurrency was part of her startup background, according to Alaska Public Media. The pivot to large-scale AI and cloud computing has transformed both the company’s pitch and the scale of capital it now needs to raise.
Alaska is the only place this makes sense, long term, for the industry.
Mahoney said this at the May 2026 Alaska Sustainable Energy Conference, where she cited U.S. Department of Energy figures showing the country needs 100 gigawatts of new data center capacity by 2030. To staff up for that ambition, Stak made a series of politically connected hires. John Boyle, who served as natural resources commissioner under Governor Mike Dunleavy, joined the company, as did Jim Shine, a former special assistant at the same department.
Both bring familiarity with the land-use and permitting process the project must work through, including a federal Clean Water Act authorization to build a gravel pad that documents indicate would be nearly twice the size of the one at ConocoPhillips Alaska’s Willow oil field. Fundraising is running through McKinley Alaska Private Investment, an Anchorage-based firm. How far along Stak is toward its $500 million target is not public. Stak Energy’s project materials describe the venture as building “the secure, scalable energy foundation for American dominance,” a framing that fits the current federal energy policy environment comfortably.
The AI Power Crunch That Sent Stak North
Stak’s proposal didn’t arrive in a vacuum. Across the United States, AI training and inference workloads are consuming electricity fast enough to strain every major data center market simultaneously. Former U.S. Senator Kyrsten Sinema of Arizona, now co-chair of the AI Infrastructure Coalition, put it plainly at a May 2026 energy conference: “The number one bottleneck that we face is energy.”
- 6.7% to 12%: the share of all U.S. electricity that data centers could consume by 2028, per a U.S. Department of Energy projection cited in Mitsubishi Power’s sector outlook
- 40%: share of AI data centers projected to face electricity deficits by 2027, per Gartner
- 5 to 7 years: current lead times for new combined-cycle gas turbines, per S&P Global
That turbine shortage affects every natural-gas data center build in the country, not just Stak. GE Vernova, Siemens Energy, and Mitsubishi Heavy Industries collectively supply most of the world’s new gas turbines, and all three are reporting delivery schedules stretching well past 2029. Some manufacturers have reportedly stopped accepting new orders without committing to delivery timelines. U.S. data centers are expected to add between 3 and 6 billion cubic feet per day of new natural-gas demand by 2030, meaning competition for both fuel and turbines will intensify before it eases.
The Gaps the Lease Papers Don’t Fill
No Gas Supply Deal on File
The lease application the Alaska DNR reviewed was thorough on engineering concepts, but contained a material gap: at the time Stak filed it, the company had not struck a firm supply agreement with any petroleum operator on the North Slope. The proposed pipeline connecting the campus to a gas source could run anywhere from 25 to 90 miles, a range that implies Stak has not settled on a specific field or a specific seller.
Antony Scott told Northern Journal that the range gives the situation away. “That means they don’t have a gas supply,” he said. North Slope operators are presumed willing to sell associated gas they currently cannot monetize, but presumed willingness is not a signed contract with price and volume terms. A binding supply agreement is the project’s most pressing next milestone.
Turbine Timing vs. a 2028 Start
Stak says it wants initial operations running by 2028. That target runs directly into the gas turbine shortage. S&P Global’s analysis of the U.S. gas turbine supply crunch puts current lead times at as long as seven years, and some manufacturers have effectively closed order books until 2028 without committing to delivery dates. Getting into a manufacturer’s queue today would not guarantee turbines arrive before 2030 under current backlog conditions.
Stak has not disclosed whether it has placed orders with any manufacturer or secured production slots. For a facility with a 2028 first-power target, turbine procurement is the item that every other schedule depends on. Without it, project timelines are aspirational rather than contractual.
Investors and Customers
The company has said it is raising capital through McKinley Alaska Private Investment, but has not named institutional backers, disclosed how much equity is committed, or confirmed any AI operator as a tenant. Northern Journal reporter Nathaniel Herz found that Stak had not been forthcoming about confirmed customers or investors. The lease application, technically detailed and described by energy experts as thorough, stops short of naming any offtake agreements or committed equity investors.
That profile, a credible site with professionally assembled permitting documents and politically connected advisors but no confirmed anchor tenant, is common at the early stage of large infrastructure projects. It is also the stage at which the most ambitious ones most often stall indefinitely.
Carbon, Permitting, and the Path From Paper to Power
Running 3 gigawatts off natural gas creates a carbon liability that Stak’s potential tenants cannot easily ignore. Technology companies including Microsoft, Google, and Meta maintain formal emissions-reduction targets, and a fossil-fuel-only power source in a remote Arctic location without access to wind, solar, or existing carbon-capture infrastructure creates a sourcing problem for any tenant that must report against a net-zero commitment.
Stak says in its application that it is monitoring carbon capture and storage technology, but acknowledges that the North Slope’s geology for sequestration is poorly understood and that no capture infrastructure exists there yet. For the foreseeable future, each megawatt-hour the facility produces will carry a full fossil fuel emissions footprint.
Governor Dunleavy’s administration formally found the project in the state’s best interest on May 12, and the Alaska DNR public comment portal is accepting input on that preliminary decision through June 15 before any lease can be finalized. If Stak can show a signed gas supply agreement, confirmed turbine orders, and named tenants before the 50-year lease goes final, the project becomes a genuine competitor in the AI infrastructure race. If those three items are still open when a new Alaska governor takes office in 2027, the square mile off the Dalton Highway will be the most detailed unfunded concept in the state’s energy history.
AI
UC San Diego AI Foundation Model Predicts Cancer Treatment Response
Only about 8% of cancer patients whose tumors are sequenced end up matched to an FDA-approved therapy on the basis of genetics. That number reflects not a failure of sequencing technology – a standard clinical panel already finds roughly 11 distinct genetic alterations in the average tumor – but a fundamental shortage of tools for interpreting what most of those mutations mean for treatment choice.
Researchers at the University of California San Diego published a paper Monday in Cancer Discovery describing an AI foundation model called MutationProjector that reads a tumor’s full mutation landscape and predicts treatment response with accuracy matching or exceeding existing methods across several independent patient cohorts. The second-order implication gets less attention than the performance numbers: if a system like this holds up at scale, it changes the purpose of a sequencing panel from a narrow flag-list into a comprehensive treatment-strategy input.
The 92% Gap in Genomic Treatment Matching
- 8%: share of cancer cases successfully matched to an FDA-approved therapy by genetics alone, per the UC San Diego research team.
- ~11: average number of distinct genetic alterations identified in one tumor by a standard clinical sequencing panel.
- 30,000+: tumor genomic profiles across 10 solid cancer types used to train the foundation model.
- 1 gene: the basis for the vast majority of those successful genetic matches – standard treatment stratification typically hinges on a single biomarker at a time.
Genetic testing after a cancer diagnosis has become routine for good reason. The published study in Cancer Discovery notes that DNA sequencing panels flagging alterations in cancer-associated genes are relatively fast and low-cost, with a strong track record where validated biomarkers exist. The problem is that validated biomarkers currently cover a narrow slice of what a panel actually finds.
“Genetic sequencing is already routine in cancer care, but we still struggle to fully interpret the many mutations found in a patient’s tumor,” said Trey Ideker, PhD, professor of medicine at UC San Diego School of Medicine and director of the Big Data Institute at the University of Oxford, who led the research. Clinical treatment decisions based on genetics today still hinge mostly on a single gene, whether BRCA1, EGFR, or KRAS, while the other 10 alterations in the same report sit unused.
Two problems make those unused mutations hard to act on. First, many individual mutations are rare – appearing in only a tiny fraction of patients – which makes them nearly impossible to study through conventional clinical trials. Second, one mutation’s effect on drug response often depends on what other mutations are present in the same tumor. Standard biomarker testing checks each gene in isolation and misses interaction effects entirely.
How the Model Encodes a Tumor’s Full Mutation Landscape
From Mutation Lists to Biological State
A standard biomarker approach scans for a known alteration, confirms its presence, and maps it to a drug. The UC San Diego team’s system works differently. It takes the full set of genetic alterations identified in a tumor and compresses them into what the researchers describe as a “compact representation” of the tumor’s biological state – a summary of which molecular pathways are likely disrupted, rather than a catalogue of individual mutations.
That representation is what the system uses to make treatment predictions. Instead of asking “does this tumor carry KRAS G12C?” as a binary flag, it asks what the combined mutation profile implies about how the tumor’s cellular machinery is functioning. The shift from mutation list to pathway state is the core architectural choice behind the model’s design, and it is what allows the system to produce an explanation alongside each prediction rather than a raw probability score.
The model belongs to a class called foundation models (AI systems pretrained on large, broad datasets and then adapted for specific downstream tasks with relatively small labeled samples). Foundation models underpin large language models in general AI applications; the UC San Diego version runs on tumor genomes rather than text, but the conceptual logic is the same: expose the model to enough examples across enough variation that it can detect patterns no single institution could find in isolation. A 2026 Nature Reviews Cancer perspective on machine learning and genomics notes that as large clinicogenomic datasets mature, such tools have a real opportunity to extract more information from sequencing data and generate therapeutic hypotheses for patients who currently have none.
Why Rare Mutations Become Detectable at Scale
Training on more than 30,000 tumor profiles across 10 solid cancer types gives the model exposure to mutations that no single institution sees often enough to study in a conventional clinical setting. “By pretraining on a large collection of tumors and integrating molecular network knowledge, MutationProjector can detect patterns that would be easy to miss with conventional biomarker approaches. That gives us a way to move from long lists of mutations toward a more functional understanding of the tumor,” said JungHo Kong, PhD, the study’s first author and a postdoctoral researcher in the Department of Medicine at UC San Diego School of Medicine.
The model integrates biological network data – information about how genes and proteins interact – alongside raw mutation profiles. That integration connects mutations to known biological pathways, giving the predictions interpretable biological grounding rather than a pure statistical correlation drawn from the training data.
| Attribute | Conventional Single-Biomarker Approach | Foundation Model Approach |
|---|---|---|
| Input data | One or a few targeted gene flags | Full mutation profile from standard clinical panel |
| Treatment signal | Presence or absence of a specific mutation | Combined pathway-level biological state |
| Handles rare mutations | No – needs sufficient trial data per mutation | Yes – patterns learned across large population |
| Interpretability | Direct binary match or no-match | Pathway-level reasoning per prediction |
| Training scale | Small labeled datasets per biomarker | 30,000+ tumor profiles, 10 cancer types |
| Biomarker discovery | Limited to pre-specified known targets | Can surface unexpected new associations |
Bladder Cancer, Lung Cancer, Melanoma: Validation Across Independent Cohorts
The research team validated the AI tool across several independent patient cohorts – populations whose data were held out from training and tested separately, the standard check for whether a model’s patterns hold beyond the data it saw during training. Across cancers including bladder cancer, lung cancer, and melanoma, the tool matched or exceeded existing methods for predicting response to both immunotherapy and chemotherapy.
Peer-reviewed publication in Cancer Discovery, a journal of the American Association for Cancer Research, placed the results through external evaluation before they were made public. The key performance findings from the published paper:
- The tool matched or exceeded state-of-the-art prediction accuracy in every independent cohort evaluated.
- Immunotherapy response predictions held across multiple cancer types, not just one tumor category.
- Chemotherapy response predictions also held, suggesting the approach generalizes across different treatment mechanisms rather than optimizing for a single drug class.
- Multi-cohort validation separated the results from single-dataset overfitting, a common failure mode in earlier machine-learning oncology research.
Earlier attempts to apply machine learning to cancer treatment prediction have sometimes produced strong numbers on a training dataset and poor performance when tested elsewhere. Building the validation around independent cohorts was deliberate, and the consistency of results across cancer types and treatment classes is among the more credible aspects of the published work.
Two Biomarker Findings the Training Data Did Not Obviously Predict
Beyond the core accuracy results, the model identified associations that were not part of the researchers’ initial hypotheses – and at least one sits in tension with how current clinical sequencing guidance is constructed.
Mutations in the gene KMT2D were flagged as a predictor of sensitivity to immunotherapy. KMT2D encodes a histone methyltransferase involved in gene regulation. Prior research had associated KMT2 family mutations with immune checkpoint inhibitor response in pan-cancer analyses, but the finding had not been incorporated into standard clinical sequencing guidance. The model’s identification of KMT2D sensitivity, surfaced from patterns across the training population, points toward a candidate biomarker for prospective clinical study.
Our results suggest that tumor genome foundation models may help extend the clinical value of sequencing beyond a handful of well-known genes. This could support a more comprehensive and biologically grounded approach to precision oncology.
Trey Ideker made that comment in the UC San Diego release accompanying the paper’s publication on May 26.
The second finding is more operationally immediate. The joint presence of mutations in both SMARCA4 and STK11 emerged as a predictor of immunotherapy resistance. Prior published research had already linked SMARCA4 mutations to poor immunotherapy outcomes in lung cancer and STK11 mutations to resistance in non-small-cell lung cancer. What the model added was evidence that when both appear together in the same tumor, the resistance signal is stronger and more consistent than either gene alone. That co-occurrence pattern is precisely what single-biomarker testing misses: SMARCA4 mutant tumors have been documented as having poor immunotherapy response even at high tumor mutational burden, and the co-mutation picture with STK11 deepens that signal considerably.
Interpretability as the Clinical Bridge
A prediction model that cannot explain its reasoning has limited use in the clinic. Oncologists need to understand which biological features drove a prediction in order to weigh it against other patient factors and make a defensible treatment decision. That interpretability requirement was built into the foundation model’s design from the start, rather than added as a post-hoc visualization layer.
Because the tool maps mutation combinations to biological pathways rather than learning purely statistical correlations, the pathway disruptions it identifies can be read back as a form of explanation. A prediction of chemotherapy resistance, for instance, comes attached to information about which molecular pathways the model weighted most heavily – information a clinician or researcher can evaluate against existing biological knowledge, rather than being asked to trust an opaque score.
That design also shapes how the system can contribute to biomarker refinement over time. When a novel association surfaces, as with KMT2D, the pathway-level framing gives researchers a biologically grounded starting point for hypothesis generation, shortening the distance between a computational observation and a testable clinical question. A 2026 BJC Reports analysis on AI and precision oncology flags dataset diversity as a parallel challenge: training populations that underrepresent certain ethnic and geographic groups risk missing population-specific variations in tumor mutational burden and drug metabolism that affect therapy response – a limitation the UC San Diego team will need to address as the model expands.
Pancreatic Cancer, Prostate Cancer, Liquid Biopsies: The Expansion Roadmap
The published study covers 10 solid cancer types. Among the expansion targets explicitly named by the research team are pancreatic cancer, prostate cancer, and sarcomas – all with large unmet needs in treatment stratification and all representing tumor profiles the current training set did not include.
Beyond adding cancer types, the team wants to incorporate other data modalities. Medical imaging, transcriptomics (gene expression analysis, distinct from mutation detection), and electronic health records all carry treatment-relevant information that standard sequencing panels do not capture. Liquid biopsies – tests that detect circulating tumor DNA (ctDNA, fragments of tumor genetic material shed into the bloodstream) from a blood draw rather than a tissue sample – were specifically called out as a future application, particularly for early cancer detection before a solid tumor is large enough to biopsy directly.
The study listed co-authors from Lunit Incorporated, a Seoul-based AI company focused on oncology applications, alongside the UC San Diego team. That collaboration points toward the kind of multi-institutional data-sharing that expansion will require: access to large international genomic datasets of sufficient quality to train on rarer cancer types and data modalities that are harder to standardize across different health systems.
If that data access materializes and the model holds its performance in new tumor contexts, the current 8% treatment-match rate faces structural pressure from a direction the field has not previously had the tools to pursue. If the gains prove specific to the cancer types and treatment classes in the current training set, the tool joins a growing list of promising oncology AI research that still awaits the harder test of broad clinical deployment – where dataset diversity gaps and regulatory standards will matter as much as benchmark accuracy numbers do.
AI
ByteDance’s 300 Job Postings Reveal an AI Agent OS Battle
More than 300 job openings at ByteDance this spring set off a familiar tech-industry rumor cycle: the TikTok parent was building phones again. ByteDance denied it Tuesday, telling Chinese outlet PencilNews that reports of a restarted self-developed handset project were false, and the denial is almost certainly accurate. The postings span Android Framework engineering, Telephony RIL (Radio Interface Layer, the software bridge between Android and cellular hardware), chip-driver development, radio-frequency design, and edge-side AI deployment, all positions belonging to a company trying to own the operating-system layer that determines what AI agents can see, reach, and pay for on any smartphone. That is the strategic territory the entire global AI industry is now converging on.
What 300 Job Postings Signal About the AI Agent Race
The positions break into two clusters. About 83 are tagged to Doubao Mobile Assistant, ByteDance’s AI agent product. The remaining 236 sit under mobile OS development. Together, they cover Android Framework, HAL (Hardware Abstraction Layer, the interface between Android software and device-level hardware), Telephony RIL, power consumption and thermal management, foldable-screen adaptation, and long-term memory architectures for agents. Several postings carry production-readiness language that does not belong in research roles: IP68 waterproofing specifications, NPI (New Product Introduction, the factory-readiness process for consumer devices), and ODM/OEM mass-production collaboration terms.
None of that is chatbot vocabulary. These positions describe a company trying to own what AI agents can see, touch, and command before any third-party app gets to decide what is permitted. IDC, the technology market research firm, projects 147 million AI phone shipments in China for 2026, a 31.6% year-on-year increase, with 53 percent of the total smartphone market expected to carry next-generation AI capabilities by year-end.
The commercial context sharpens the ambition. QuestMobile, a Chinese mobile analytics firm, tracks Doubao as China’s dominant AI application, ahead of rivals including DeepSeek. Yet despite that position, Doubao still depends on external platforms for every real-world transaction. Book a flight and the agent needs a travel platform. Order food and it requires Meituan’s permission. Send money and it needs Alipay or WeChat Pay. Each of those platforms can, and does, close the door.
- 345 million monthly active Doubao users as of March 2026, per QuestMobile
- 300+ ByteDance job openings for mobile OS and edge-AI roles as of late May 2026
- 147 million AI phone units IDC projects will ship in China in 2026, a 31.6% year-on-year increase
- 53% of China’s smartphone market IDC forecasts will be next-generation AI phones by year-end
The Nubia M153 Test and the Ecosystem Blockade
Capabilities the Prototype Demonstrated
On December 1, 2025, ByteDance and ZTE Corp unveiled the Nubia M153, a 6.78-inch Android prototype priced at 3,499 yuan (approximately $495) and limited to roughly 30,000 units. The device sold out on its first day, with secondary-market prices climbing more than 40 percent above retail. Positioned as an engineering prototype rather than a consumer flagship, it was designed to test one question: could an AI agent embedded at the OS level navigate Chinese apps and complete multi-step tasks without the user tapping a single screen?
Hands-on testing showed it could. The Doubao assistant, activated by a physical side button or voice command, identified a product from a social media post, compared prices across multiple e-commerce platforms, and navigated app interfaces using visual recognition rather than pre-built API connections. Ask it to play a specific podcast episode and it would open the app, search, find the episode, and begin playback without human input. Li Liang, vice president of Douyin Group, described the ZTE partnership on December 9, 2025 as “the starting point for the development of AI phones,” adding that “regardless of whether this attempt succeeds or not, AI is undoubtedly the future.”
The Walls That Super Apps Built
Within days of the launch, major Chinese platforms began pushing back. WeChat login showed abnormalities on the device. Taobao and Alipay raised access restrictions. Meituan’s food delivery flow rejected automated inputs because fraud-detection systems were not built for OS-level agents clicking through their interfaces. ByteDance preemptively suspended banking, payment, and gaming automation on December 5, 2025, describing the move as a “necessary step to ensure the technology has a more solid and far-reaching future.” By December 10, some Alibaba apps partially lifted restrictions after ByteDance disabled corresponding Doubao capabilities for those platforms.
The reasons were not purely technical. An executive source at a major e-commerce company told Japanese financial outlet Nikkei Asia that the real concern was control: an AI agent routing users to the cheapest platform cuts through the closed economic loop each super app depends on for traffic, recommendations, and transaction data. A financial app executive raised the accountability question directly: if a user instructs Doubao to transfer money and something goes wrong, who bears responsibility?
| Doubao Capability | Status During December 2025 Test |
|---|---|
| Cross-platform price comparison | Functional |
| Podcast playback via voice and UI navigation | Functional |
| Ticket booking via Douyin integrations | Partially functional |
| Taobao automated ordering | Blocked by platform |
| Meituan food delivery flow | Blocked by fraud detection |
| Banking and payment operations | Suspended by ByteDance, Dec 5, 2025 |
| WeChat login | Abnormalities reported during testing |
| Gaming rank automation | Suspended by ByteDance, Dec 5, 2025 |
ByteDance’s Unresolved Hardware Problem
ByteDance has attempted consumer hardware across nearly every major category over the past six years, and the results trace the same structural weakness: world-class software, essentially no offline retail presence in China. The Nubia M153’s app blockade is the newest manifestation of that gap, not its origin.
- Pico Interactive, the VR headset company acquired in 2021 for approximately 9 billion yuan, received an additional 10 billion yuan by 2022, bringing total investment to over 20 billion yuan. Consumer sales fell well short of targets and the business underwent its largest-ever restructuring in late 2023, with the consumer division significantly scaled back.
- ByteDance acquired part of the Smartisan Technology team and patents in 2019, following the struggling phone startup’s collapse. The Nut Pro 3 and Nut R2 devices that followed sold fewer than 100,000 units combined on JD.com and Taobao before the Nut phone line was closed in 2021.
- A smart lamp product was disrupted by China’s regulatory environment for education hardware. Several audio hardware projects were discontinued before reaching consumers.
- Doubao earphones and AI glasses remain in development, with the glasses team reportedly drawing on engineers recruited from Meta’s hardware division. Neither product has yet reached commercial scale.
The channel deficit explains why hardware remains structurally difficult for ByteDance. Huawei operates more than 10,000 offline retail stores across China. Xiaomi Home exceeds 16,000 locations. OPPO and vivo together maintain hundreds of thousands of distribution terminals. ByteDance has no equivalent physical retail infrastructure, which means any device it sells must win on spec and online visibility alone, without the floor staff who explain, demonstrate, and convert browsers into buyers.
The Entrance Everyone Wants and Nobody Controls
ByteDance is not the only company that has concluded the mobile OS layer is the prize. The question of who owns the AI agent stratum on a smartphone has become the central competitive question across the global technology industry, with several players that have far stronger hardware positions working on their own answers simultaneously.
AI agents will be foundational to the evolution of super apps, with success depending on deep integration across payments, logistics, and social engagement.
Charlie Dai, vice president and principal analyst at Forrester Research, made that observation to CNBC in January 2026, naming the three capabilities an agent needs to be genuinely useful at scale. Those three are the same chokepoints Doubao cannot currently cross without cooperation from WeChat, Meituan, and Alipay.
The bind is structural and competitive at once. According to Nikkei Asia’s December 2025 reporting, the reason ByteDance chose ZTE’s Nubia as its hardware partner was that the major manufacturers, including Huawei, Xiaomi, OPPO, and Honor, are all building proprietary AI assistants and would not grant a rival’s AI the system-level permissions that make an agent useful. ZTE had the commercial motivation to accept deeper collaboration precisely because it does not hold a dominant market position. The second-generation Doubao phone is again being developed with ZTE’s Nubia, with ByteDance simultaneously negotiating with OPPO, vivo, and smaller manufacturers to embed Doubao as a licensed system assistant on devices they control.
OpenAI, Apple, and the Pursuit of a Clean-Slate Device
Outside China, the same battle is developing on a longer timeline. Supply-chain analyst Ming-Chi Kuo of TF International Securities reported in late April 2026 that OpenAI was accelerating its first AI agent phone, with mass production now targeted for the first half of 2027, pulled forward from a prior 2028 estimate. Per Kuo’s April 26 supply-chain analysis, the device would be built around a customized MediaTek chip with Luxshare Precision Industry, a Chinese electronics manufacturer, as manufacturing partner, with combined shipments projected at around 30 million units across 2027 and 2028.
OpenAI’s theoretical advantage is that it has no legacy app ecosystem to protect. A device designed around agents rather than app icons would not need to negotiate screen permissions with Taobao or Meituan because those platforms would not be part of the interaction model. The disadvantage is the same one ByteDance already encountered: the major apps users depend on were not built for an external AI to operate them, and their owners have commercial reasons to keep it that way.
| Company | AI Terminal Strategy | Hardware Path | Stage as of Mid-2026 |
|---|---|---|---|
| ByteDance (Doubao) | OS-level agent via OEM partnerships | ZTE Nubia collaboration; no self-manufacturing | Gen 2 phone in development with ZTE Nubia |
| OpenAI | Agent phone with app-less architecture | MediaTek chip plus Luxshare manufacturing | Mass production targeted first half 2027, per Kuo |
| Apple | Apple Intelligence, on-device model processing | Full proprietary silicon and hardware stack | Active; phased agent feature rollout ongoing |
| Gemini agents across Pixel and Android OEM fleet | Pixel hardware plus Android OS licensing | Pixel 10 ships with background AI agent features | |
| Huawei / Xiaomi / Honor | Proprietary AI OS with full system integration | Self-owned hardware plus dominant offline channels | Scaling aggressively across distribution network |
For all its model capability and large user base, ByteDance lacks what every other player in that table possesses: hardware sovereignty and a distribution channel that functions without a screen. Among the companies listed, it is the only major AI contender that depends entirely on partners for both the device and the shelf space.
The second-generation Doubao phone, now in development with ZTE’s Nubia and expected to arrive in the coming months, will be the first real test of whether its app-negotiation strategy has advanced since December 2025. If cooperation agreements hold and Doubao secures system-level licensing across a broader manufacturer set, the platform play becomes credible. If the super apps resist again, those new OS engineers solve the technical layer but leave the commercial problem exactly where the Nubia M153 left it.
-
CRYPTO3 weeks agoAndreessen Horowitz Bets $2.2B on Crypto’s Quiet Cycle
-
CRYPTO3 weeks agoCathie Wood Calls SpaceX IPO Demand ‘Voracious’ Ahead Of $1.75T Debut
-
NEWS3 weeks agoGhana CSA Plants Office In Ho As Volta Cybercrime Climbs
-
APPS3 weeks agoGoogle’s Buried Page Reveals 500 Niche Websites Still Making Cash
-
NEWS3 weeks agoHormuud Bets $19 Down Will Finally Pull Somalia Online
-
NEWS3 weeks agoApple Strikes Preliminary Deal For Intel To Make iPhone And Mac Chips
-
NEWS3 weeks agoMetalenz Polar ID Hides Face Unlock Under OLED Smartphone Screens
-
AI3 weeks agoGoogle AI Overviews Adds Subscribed Label, Reddit Quotes Inline
