Introduction
In recent years, particularly in the past few, ICT technologies have advanced rapidly, seemingly bringing significant changes to the overall structure of society. Many people may now be beginning to realize that these changes are also impacting health and medical services. The term “AI” has become ubiquitous, and it seems that everything is being labeled as AI. However, in reality, there are still very few who truly understand AI and are accurately designing its applications and plans. Additionally, the differences between Japan and other countries appear to be significant. Medicine, as an area that exists after a doctor diagnoses a disease, includes drug administration as part of medical practice. The concept of “medicine” itself has the potential to undergo significant changes due to recent scientific advances. Furthermore, we seem to have entered a new world where it is difficult to define the boundaries of medicine, such as how far regenerative medicine and cell therapy fall within its scope. In this new era, nurturing talent in the field of computer science—a new world that utilizes data to advance science—seems crucial. In Japan, the question arises of how to train such data scientists, create an environment that enables them to participate in social transformations and various businesses, and help them thrive globally. For this discussion, we spoke with Dr. Kaminuma, who has been engaged for many years in fields such as medical care, life sciences, drug discovery, and environmental chemical substances from the perspectives of computational science and informatics.
The Dawn of Artificial Intelligence and Pattern Recognition


Upon returning to Japan in the summer of 1971, I joined Hitachi’s newly established information systems research institute. There, I became involved in a project to enable computers to perform precise differential diagnoses of heart diseases, collaborating with Dr. Kiyoshi Machii (then at Mitsui Memorial Hospital, later a professor at Toho University Medical School) and young researchers at the institute. This effort involved transferring the experiential knowledge of human specialists to computers to automate the process of enhancing diagnostic accuracy—what we now call an expert system. This work took place in the early 1970s. Additionally, we explored the use of R. Bellman’s dynamic programming to optimize drug administration based on data from gout patients accumulated at Toranomon Hospital. Incidentally, the term “expert system” was coined by Kulikowski. After moving to Rutgers University, he developed a system using expert knowledge for glaucoma diagnosis, which later became part of Japan’s so-called Fifth Generation Computer initiative.
In 1976, I joined the newly established Tokyo Metropolitan Institute of Medical Science. There, I aimed to create an environment capable of handling all types of medical data, including documents, numerical data, waveforms, images, 3D images, and maps—a platform-like system in today’s terms2). By the late 1970s, I had established a Medical Informatics group focused on developing statistical packages, though we did not engage in clinical diagnostic research. However, I became acquainted with researchers like E. H. Shortliffe, who developed MYCIN, an interactive system supporting antibiotic usage at Stanford, and P. Szolovits, who conducted clinical application research at MIT’s Artificial Intelligence Laboratory. These young researchers, in their early 30s at the time, formed the foundation of medical AI groups in the U.S., which eventually led to the creation of AMIA4). This movement resonated strongly with Dr. Shigeyoshi Kaihara, then an assistant professor at the University of Tokyo Hospital’s Information Processing Department. Dr. Kaihara, with some support from me, leveraged a series of lectures by these young U.S. researchers to establish the Japan Association for Medical Informatics in 1983. However, our research group has since had limited involvement in clinical diagnostic research.
The Dynamic and Ambitious Japan of the 1980s


The Fifth-Generation Computer Project is well known for its goal of creating an implementation environment for expert systems. However, I advocated a “semi-empirical methodology,” suggesting that “directly inputting the knowledge of specialists into a computer cannot yield reliable results. Such knowledge systems should serve as a starting point for improvement through learning from real-world data in practical settings.”
Today’s CBI Society is highly focused on drug discovery, but its initial goal was to integrate systems for compound databases and computational chemistry. Toward the late 1980s, I moved to the National Institute of Health Sciences, where I worked on establishing the newly accessible internet infrastructure, launching projects like the “Global Information Network on Chemicals (GINC)” in collaboration with international organizations such as WHO and UNEP, and addressing emerging issues like microplastic debris in oceans.
Since my retirement in 2001, I have stepped away from active research. Reflecting on that era, the period when I was engaged in research marked a time when Japan’s science, technology, and corporate activities (economy) were truly on the rise. Some researchers at institute meetings even declared that “there’s nothing to learn from America.”


In 1992, as a WHO consultant, I spent a week in Singapore visiting agencies involved in chemical management, and I was struck by the experience. Singapore was ambitiously aiming to establish an electronic government by 2000. Around the same time, at a WHO conference in Malaysia, I was surprised to see participants from various countries, including China, Hong Kong, Canada, and Fiji, all of whom were ethnically Chinese.
Inspired by the introduction of internet and WWW environments to my research institute and the CBI Society, I felt compelled to write a book to highlight the opportunities and warn of the challenges of this era. This resulted in my book *The Third Opening of Japan: The Shock of the Internet* (Nijiman Kaminuma, Kinokuniya, 1994). Unfortunately, it did not receive the attention I had hoped for. I believe the theme of “The Third Opening of Japan” diluted the urgency of the message that Japan needed to address the internet wave immediately. The goal should have been not an “opening” but a “reformation.”
Subsequently, the nations and companies that embraced the internet wave experienced significant growth. In this sense, the 1990s was a major turning point, and the defining keyword or zeitgeist of the era was undoubtedly the response to the (opened) internet.
Understanding Data Science


For instance, in the context of medical diagnosis, doctors constantly make decisions. The foundational knowledge, information, and materials they rely on are diverse, including language (descriptions, natural language), waveforms, images (photographs), and observations of movements. Today, many of these elements can be handled digitally, though not completely. At a basic level, they encompass language, waveforms, diagrams, and images, including videos. This is a classification by data format.
Another classification pertains to data associated with domain-specific knowledge. For example, molecular spectra and structural data related to molecular interactions are vital to chemists; genomic sequences matter to geneticists and biologists; and images of malignant skin tumors are tied to the expertise of pathologists. Chemists may see proteins as large molecules, while biologists or informaticians regard genes as nucleotide sequences (codes).
There is also a cognitive purpose-based classification, including deduction (reasoning), induction, creativity (inspiration), and planning techniques for goal achievement. For instance, the first data discrimination (classification) technique I worked on in the 1960s, which would now be called an SVM (Support Vector Machine), was related to quadratic programming methods. However, its use was primarily in pattern classification and inductive reasoning (inference or conjecture). In other words, cognitive computational techniques (AI) are intricately nested within these domains.
My classification is experiential, and I am not aware of others’ perspectives. Given the current AI boom, I think there’s a need to explain these data science techniques and their classifications in simple general terms. However, to be honest, it’s a formidable challenge.

Figure 1: Computational thinking techniques. In scientific research (D2K Science) that generates knowledge from data, thinking relies on a combination of universal theories and techniques with domain-specific knowledge, theories, and techniques. Data is derived from searches of existing databases, measurements and investigations in laboratories, and records from medical services (clinical records). While computers cannot generate ideas themselves, they can assist experts in generating ideas through data visualization.


In the United States, there are departments or faculties of Computer Science, where Ph.D. degrees are awarded. While the situation may be different now, in Japan, creating an excellent program alone was often insufficient to earn a doctorate. This may be one of the factors contributing to the shortage of talent in this field in Japan.
Advancements in Medicine, Health Science, and Genomic Decoding


I’ve proposed the concept of “Health Metrics.” Intervention at the early warning stage is the most effective, as outlined in a roadmap by former NIH Director Elias Zerhouni. While I advocate for interventions at the tertiary prevention stage, there’s growing recognition in the U.S. of the importance of maintaining health through lifestyle innovations in next-generation healthcare8). The key question is whether aging should be considered a disease to be treated. Recently, there has even been research using AI to measure healthy life expectancy9).


Transforming genetic data into breakthrough drugs is still in its infancy and will take time, but I believe progress is steady.
Bio-Pharmaceutical Development and Strategic Data Science


This is a very personal impression, but based on my experience since the 1970s working on immunology (such as hepatitis B and complement systems) from a computational application standpoint, I’ve found immunology to be extremely challenging to handle from a systems (engineering) perspective. This difficulty arises because the subjects—cells and molecules (like complements)—continuously change their states. I believe this issue is also relevant to regenerative medicine. It relates to the challenge of “cell fate and control,” which suggests that computational applications require some innovative approaches. However, this very complexity makes it an appealing and worthwhile area to tackle from the perspective of computational and information techniques.
That said, addressing this challenge requires teams composed of biomedical experts and younger generations specializing in computational and information techniques. One example, based on online information, is the European (German-French) LifeTime Initiative12).
In any case, cancer, immunology, and research, as well as clinical applications, appear to be advancing at an accelerated pace. However, I feel that adequate frameworks to fully process and utilize the data generated are not yet in place. Perhaps it’s necessary to start by focusing on specific targets, such as cancer or autoimmune diseases, and systematically collecting clinical data related to them.
Strategic Use of Public Funding


In AI, I believe there should be increased funding for socially critical issues, such as nuclear decommissioning, fraud prevention (e.g., “It’s me” scams), and understanding and addressing dementia. Furthermore, I believe the government should support the development of cancer treatment drugs as a response to the aftereffects of nuclear exposure. Innovation doesn’t emerge from merely aiming for it; it arises from tackling the challenges that need to be addressed.
Regarding biobanks and cohort studies, the 10th issue of *Drug Discovery Plaza* discusses baseline studies. What’s currently needed is a numerical understanding of normal states relative to disease states. Additionally, there needs to be more recognition of the importance of clinical record management techniques and databases that can generate disease-specific knowledge from real-world data (RWD).
For instance, the U.S. company Flatiron Health was acquired by a major pharmaceutical firm for a substantial amount, likely due to the value of its techniques for recording clinical (support) data. Analyzing such data is expected to yield useful insights into cancer care. A similar approach could be applied to clinical records and database construction for autoimmune diseases like rheumatoid arthritis. These can be considered “heuristic databases.” The same principle applies to biobanks.
Developing such systems in Japan is extremely challenging, but analyzing why this is the case could reveal new areas for innovation.


Perhaps self-serving, but the nonprofit organization we established, the Institute for Cyber Bond (ICA), aims to address these two changes. Specifically, we are promoting “participatory healthcare.” In simple terms, this means encouraging patients and citizens to make smart use of existing services while taking active steps to manage their health through measures like exercise, diet, sleep, and environmental adjustments without necessarily relying on prescriptions.
In Japan, there is still a prevailing ethos of “patients should just follow instructions,” reflecting a paternalistic approach among doctors. While such paternalism has also been criticized in other developed nations, there is growing recognition in the U.S. and Europe of the importance of patient-centric care. In Japan, companies like Takeda Pharmaceutical have started adopting a “Patient First” approach. In contrast, we advocate for empowering citizens and patients to take proactive roles in maintaining their health and addressing illnesses.
Lifestyle diseases are a prime example where ensuring behavior changes among citizens and patients is crucial. Practical methods to achieve these changes are at the heart of what we should be implementing.
The Challenges of Developing Data Science Experts




Figure 2: Diagram of relationships between academic fields. Artificial intelligence is a study that corresponds thought processes to calculations (algorithms). Its foundation lies in mathematical or computational techniques common across various academic disciplines. The theories and computational techniques in the diagram share common concepts. This relationship suggests that pattern recognition, artificial intelligence, machine learning, and D2K Science are deeply connected to existing fields such as mathematics, natural sciences, and engineering (including computer hardware) but are likely to evolve into independent academic domains.

I foresee that current data science and AI will evolve into a new academic domain, independent of mathematics, natural sciences, and engineering, which could be called “quantitative thought studies.” This field will include applications in healthcare and drug discovery.
One of the core challenges in understanding informatics or computational techniques, including AI and machine learning, is the rapid pace of technological advancements and the increasing breadth and depth of their applications. Moreover, acquiring these skills tends to favor younger generations while being more challenging for older individuals, akin to gymnastics or swimming, where early involvement offers significant advantages. Consequently, a gap widens between developers and users of data science technologies in fields requiring years of expertise.
This generational gap raises issues of how to harmonize the roles, job functions, and career advancement of younger professionals. Their work often does not align with traditional evaluation criteria for researchers in science and engineering. This misalignment is because platforms for presenting results, such as conferences, journals, and research communities, have not kept pace with their work. It is not uncommon for data scientists to be appreciated for their contributions but not adequately compensated or recognized in terms of position or salary.
In the private sector, hierarchical structures like subcontracting often result in poor recognition and treatment of IT specialists who perform the actual work. While this issue resembles the treatment of skilled experimental technicians, it differs in that computational techniques directly relate to human cognitive tasks like reasoning and decision-making. These techniques may lead to “creative destruction,” altering the design of work or research itself.
Given the rapid pace of ICT advancement, professionals focusing on computational techniques need dedicated environments to specialize. Becoming an expert takes time, and young professionals should ideally develop their skills by tackling real-world problems. Creating an environment that facilitates this approach is crucial.
In Japan, particularly in healthcare, what is lacking are strategies and the logistics mindset. From a strategic perspective, NIH declared its commitment to transforming into a data science-based organization14), with “continuous knowledge generation and utilization” as its goal. Data science is inextricably linked to knowledge generation. In the U.S., the National Library of Medicine (NLM), whose director in the 1980s, Donald Lindberg, was both a physician and an AI researcher, has become a foundational institution for genomic medicine15). Japan lacks such a foundational institution. Additionally, the application of AI techniques requires a glossary of technical terms. Japan has few researchers in natural language processing related to healthcare, compounded by challenges with the limited user base of the Japanese language.
To promote data science in Japan, a robust support infrastructure is essential. Training human resources must also be viewed from this perspective. In drug discovery, as NCATS suggests, it’s vital to restructure workflows, particularly pipelines, in pharmaceutical companies and reassess each component from a data science perspective16).

Thank you very much for sharing your valuable insights today.
References
- Satoshi Watanabe, Knowing & Guessing, John Wiley & Sons, 1969 (Japanese translation: Yoichiro Murakami and Nobuharu Tanji, *Knowledge and Guessing: Scientific Epistemology*, 4 volumes, Tokyo Tosho, 1975); Satoshi Watanabe, *Recognition and Pattern*, Iwanami Shinsho, 1978.
- Tsuguchika Kaminuma, *Medical Innovation and Computers*, Iwanami Shinsho, 1985.
- Tsuguchika Kaminuma and Shusuke Kurashina (translators), *Computer-Based Medical Consultations: MYCIN* (original by E. H. Shortliffe, Elsevier, 1976), Bunkodo, 1981.
- AMIA: American Medical Informatics Association
- C. A. Kulikowski, E. H. Shortliffe et al., *AMIA Board white paper: Definition of biomedical informatics and specification of core competencies for graduate education in the discipline*, J Am Med Inform Assoc (2012). doi:10.1136/amiajnl-2012-001053
- R. Feynman, *Feynman Lectures on Computation*, Addison-Wesley, 1996.
- Translated by Tsuguchika Kaminuma / Supervised by Yukio Tada and Masashi Horiuchi, *The Truth About Drug Discovery: From Clinical to Investment* (Revised Edition, Nikkei BP, 2014): Original by Bartfai T and Lees GV (2006), *Drug Discovery: From Bedside to Wall Street*, Elsevier/Academic Press: Amsterdam.
- Tomomi Usui, *The State of U.S. Healthcare: Introduction of Lifestyle Intervention Education into Medical Training*, Biofeedback Research, 45(1): 19-23, 2018.
- A. Zhavoronkov and P. Mamoshina, *Deep Aging Clocks: The Emergence of AI-Based Biomarkers of Aging and Longevity*, Trends in Pharmacological Sciences, 40(8): 546-549, 2019.
- F. S. Collins and H. Varmus, *A New Initiative on Precision Medicine*, The New England Journal of Medicine, 372(9): 793-795, 2015.
- CAR-T Therapy: A treatment that extracts the patient’s own T-cells, modifies them, and reinfuses them back into the patient.
- European LifeTime Initiative (https://lifetime-fetflagship.eu/).
- Yukio Tada, *The Story of Lonsurf Development*, CBI Journal, 7:13-39, 2019.
- *NIH Strategic Plan for Data Science* (https://datascience.nih.gov/sites/default/files/NIH_Strategic_Plan_for_Data_Science_Final_508.pdf); C. Mura, E. J. Draizen, and P. E. Bourne, *Structural biology meets data science: Does anything change?* Current Opinion in Structural Biology, October 2018.
- D. A. B. Lindberg, MD, *The National Library of Medicine, 1984*; NCATS *Drug Discovery, Development and Deployment Maps* (https://ncats.nih.gov/translation/maps).

Tsuguchika Kaminuma
Born in 1940 in Kanagawa Prefecture. Educated at International Christian University, Yale University, and the University of Hawaii, he earned a Ph.D. in physics. From 1971, he worked at Hitachi Information Systems Laboratory, Tokyo Metropolitan Institute of Medical Science, and the National Institute of Health Sciences. His research spanned pattern recognition, medical artificial intelligence, medical information systems, bioinformatics, and chemical safety. In 1981, he founded an industry-academia-government research exchange organization (now known as the CBI Society) aiming for theoretical drug design. Later, he was involved in interdisciplinary human resource training at Hiroshima University and Tokyo Medical and Dental University. In 2011, he established the Cyber Bond Institute NPO.

Hisashi Iwase
Advisor for Life Science Innovation at the Japan Analytical Instruments Manufacturers’ Association (JAIMA), and President & CEO of BioDiscovery, Inc. Born in 1951 in Tokyo, he graduated from the Department of Industrial Chemistry, College of Science and Technology, Nihon University. He gained extensive experience in management and marketing of analytical and bioscience instruments at Merck Japan, Waters Japan, Millipore Japan, PerSeptive Biosystems Japan, Applied Biosystems, Varian Technologies, and Agilent Technologies. In 2001, he established BioDiscovery, Inc. Since 2013, he has also served as an advisor for Life Science Innovation at JAIMA.