Thursday, June 30, 2005

 

GECCO-2005: The Day After

As I am recovering from the long return trip (weather problems), I thought I would post a few words on my visit to GECCO-2005.

I attended a number of interesting talks, tutorials, and workshops. Most of all, I enjoyed EDA (estimation of distribution algorithms) sessions and the workshop OBUPM-2005. I am happy that our EDA track attracted so many good papers and so many GECCO attendees came to EDA events, and I hope that the next year will be even better. The next year's EDA track chair is Peter Bosman.

I also enjoyed the bioinformatics tutorial given by James Foster and Jason H. Moore and the XCS tutorial by Martin Butz. Although I could not attend Kumara Sastry's tutorial on principled efficiency enhancement because of OBUPM-2005, I know the material and I find these results impressive.

I was also happy to see new work on hierarchical problem solving presented by Edwin de Jong. I hope that there will be more work in this area in the next GECCO and other genetic and evolutionary computation conferences. Just before the talk on hierarchical problem solving, Martin Butz had a great talk Kernel-based, Ellipsoidal Conditions in the Real-Valued XCS Classifier System. Although I could not attend Daan Wierstra's talk Modeling Systems with Internal State using Evolino, I've read the paper and I think the results of this work are impressive.

The above list of interesting talks and tutorials is far from complete but I think there are already quite a few posts regarding this. So, to conclude, GECCO-2005 was a great event and I hope it continues to get better in future. The organizers did a great job. I also had a chance to meet many good friends and great researchers, which is yet another great thing about GECCO.

Wednesday, June 29, 2005

 

Live from GECCO-XI: The war of the worlds

Last day in GECCO. Wednesday is always as intense as any other day in the conference. Last minute discussions, hugs with friends that we won’t see till next year if we are lucky, business cards exchanging at an incredible pace. Today my post is going to be short. I spent most of the time talking to people and attending a few talks on EDAs. To celebrate another thrilling GECCO some of us went for dinner to a nice Indian restaurant in Georgetown---Pier Luca’s fault again---and then, we went to watch the opening of the War of the Worlds. I am back to the hotel now. I need to rush now to pack and get some sleep before we ride back to Urbana. Yes! We are leaving at 6am ;)

 

2005 Human-competitive results winners announced

This morning in the business meeting of SIGEVO, Riccardo Poli announced the winners of the 2005 Human-Competitive Results Competition (the "Humies"). A previous post discussed the contest, and over 20 applicants vied for $10,000 in prize money. The judges selected two Gold Medalists, Randy Bartels for using GAs to shape laser pulses to effect quantum dynamics control (see here) and Preble, Lipson, and Lipson for using genetic algorithms to design photonic crystals (see here). Other medalists used GAs to evolve KOTHs in Core Wars, top backgammon, chess, and robocode players, optical lenses that infringe on existing patents, fingerprint wavelets that beat the FBI wavelet, and a scalable quantum Fourier Transform.

The Humies were a highlight of the conference, and the competition will heat up as the word spreads.

Tuesday, June 28, 2005

 

Live from GECCO - X

After a very interesting keynote speech by Dr. Joshua Epstein, I attended the EDA session in the morning, where four interesting papers were presented.
Claudio Lima gave a wonderful presentation on hybridization of competent recombination and mutation operators. Alden Wright presented a very interesting paper on combining factorization distribution algorithm with linkage detection method. Of course, as usual Martin Pelikan gave a wonderful talk on scalability of multiobjective estimation of distribution algorithm.

 

Live from GECCO-IX: Building block in GP?

Today I was chairing GA-7 session, and one of the paper quickly drew my attention. The paper is entitled "Schema disruption in tree-structured chromosome" by William Greene. Basically, it's a sequel of his paper in GECCO 2004: "Schema disruption in chromosomes that are structured as binary trees." I suspect some theories that he developed can be borrowed to say something about building blocks in genetic programming, but I'm not really knowledgeable enough to say if there's any connection or not. Maybe Kumara can see some connections?

 

Live from GECCO-VIII: Reflections on a compact classifier system

I am sitting on some of the EDA sessions and I can help thinking about a discussion I had with Tian-Li Yu when I was preparing the papers about the compact classifier system. The discussion was about the main differences of DSMGA when compared to eCGA or BOA. DSMGA, unlike eCGA and BOA, provides a crisp and clear presentation of the identified building blocks at the end of a run—just to mention the reason of my wondering. Once the population has converged, eCGA and BOA models just indicate that all the variables are independent. From a substructure identification point of view, the final DSM gives a clear picture of how the problem looks like. Such a property keeps getting my attention every time I sit in an EDA session.

Mental not to self: I need to talk to Tian-Li again. Genetics-based machine learning systems that can provide problem decomposition at the end of the run is one of the challenges I would like to see more solutions on.

 

Dead from GECCO

At long last I arrived at GECCO yesterday afternoon. Saturday was the lil penguin's Bar Mitzvah, and he did a wonderful job. On Sunday, we drove the older penguin to Northwestern University for the NHSI program on journalism (the famed "cherub" program). On Monday, an exhausted father, scraped himself up, drove to Bloomington and arrived at the GECCO venue at L'Enfant Plaza Hotel at about 5 pm.

The conference was in full swing with nearly 600 attendees. Both keynote speakers were a hit, and I helped judge the Humies (or Human Competitive Competition) in the morning. Excellent finalist presentations, and I'll have more to report on that score later.

Tonight is the poster session, and the the lineup appears to be very strong. This conference marks the 20th anniversary of regular genetic algorithms conferences with the first ICGA being held in Pittsburgh at CMU in 1985.

Monday, June 27, 2005

 

Live from GECCO - VII

Phew! I finished all my presentations today! Yesterday, I had to give a presentation in the OBUPM-2005 workshop and then a tutorial on efficiency enhancement, for which I was preparing the first two days. I had my final presentation in the morning on sub-structural niching and then it was time to finally attend differen talks without being stressed out or preoccupied.

Martin Butz gave a great talk of combining XCS with Bayesian optimization algorithm, followed by Stefan Dorste's nice presentation on some theory of compact genetic algorithm.

Lam Bui and Hussein Abbass presented an interesting paper on using fitness inheritance in noisy multiobjective problems. Xavier gave a wonderful presentation on combating user fatigue in interactive genetic algorithms.

A few of us also managed to visit National Gallery of Art after lunch.

 

Live from GECCO-VI: Marketing and genetic algorithms

In one of today’s GECCO sessions, I run into a talk by Doug Newell founder of Genalytics. First I wasn’t sure what he would cover, but in the end it was a very interesting talk. He covered several topics. He started talking about direct marketing. The bottom line, direct marketing is still using mail as a main stream--yes, regular mail. Corporations are working to learn how to improve their targeting capabilities. Sophisticated analytics can improve the targeting up to a 25%. However, statisticians, usually using SAS, and taking around 3 weeks synthesize such models for improvement. Some of the methods use regression, logistics egression, neural networks, and decision trees. Doug show some of their work were GAs can beat the traditional approaches. Another friendly marketing area for GA applications—in Doug’s words--is fraud detention.

Next talk was about Eric Bonabeau presented some of the work of Icosystems on interactive evolution. They path--different from our current work on iGAs--empowers users with all sorts of freedom over the evolutionary process. One application: tile design. They are interested in the navigation across the search space, which is why they are more interested on the evolutionary path, instead of getting a final solution.

Mental note to self: Lack of fit of the adjustment of the model (LOF) by Flor Castillo (Dow Chemichal). Any of these statistics could be used in genetic-based machine learning?

 

Live from GECCO-V: Time continuation issue

In one of the GA session (GA-3) that I attended, two paper (entitled "Improvements to Penalty-Based Evolutionary Algorithms for the Multi-Dimensional Knapsack
Problem Using a Gene-Based Adaptive Mutation Approach" and "Adaptive Isolation Model using Data Clustering for Multimodal Function Optimization") proposed a similar scheme of GAs: they both try to speed up the convergence, and then re-initialize the population to explore the search space. I think this trend leads to the issue about time continuation (for more information, please refer to [Goldberg, 1999]). The issue there is, should we use one epoch, large population, or multiple epochs, but smaller populations? Both the papers show promising results based on their test function; however, no theory or model is developed to support their results. I think we need more theories to guide us in this direction so we can better design the algorithms.

Sunday, June 26, 2005

 

Live from GECCO-IV: The right tutorials

GECCO, day 2. Still blogging offline. The conference has not started yet, but today I am chasing tutorials. I always like the chance to attend different workshops and tutorials for free two days before the conference. As mentioned in an early post, yesterday we had the eighth International Workshop on Learning Classifier Systems. Today, there are two related tutorials.

The first tutorial was Martin Butz’s and “The XCS Learning Classifier Systems: From Theory to Applications”. The second related tutorial was by Tim Kovacs with a general introduction to “Learning Classifier Systems”. Both gave a nice overview on the Michigan style classifiers---should we start a no Pittsburgh left behind tutorial? ;)

Mental note to self: could we rearrange the schedule next year so we can have the tutorial before the workshop? It may help bring more people in to the LCS world.

Another mental note to self: there is no talk about creativity and innovation support, should we have a talk about the DISCUS effort next year?

A final note for Nosophorus. On my way from Martin’s tutorial to the OBUPM 2005 workshop I saw Ingo Rechenberg getting ready for his tutorial. Sorry, I could not stop by, but I can tell you that the room was getting crowded. Then, I came back because Kumara Sastry was giving his in the same room. I just so people still talking about his presentation.

Saturday, June 25, 2005

 

Live from GECCO-III: The Eighth International Workshop on Learning Classifier Systems

Finally, it happened. Today, from 8:50am to 6:30pm we have been involved in IWLCS—you may find the schedule here. In my humble opinion, this has been a great and lively workshop—I saw business card changing hands, envelope calculations, and hallway discussions. We had 17 papers, 3 on going works, and a big discussion session.

The morning stared with the “Methods and Scenarios” session. From the problems with class imbalance to policy transfer a big deal of interesting ideas were proposed. The same stream of interesting ideas happened with the second, and last, morning session “Representations”. There were too many ideas to choose a small subset, but to give you and idea, the “Continuous Actions” talk by Stewart S. Wilson start breaching the dam toward such kind of actions.

Lunch break. We tried to go for lunch near the hotel. The part of D.C. where GECCO is being held does not seem to be friendly during the weekends. Eventually, we had lunch at the food court of the Smithsonian Museum. It was the only place that our exploration mode was able to find before we turned into exploitation mode.

And talking about exploration/exploitation modes, that one was one of the hot discussion topics in the first afternoon session “Reinforcement Learning”. “Multiagent Systems”, “Applications”, and “Environments”, were the other three afternoon sessions. Some ideas also expanded sentinel class predictor reminded me similar efforts on chance discovery. Some other work showed clear interactions with reinforcement learning theoretical work. Interesting remarks were also done into the realm of multiple rewards are available.

Oh, I almost forgot. Yes, as usual, we all got for a big and relaxing dinner together.

 

Live from GECCO-II: Parameter setting workshop

This morning I attended Fernando Lobo and Claudio Lima's workshop--parameter settings in genetic and evolutionary algorithms (PSGEA 2005). It consists of two sessions. The first three talks were about GPs, and the final three were about GAs. One of the main themes of the presentations was about population sizing; however, studies on parameters settings concerning crossover and mutation were also presented. More details about the talks can be found here.

 

Live from GECCO - I

Xavier, Tian-Li, Paul, Kazuhisa and I left Champaign at 3:00 AM and drove to Washington DC and reached our hotel at around 7:00PM. Xavier and I made the slides for some of our GECCO presentations in the van!

Due to some problems, the wireless access in not yet available, and hopefully it will all be sorted out by tomorrow and we can blog live. Now back to the conference, and we will blog about proceedings of the day soon.

Thursday, June 23, 2005

 

Educating a penguin: Part IV

My absence from the airwaves was due to my impromptu college visits with Max Tuesday and Wednesday. We visited Earlham College, Denison University, and Kenyon College. I would like to blog at length about the visits, but suffice it to say right now, that visiting small colleges as opposed to hotshot Ivies is the difference between night and day. Incredible personal attention, lovely, well-designed facilities for small numbers of students, just a different planet. Makes me wish I had my own college education to do over. Max's ranking of the three was (1) Earlham, (2) Kenyon, and (3) Denison. More later, but my other penguin is having his Bar Mitzvah Saturday. To read or review parts I-III, click here.

 

Local recommendations for GECCO

Paul Wiegand passes along the following website as containing useful information for GECCO attendees. GECCO-2005 starts on Saturday and goes until Wednesday. IlliGAL Blogging bloggers intend to blog live from the conference for those of you who can't attend (and for those of you who can).

Tuesday, June 21, 2005

 

Using genetic algorithms for studying climate change impact

A few days ago, ScienceDaily published an article York Scientists Warn Of Dramatic Impact Of Climate Change On Africa, which reports the use of a genetic algorithm to study the response of plants to climate change and fill in gaps in data.


Because of a scarcity of hard data, the team used a computer programme written by Dr Colin McClean, of York's Environment Department to study the response of plants to climate change.

Dr Lovett added: "We needed a method that would help fill in gaps in knowledge. The technique we used is called a genetic algorithm because it works in a similar way to the effect of evolution on chromosomes - the programme combines different variables in lots of different ways and the bad fits are knocked out, leaving the best solutions."


More can be found here.

Sunday, June 19, 2005

 

Michelin, the tyres, and the bad publicity

Some of the IB members went today to the US Grand Prix of F1 at Indianapolis. It has been a remarkable unique Grand Prix. You may be already aware that no pilot or car has been the rising star of this race. The work of Michelin has put them in the hall of fame. They better adapt quickly, reliably, and accurately. A mistake like this can ruin a year of good work as fast as you snap your fingers.

Below you can find a fragment of one of the several letters that FIA and Michelin have been exchanging this weekend. You may find the letters here and here.

[...] We (FIA) are very surprised that this difficulty has arisen. As you (Michelin) know, each team is allowed to bring two different types of tyre to an event so as to ensure that a back-up (usually of lower performance) is available should problems occur. It is hard to understand why you (Michelin) have not supplied your teams with such a tyre given your years of experience at Indianapolis [....]

 

Technorati crawling IB again

For some reason, blog search engine Technorati has not been crawling IlliGAL Blogging for the last month or two. The trouble is apparently cleaned up, and you can once again search for IB results using Technorati.

Thursday, June 16, 2005

 

Automating Invention

Robert Plotkin recently started up a blog called Automating Invention that discusses the social and legal implications of computer-automated invention. A full-time patent lawyer and part-time professor at the Boston University School of Law, he offers some noteworthy insights into software patents and addresses David Goldberg's post on the term "Computer Science vs. Computer Engineering" and how this issue affects the law's perspective on software.

Wednesday, June 15, 2005

 

Free Day with The Economist

The inventive people at Philips are offering a free day pass to The Economist, where you'll not only find a great alternative to Fox News (also known as "LiarTV"), but also this nice article on bio-inspired design.

Tuesday, June 14, 2005

 

Of interest to lonely genetic algorithmists

Working on genetic algorithms is a lonely pursuit, so the following article should be of interest to IlliGAL Blogging readers, as should the following social networking site.

 

HGB posts GECCO table of contents

Hans-Georg Beyer, editor-in-chief of the GECCO conference proceedings this year, just sent me an email saying that the GECCO-2005 proceedings table of contents is available at this link. Instructions for full citations are available here.

The 2005 conference program is quite impressive (here), and HGB and his team of track editors deserve a round of virtual applause from the blogosphere (what sound do bits make when clapped together?). Great job, a job made more challenging by the transition to ACM status under SIGEVO.

 

Is GECCO ready for IlliGAL Blogging?

GECCO-2005 is ready to go, with what looks like record attendance, a terrific program, and live coverage by IlliGAL Blogging bloggers. Wireless internet willing, our band of bloggers will cover happenings at GECCO from Saturday to Wednesday, 25-29 June 2005. Read IlliGAL Blogging for live GECCO coverage, or better yet, visit us at the conference.

Sunday, June 12, 2005

 

Mark Steyn sets me straight on China

I came back from my recent trip to Shanghai with advice to my sons (see here), but Mark Steyn's column has me wondering. Read the whole thing especially the last paragraph. via Instapundit

 

Hot deal on Aristotle

I'm 2/3rds of the way through a terrific short lecture series on the Ethics of Aristotle given by Father Joseph Koterski at Fordham University. The series is a blow-by-blow description of the Nicomachean Ethics, and in 12 sessions Father Koterski highlights the key ideas in a way that makes reading the Ethics so much easier. The course is part of the Teaching Company's stable of courses that I have discussed previously (here).

 

Why did CS defect?

In my previous post I included computer scientists in my discussion of specialization and the postmodern engineer. Doing so reminded me that one of my pet peeves over many years has been that the field of computer science chose to call itself "computer science" and not use a term that included the term "engineering."

There is no proper argument that computer science is one. Science is largely descriptive: it works primarily on describing that that is. Engineering is primarily prescriptive: it says what should be built and how things should be done. Without the invention of an engineered object, the computer, so-called computer science would have nothing to describe, and computer science's past has been a glorious history of continual improvement, engineering, and invention of some of the most powerful technological artifacts known to man.

None of this says, of course, that science is not used in computer science toward its primary mission of creating better computer systems, and none of this says that computer scientists don't contribute to science through their descriptive efforts, but other disciplines of engineering also use and contribute to science in a similar ways. So why did computer science "defect" from the other engineering disciplines and choose to ally itself nominally with the sciences?

History and prestige give much of the answer. Computer science arose in the aftermath of World War II and the success of the glory projects, the atomic bomb and radar, were largely attributed to the science involved. A more careful reading of history (see this) suggests that the success of the glory project required a good bit of formal and informal engineeirng acumen. Moreover, a better understanding of the Allied victory, generally, would credit engineering as part of a larger production dynamo that produced materiel for the war effort more rapidly and in larger quantities than had ever before been seen. Such nuance is largely lost to the larger culture, however, and the postwar zeitgeist raised the prestige of science. That CS chose to call itself a science can be understood in that milieu even if the name is inaccurate.

Unfortunately, the harm caused by this choice is not restricted to sloppy speech. What a field chooses to call itself matters. For example, the training of CS graduates teaches them "theory" as some sort of pristine exercise in proper mathematics as opposed to teaching model building as something that is integral to proper engineering. As a result, CS majors tend to treat theory as something divorced from the engineering of systems in a manner not seen in disciplines that choose to call their practitioners "engineers." My campaign for little models is largely an effort to counteract the imbalance in the theory education of computer scientists.

Card carrying computer scientists have recognized these problems. For example, Peter Denning's famous article Educating a New Engineer (here, ACM access required) chose the term "engineer" in making a case for change in educating the CS major of the future, and he created what he called the Center for the New Engineer to promulgate that vision. The selection of the term "engineer" was not an accident, and part of the point was to correct the errors of a CS past in which science was overvalued above the primary prescriptive mission of the discipline.

CS is the most recent discipline of engineering to be created, but it probably won't be the last. Let us hope that the next one that comes along has the wisdom and courage to choose a more accurate name.

 

Blogging, specialization, and the postmodern engineer

Self SEO has an interesting post about the art of business blogging. One of the key ideas of the post is that bloggers are unusual birds in this age of specialization in that the good ones tend to be generalists:


Bloggers, from what I have seen in my years of blogging, tend to be more generalist than specialist, except for a few who are both...which is to say they have wide knowledge and interests, but also have one or two things that they know a great deal about.

Business bloggers, ideally, fall into this last category. They know a great deal about their company, service, products and so on, but also have a wide-band tuning which allows them to connect with people of all stripes, types and interests. After all, one should not expect that a customer or client know all the fine points of what they are seeking to buy, yet the seller should know those fine points, but also a good deal about the customer as well. This is where a wide-band understanding comes in handy.

This sounds about right; however, I've been wondering lately whether these postmodern times are as friendly to the notion of a specialist as was the Cold War was. In engineering, at any rate, it seems that companies more and more value those engineers who can communicate with marketers, finance people, manufacturing types, and customers. No longer is it acceptable to sit at a drafting table or CAD tube and think only grand technical thoughts alone. Postmodern engineers (and computer scientists) need to be more broadly educated and broadly interested to be successful, and certain changes are coming about to an engineer's education, albeit slowly, in recognition of the workplace shift.

None of this suggests that specialists won't continue. Division of labor and the efficiency it brings about will continue to demand specialization, but the specialization will be more and more be like the second category of business blogger: "which is to say they have wide knowledge and interests, but also have one or two things that they know a great deal about."

Friday, June 10, 2005

 

Taking a blues break

A group of IlliGAL researchers is on the road doing an important study of .... the blues at the Chicago Blues Fest this weekend. By day, mild-mannered Kumara Sastry bangs out difficult solutions to impossible difference equations with the best of them, but set him down in front of a Buddy Guy recording (You Got a Hole in Your Sole clip here) and he becomes a wild and crazy blues guy.

 

MOGA for designing tires

Yokohama Rubber Co., Ltd., announced that they developed a designing method for tires consisting "multiscale simulation" and "multiperformance map" that is based on multiobjective GA in their Japanese news release issued on 5/24.

Thursday, June 09, 2005

 

SIGEVO Logo Competition: Deadline extended to June 23

The deadline was extended for the new logo design competition for the ACM Special Interest Group for Genetic and Evolutionary Computation (SIGEVO) (the former International Society for Genetic and Evolutionary Computation). The new deadline is June 23. The official announcement follows:


The new ACM SIGEVO needs a logo! Send submissions to
sigevo_logo@yahoo.com

The Special Interest Group for Genetic and Evolutionary Computation
(SIGEVO) was formed in January, 2005, and is looking for a logo. We
are now extending the deadline for submitting entries in the SIGEVO
logo contest. The prize will be free registration for GECCO-2005
(or refund of 2005 registration) or, at winner's choice, free
registration for GECCO-2006. The awarding of the prize will depend
on receipt of qualifying entries from at least five persons.
Each entrant may submit one or several logo entries.

To be considered, the logo must be received by June 23, 2005.

Please note:

1) Submissions may not contain any material copyrighted or trademarked
by anyone other than ACM or ISGEC.

2) The winner must agree that the winning logo becomes the property of
ACM, which will have the right to use, copyright and/or trademark the
logo.

Judging will be during GECCO-2005 by a panel appointed by the SIGEVO
chair, and the final decision will be announced on June 29 during the
SIGEVO business meeting. The logo need not replace the "gecko" logo
associated with the GECCO conference, but may be used for any of
SIGEVO's activities, including GECCO. Please send entries to
sigevo_logo@yahoo.com, as JPEG files. (Please consider that, if the
logo is selected, we'll need it in a vector format).

I hope you'll participate!

Erik Goodman
Chair, SIGEVO


Wednesday, June 08, 2005

 

Riding a tidal bore

Regular reader Nosophorus found my learning to sail post a bit tame and suggests surfing an Amazonian pororoca or tidal bore in the Amazon River delta. In the famous words of the Brian Wilson, "Let's go surfing now, everybody's learning how, come on a safari with me."

Tuesday, June 07, 2005

 

Google is your really good friend

Deepak at AI-complete has a nice post about a paper by Vitanyi and Cilibrasi that is causing a buzz. Read the post and read the paper.

 

OptionsScalper and GAs

OptionsScalper has a post about genetic algorithms here. The newly started blog covers a variety of topics from software engineering, to finance, to emergent computation. Why not take a look?

 

The irrelevance of warrant

Josep Maria Terricabras is one of the most well-know contemporary philosophers of Catalonia. In his blog he has a post reflecting about everyday “naïve induction”. The ultimate point is a reference of how people ignore “the irrelevance of warrant” thesis by David Hume (1711-1776). A rudimentary translation of an excerpt of his post elaborates like this:

“…Support the hypothesis that things that happened in the past in a certain way will keep repeating in the future in the same manner shows a great naivety…”

As I mentioned in an early post, I am revisiting some of the initial works about genetics-based machine learning. Reading such an elaborated argument by Terricabras, besides sending me back to my high school philosophy classes, it reminded the hypothesis that we almost always take for granted in learning process. Once you have properly learning from experience and history, such knowledge will be useful for understanding the new inexperienced situations. In another words, given a training data set (or environment), once I learnt a certain model, such a model will describe the new data I collect. Even in dynamic environments, people assume that a learned dynamic model may explain future dynamics.

Unfortunately, Hume’s irrelevance of warrant thesis reminds us how careful and humble our claims should be about the generalization/inducting capabilities of any machine-based learning system.

 

New Papers About Hierarchical Problem Solving

Recently, Edwin De Jong, Richard A. Watson, and Dirk Thierens wrote a couple of interesting papers about hierarchical problem solving in optimization:

You can find downloadable versions at Edwin's web page. Of course, it's even better if you go to GECCO-2005 and hear Edwin talk about this in person.

Monday, June 06, 2005

 

Learning to sail

One of our readers enjoyed the sailing blogging (here and here), so we continue our voyage with a discussion of sailing lessons. In the States, two organizations dominate keelboat sailing lessons, the American Sailing Association and US Sailing. Both offer curricula and certified instructors to teach budding sailors, and the resulting certification is akin to a driver's license that can be used as a credential to charter a bareboat.

My wife and I have taken ASA courses, starting with 101 and 103 at Torresen Marine in Muskegon, Michigan. I continued on with Bareboat Chartering 104 at Fairwind Sailing in Chicago with Ben Sells. The courses were a combination of hands-on sailing and coursework in the rules of the road, boat terminology and workings, and navigation, and the courses did result in my being able to lease a boat. Once you've got your ticket, it is possible to charter boats all over the world from organizations such as Sunsail and others.

So, if you're looking for a break from the rigors of genetic algorithms, consider learning how to sail today.

 

Yes, but can she do the lambada?

jwz has a nice post about a Japanese robot that can ballroom dance with a partner by sensing necessary spatial movements from partner feedback. Where was she when I was taking dance classes at David Lin's Regent Ballroom?

 

Blog Mining

Given the explosive growth of data generated by bloggers, blog mining seems to become more important than ever. CRM Buyer recently published an article Blog Mining Gets Real by Louis Columbus, who discusses just this:

Like unstructured content captured on Web forms that never really gets used, blogs' explosive growth is generating raw data sets that your company really can't afford to ignore. At the beginning of the year blogs were considered by many industry watchers one of the top ten trends. It's becoming very clear that blog mining is certainly part of that mix.

 

Fighting HIV with antispam tools & more

ComputerWorld has recently published an article on using Microsoft's anti-spam software in developing a vaccine to fight HIV:

Spam-filtering technology may soon save millions of lives, thanks to the technology's potential use in developing a vaccine to fight the deadly human immunodeficiency virus (HIV).

Researchers are conducting in vitro tests of HIV vaccine models developed using Microsoft's anti-spam software, according to Kevin Schofield, general manager, Microsoft Research in Redmond, Washington.

The project, he said, is a joint initiative between Microsoft Research, the University of Washington (UW) in Seattle, and the Royal Perth Hospital in Australia.

The great similarity between how spam works and how HIV cells mutate in the human body has allowed researchers to use Microsoft's machine learning and data mining algorithms to analyze the virus's genetic sequences. The purpose was to identify patterns within the genetic mutations of the virus and the patient's immune system, according to Schofield.


Several years ago, a different approach in the same area was pursued by Stefan Kramer, Luc De Raedt, and Christoph Helma, who analyzed the AIDS Antiviral Screening Database using the levelwise version space algorithm that forms the basis of the inductive query and database system MOLFEA (Molecular Feature Miner); for more information about this approach, click here. Stefan Kramer later coauthored a paper by Ulrich Rueckert, where they present another approach to the same problem, which is based on mining for free (unrooted) trees (more info can be found here).

Sunday, June 05, 2005

 

The pillow papers

The trip to Mexico was interesting for several reasons. The CONCEV 2005 conference was a pool of ideas. I was sitting at Francisco Herrera talk on memetic algorithms and I could not stop thinking how fun was we both were there, and neither of us were giving a talk about genetics-based machine learning. But, that was not the reason for feeling uncomfortable sitting in a very comfortable chair in the auditorium; Francisco’s talk was sharp and inspiring. Suddenly, I just realized that there was something else buzzing in my mind: how easy we may be trying to reinvent the wheel.

In another words, sometimes we realize that after inverting our efforts on a particular endeavor, someone had already gone through the same problem, pain, and thought process. Sometimes trying to face the problem straight without looking around first is a good guarantee for reinventing the wheel. In a previous post, Dave Goldberg has coined the term WORN (write once, read never) for such a phenomenon of postmodern times. Thinking about it, it is hard not to be trap in it with the explosion of available information and the time required to successfully dive in it looking for the clue we need.

Among other reasons, the WORN wheel issue is the tip of the iceberg of why I am currently revisiting four genetics-based machine learning papers. Since I am a Pittsburgh style guy, let me suggest first:


Neri F. and Giordana A. (1995). "A Distributed Genetic Algorithm for Concept Learning", Proceedngs of the International Conference on Genetic Algorithms (Pittsburgh, PA), Morgan Kaufmann, pp. 436-443.


Konstam, A. (1994) “N-group classification using genetic algorithms”, Proceedings of the 1994 ACM symposium on Applied computing (Phoenix, AZ), ACM Press, pp. 212-216.


The other pillow papers I would like to suggest are Stewart Wilson’s ones. These two papers are greatly responsible for changing how people approached the so-called Michigan style classifier systems, setting the basis of the current efforts in the field.


Wilson, S.W. (1991) "The Animat Path to AI". In From Animals to Animats: Proceedings of The First International Conference on Simulation of Adaptive Behavior, J.-A. Meyer and S.W. Wilson, eds., Cambridge, MA: The MIT Press/Bradford Books, pp. 15-21.


Wilson, S.W. (1995), "Classifier fitness based on accuracy", Evolutionary Computation, 3(2), 149-175.

 

Sailing toys

It's the weekend in the summer and a GA guy's fancy turns to...sailing. The family and I had our first sail on the Urdragon last weekend (see previous post here) and had a total blast. Although it's hard to get lost going up and down the Chicago coastline, as the good obsessive compulsive that I am, I'm working on building up my collection of navigational aids and toys.

Just found an Explorist 100 GPS receiver on sale for $79.99 at that famous marine supply store Meijers. The 100 is a basic model, but the technology that is now crammed into a little box for a little price is truly amazing. Also got a VHF 100 two-way handheld marine radio from West Marine (on the way back from China the week before last, I had 8 hours to kill in LA, went to Marina del Rey, went to West Marine, got a VHF radio, rented a bicycle, rode to Santa Monica and back, had coffee, and went back to the airport). Again, the VHF 100 is a basic model, but it is submersible, has digital squelch, and is a pretty good value on sale for $129.95.

Next, we need some good old fashioned charts and charting tools, and I'm looking at Maptech, their Lake Michigan chartbook, and some single charts along with parallel rules and dividers. Maptech is one of a number of companies licensed to reproduce NOAA marine and USGS togographical charts in paper and electronic form.

Throw in a hand bearing compass, and these toys ought to help me train up my crew in the basics of coastal navigation the old fashioned way and in the GPS age. Some time later this summer, the good lab IlliGAL will be transported onto the good ship Urdragon, and we will see if the landlubbing labbies are as good cranking the winches as they are at typing up papers. Genetic algorithms are one thing, general quarters are another.

Friday, June 03, 2005

 

Social Interaction - a Mandatory Ingredient for Higher Level Consciousness

In relation to our previous discussions on machine consciousness, I have to point out that there are very strong indicators that higher levels of consciousness, those relating to self-consciousness and phenomena including the feeling of self-control and free will, are strongly related to social interaction.

The Russian cognitive psychologist L.S.Vygotsky suggested in the 1930s that a significant number of normal human psychological processes could be understood as internalized versions of processes that are inherently social in nature (Vykotsky, 1978, Mind in Society, Harvard University Press) (from Terrence W. Deacon, The Symbolic Species, p. 450). Essentially, Deacon suggests that consciousness is a gradual process in that also our dog and at least all mammals and probably birds experience lower degrees of consciousness. However, our capability to handle symbols and our drive to continuously seek for causes and effects in symbolic form opens up a new world of consciousness. Continuous social interaction and communication shapes the symbolic world and makes us continuously adapt to newest standards and word usages in society. Thus, the Mind can be considered as a Darwin machine - not only potentially as a competitive engine distributing and controlling competitive neural activity (somewhat another topic) - but also in the sense that the symbolic representations in the mind are continuously shaped by the social and intellectual (co-)evolution in society and the world as a whole (with all its good and bad sides).

For other strong indicators that social interaction is a key ingredients to reach also higher levels of intelligence, I recommend Matt Ridley's book on "The Origins of Virtue", of which a review can be found here, a shorter one stressing more the beneficial aspects of social interaction here.

Thursday, June 02, 2005

 

Why we cite & a call for a UCC

The Slurrier has some interesting thoughts on the changing role of academic citation here:

It struck me the other day that there has been a shift in the social role of citation in academia. From what little I understand of it, bibliography arose as credentialing for the citing author, not as a chit paid to the cited. More recently, however — perhaps just since libraries and search have become more automated? — citation has come to be something owed to the cited. You write a paper in hopes that, or perhaps expecting that others will cite you, if only in a review article, somewhere down the line.
I'm not sure I buy the argument. After all, hasn't citation always played a dual function of an author showing familiarity with and acknowledging prior art? Nonetheless, the question whether online search changes citation in important ways is an interesting one. Online search cheapens research, but the lower cost also lowers the bar on what constitutes research. Moreover, online search does little to eliminate the WORN (write once, read never) phenomenon of postmodern times.

Thinking along these lines begs us to ask whether we can eliminate the plethora of citation standards that now exist and the lack of reference work standards. For example, wouldn't it be nice to have a UCC (universal citation code) like a UPC (universal product code). That way we could label citations once in a single database, and have them called up correctly whenever needed. The UCC would also make it easier to track citation usage, as the placement of the UCC in a document could be reported to a tracking website that would keep tabs on citation statistics.

 

Advanced motion software uses GAs

In an earlier post, I discuss a Java applet that I stumbled upon called GA-Walk in which a stick figure skeleton walked something akin to skipping while being electrocuted. Despite its shaky movement, its real-time visualization of fitness improvement over generations made it worthwhile.

Well, this behavior pales in comparison to Natural Motion's Endorphin software. Endorphin uses GAs and AI to develop real-time motion of characters - thus saving weeks of animation cleanup for a even a 30-second PepsiMax ad. The software uses Newton's laws of motion and other physics and biology principles to model such things as muscle tension and fatigue, stored energy in the joints, and hyper-extension (the lack of it). The result is commercial quality, smooth movement (and a $12,795 pricetag). Demos can be found here. This has been used on a battle scene in "Lord of the Rings: The Return of the King" and various games and commercials.

NaturalMotion was founded by former Oxford researchers Torsten Reil and Colm Massey. More information can be found here in this Wired article.

 

GECCO-2005 sked posted

The complete schedule for GECCO-2005 (the biggest, best GA/EC conference on the planet) is available here. Wow, what a lineup. There is still time to register, and even though student and conference housing is full, there are plenty of reasonable hotels available online. I'm particular heartened by the increase in papers on linkage, EDAs, and building blocks (BBs), even papers submitted by former card-carrying members of the BB boobirds. It's too bad it took them two decades to see the light, but better late than never.

 

Darwinian Poetry

Many would agree that one of the good things about genetic and evolutionary algorithms is that they can use a human-based evaluation function (and we could go even beyond that and talk about the human-based genetic algorithm). Genetic algorithms that use human evaluators are called interactive. Psychology Today published an article Darwinian Rhapsody about an interactive genetic algorithm for writing poems. The application is called Darwinian Poetry and can be found here. There is an online interface so you can go there and evaluate some poems yourself. More than 135k poems have been evaluated since July 2004 when the site was launched.

This site reminded me of few other similar sites I've seen over the years, but for some reason I could not recall (or find fast enough) any of them :-)

Wednesday, June 01, 2005

 

More GA fun & games

Rob & I appear to be in a game riff, so here's some more. Project Nero is a collaboration of Digital Media Collaboratory and the Neuroevolution Group at the University of Texas at Austin in which genetic algorithms and neural nets are used to evolve agents that adapt to different scenarios in a game of training and battle. Some battle screenshots are available here, and a variety of movies are available here. The NERO team is headed by Texas faculty member Risto Miikkulainen. Hat tip to lemons in space.

 

Noisy Chess

I believe some of the folks who read this blog might be interested something I just blogged on roundtuit. It's an idea I call noisy chess. I have a theory that backgammon is learnable, while chess isn't, because the noise (dice rolls) in the former make more of the game space accessible.

Noisy Chess is a chess variant for testing this theory. Plus, it might be fun.

And perhaps there is something to explore here about evolvable versus non-evolvable landscapes?

 

Will Wright discusses new game, Spore

Will Wright creator of Sim City, the Sims, and other popular simulation-based games previews his new game Spore in a BBC article. The ambitious project simulates the evolution of organisms in a large multi-player controlled ecosystem. I met Will Wright some number of years ago at a Sante Fe Institute workshop organized by John Holland at the University of Michigan, and in that sense, it is not surprising that he has tackled evolution and genetics as the basis for a computer game. It is not clear from the BBC article or a more detailed account at Gamespy whether Spore uses genetic algorithms or not. If so, the idea is similar to Dawkins Blind Watchmaker code of many years ago, but the organisms in Spore are 3D, dynamic, and generated from 22,000 different animations.

Do any readers know of more GA/EC detail on Spore? The project is breathtaking in its ambition and scope. I'm not much of a computer game fan (ask the 2 penguins), but even I might have to own a copy of this one. Hat tip to Step into the future.

This page is powered by Blogger. Isn't yours?