IDEAS FOR I548/N560 SEMESTER PROJECTS - 15 Sept. 2007 (DAB = Don Byrd)

Here are some ideas for semester projects for I548. I'd be happy to consider
anything else you think is relevant to the course; in fact, it's better in some ways
if students make their own projects up. However, I'll expect you to convince me your
project can demonstrate your mastery of several of our "core competencies". Please
feel free to discuss any of this with me at any time.

For background information, see my Information Sources for Music Informatics
Students page
(http://www.informatics.indiana.edu/donbyrd/Teach/GeneralInformationSources.HTML)
and my Music IR and Music Informatics Bibliography
(http://www.informatics.indiana.edu/donbyrd/DonMusicIRBibliography.HTML). In
addition, some of the entries include references of their own.


Requiring Programming
---------------------

- Write a simple program to compute the similarity between two melodies, rhythm
patterns, chord progressions, or even complete pieces, represented in some symbolic
form (such a program could be used as the basis for a music-retrieval program). With
what types of music would you expect your program to work well? With what types
would you expect it to work at least somewhat? Test it against a well-thought-out
collection of test files, and report on the results.  [topic: symbolic retrieval]

- Make significant improvements to an existing program that analyzes audio in some
way. Most such programs are probably too difficult for you to do anything with,
given the constraints of time and your limited knowledge, but there are lots of simple
ones around. DAB's R program DescribeAudioSegments (one of the "R Program Examples"
on my website) is an unusually simple -- a better word might be "crude" -- example
of such a program. In what way(s) is your version an improvement? Test yours against
the original with a well-thought-out collection of test files, and report on the
results.

- Make significant improvements to an existing plugin for the Sonic Visualiser
(www.sonicvisualiser.org). Do a user study to determine whether your changes
really are improvements with either subjective or objective metrics.


Could Be Done With Or Without Programming
-----------------------------------------

- Adapt Steve Larson's "theory of musical forces" (see, e.g., "Musical Forces and
Melodic Expectations: Comparing Computer Models and Experimental Results"; abstract at
http://caliber.ucpress.net/doi/abs/10.1525/mp.2004.21.4.457) to recognize similarity
between melodies or even complete polyphonic pieces of music. Devise a way to test
your version of the theory; test it with a well-thought-out collection of music --
it might be possible to do this manually with a small collection -- and report on
the results. [topic: music analysis]

- Devise a way to test Steve Larson's theory of musical forces with a database of
melodies. Run it with a well-thought-out collection of music -- it might be possible
to do this manually with a small collection -- and report on the results. [topic:
music analysis]

- It's obvious that automated Schenkerian analysis, even going just two or three
levels down from the surface, would be incredibly valuable for music IR; however, a
general, style-independent solution is probably not possible in the forseeable
future. But how about a solution for a very limited range of styles -- e.g., only
Anglo-American folksongs or 12-bar blues? Cf. "controllers" in David Cope's EMI
system. Study the problem and report on what you find in as scholarly a way as
possible; preferably write a program and test it, but at least say just what your
method _would_ do for some real music. [topic: music analysis, music perception,
cognitive science]

- Investigate clustering musical documents on whatever basis; this could be very
useful for visualization, recommender or improvisation systems, etc. Cf. several
papers from ISMIR and elsewhere, and techniques like MDS, PCA, Kohonen maps, and
spring embedding.

- Investigate converting between representations of different types (e.g., notation
to MIDI); if possible, write a program to implement your ideas, and test it on a
well-thought-out collection of music. [topic: music representation]

- Investigate user-interface issues in music searching, either content-based or
bibliographic, by designing a user interface and testing it on a number of
volunteers. The testing can be for either subjective or objective factors, but
it should conform to the scholarly method.

- DAB's Extremes of CMN list (on my website) is interesting, but _distributions_ for
some collection of music and one or more of the features (e.g., written pitch
or duration, or just number of augmentation dots!) showing how often various values
occur in a significant body of music would be much more revealing; such distributions
could be useful in statistical authorship studies, for example. Compute distributions
for some of the items in the list and investigate how the resulting information
could be useful. For a music collection, you could use the CCARH database
(http://www.ccarh.org/), with kern data (http://kern.humdrum.net/) accessed via the
Humdrum toolkit, or with MusicXML data (available from me) accessed via a program of
your own. In any case, the programming part of this is relatively easy.

- Propose a new task for MIREX. Why is this task significant? How could entries
be evaluated? One possible task would be OMR. [topic: evaluation]

- Investigate a basis for ranking music documents in search results. With music, as
with text, this is normally done by similarity between the results and the query,
and justified via "relevance". But are these the best concepts for ranking music?
Present solid evidence one way or the other, perhaps via a user study.


Probably Not Involving Programming
----------------------------------

- The "music as different as possible" problem. The idea is to find six or so
pieces of music each of which is considered by listeners to be as different as
possible from all the others; the hard part is to find objective evidence for the
choices! Once you've chosen them, either justify or refute your claim that each is
as different as possible from the others on a basis that's as objective as possible,
almost certainly via a survey of listeners. Better, use such a measure to find the
pieces in the first place. In either case, "as objective as possible" is not likely
to be very objective: discuss the inherent limitations of objectivity here. This
would be a step towards defining a musical style space. [topic: music classification]

- Study perception of musical-instrument timbre. For example, there's evidence that
people can identify instruments from note _attacks_ more reliably than from note
"steady states" (sustains); can you find strong evidence one way or the other? Of
course it depends on the definition of "attack". Another question: if you "morph"
notes from one instrument into notes from another, at what point (in terms of
the physical characteristics of the sound) do people switch from hearing one to the
other? And how well do people agree? You might expect players of an instrument to
be more likely to think intermediate sounds are their instrument -- or more likely
to think they're _not_ their instrument. Do a user study.

- Investigate abstract spaces relevant to music, e.g., timbre or musical style.
To my knowledge, published work on timbre space has been around since the
1970's, but little has been done in many years. Far less has been done on musical
style spaces, which are a far more difficult problem -- but making progress from
almost nothing shouldn't be that hard!

- Study how MIREX works. Compare it to similar undertakings in other domains (TREC
for text IR, the standard speech-recognition and question-answering tests, etc.).
Say in detail how MIREX be improved. Can you give a convincing argument that your
changes would improve MIREX? This seems like a difficult question of experimental
methodology. One way might be to show what would have happened in a previous MIREX
with your changes, but I'm skeptical that could be done in a convincing way. [topic:
evaluation]