Reprinted from the Proceedings of the IEEE,USA
vol. 82 no. 10 pp.
1492-1500, OCT 1994.
Invited Paper
The MiniDisc system was designed with the obvious objective of replacing the conventional Compact Cassette tape recorder system. The MiniDisc format defines two types of optical discs. One is a recordable magneto-optical disc for user recording and another is a conventional read-only disc for music-software publishing. Audio data compression is used to achieve 74 min of playing time on a 64-mm disc. By means of a built-in buffer memory called Shock Resistant Memory, MiniDisc can be used for outdoor portable applications with great ease. Furthermore, MiniDisc was evolved into the MD Data system and with a data capacity of 140 Mbytes and a very compact size, the MD Data system is expected to become one of the standards for removable data storage systems.
Fig. 1. Example of commercial MiniDisc player. |
Before starting the CD business, many engineers engaged in the development of the CD solely for its improvement in sound quality, but after the introduction of the CD player into the market, we found out that the consumer became aware of the quick random-access characteristic of the optical disc system. In addition, the size of the 12-cm disc was easy to handle compared to that of the LP. The longer lifetime for both the media and the player strongly supported the acceptance of the CD format. The next target of development was obviously to be the rewritable CD. SONY and Philips jointly developed this system and made it a technical reality in 1989. Two different recordable CD systems were established. One is the write-once CD named CD-R and the other is the re-writable CD named CD-MO.
Sales of cassette tapes had been decreasing since 1989, and Sony felt that the compact cassette system was approaching the end of its format life. Even if recordable CD were to be accepted by the consumer, it would still be difficult to break into the portable market. Here, portable compact cassette dominated because of its strong resistance to vibration and its compactness. Clear targets for a new disc system were to overcome these weaknesses. Sony was able to achieve this by introducing a disc system called MiniDisc (MD).
Magneto-optical disc recording technology has been used for computer data storage system for several years. Based on this technology, we had developed direct overwriting technology with a similar recording density as Compact Disc. Additionally, we employed a shock-resistant memory control for portable use and applied a digital audio compression system called ATRAC (Adaptive TRansform Acoustic Coding) that enabled us to use a 64-mm disc size. Recent technological improvement of semiconductors helped to realize this technology.
Fig. 2. Technical position of MiniDisc |
Similar to the way the CD-ROM was realized, the MD Data system was developed based on audio MD. A data capacity of no less than 140 Mbytes, combined with a dedicated file structure present new possibilities to implement the MD Data disc system in several different computer systems as well as future equipment. (Fig 2).
Fig. 3. MiniDisc disc type and groove/pit layout. |
Fig. 4. Pre-mastered pit. |
It is mandatory for the MD system to be able to play back these three
types of discs. This means that the MD recorder/player is equipped
with a dual-function optical pick-up for the read-out of recorded pit
patterns and the magneto-optical disc.
Fig. 5. Recordable groove. |
The MD disc substrate is manufactured by using injection molding methods. Therefor, it is relatively easy to keep an optimal shape of the clamping area to allow for proper loading. Figure 6 shows the cross-sectional view of the disc substrate housed in the cartridge. The optical disc must be centered when it is mounted in the drive unit. As with the CD, the center is aligned using the edge of the inner circumference of the polycarbonate substrate as a reference. Constructed from simple steel, the plate is roughly positioned on the opposite side of the clamping area, to ensure proper clamping and centering. This simple method can also reduce the mechanical stress near the center area of the polycarbonate disc. This way it also helps to avoid birefringence of the polycarbonate substrate.
Fig. 6. Cross-sectional view of MD disc. |
Fig. 7. Configuration of recordable MiniDisc. |
The recorded signal of the premastered pits and of the recordable MD
are very similar to that of the CD. Eight-to-Fourteen Modulation (EFM)
and Cross Interleaved Reed-Solomon Code (CIRC) are employed. The
optical readout parameters of the MD disc, such as laser wavelength,
track pitch, Numerical Aperture (NA) of optical pick up, etc., are
almost the same as that of Compact Disc. (Table 1).
Main Parameters | |
---|---|
Playing & recording time | max 74 minutes |
Cartridge size | 68D x 72W x 5H mm |
Disc Parameters | |
Disc diameter | 64mm |
Disc thickness | 1.2mm |
Diameter of center hole | 11mm |
Starting diameter of program area | 32mm |
Starting diameter of lead in area | 29mm max |
Track pitch | 1.6um |
Scanning velocity | 1.2-1.4 m/sec |
Optical Parameters | |
Laser wave length | 780nm typ. |
Lens NA | 0.45 typ. |
Recording power | 2.5 - 5 mw |
Recording strategy | Magnetic field modulation |
Disc Type | |
Pre mastered disc | High/low reflectivety disc (premastered pits) |
Recordable disc | Magneto-optical disc |
Hybrid disc | Partial premastered pit area, partial magneto-optical area |
Audio Performance | |
Number of channels | Stereo and Mono |
Frequency range | 5-20,000 Hz |
Dynamic range | 105dB |
Wow and flutter | Quartz crystal precision |
Signal Format | |
Sampling frequency | 44.1 KHz |
Coding | ATRAC (Adaptive TRansform Coding) |
Modulation | EFM |
Error correction system | ACIRC |
Table 1. MD System Specification.
Fig. 8. Magnetic-field modulation overwrite system. |
Fig. 9. Comparison of recorded patterns on the disc. |
A different system altogether is used in a laser modulation system that records the signal by switching the level of the semiconductor laser power. The direction to which the magnetic fields are added is only in one direction. The area where the laser light is irradiated will be "1" and the area where no recording is made will be "0". The recording pattern will become nonsymmetrical. In addition to this, there is another problem with the laser modulation system. It is vulnerable to fluctuation in recording power. When the recording power fluctuates, the positions where the edge is started to be written or finished are easily distorted. The length of the pattern will fluctuate.
Fig. 10. Tangential skew tolerance |
Figure 10 shows the result of the experiment. The chart shows the fluctuation of the block error rate when the disc is tilted and recorded/played back. Even when the disc is tilted +/- 1.5deg parallel to the track, the block error rate stays almost stable as long as it is not tilted at the time it is read. Figure 11 shows the data taken when the disc is tilted in the direction toward the radius. The results are almost identical to those when it is tilted parallel to the track. As mentioned so far, there are scores of advantages in the magnetic-field modulation system. These advantages hold the promise of high recording and playback reliability in the mass-production stage of the MD system.
Fig. 11. Radial skew tolerance |
The MD system employs almost the same modulation and error correction code as that of the CD. Eight-to-Fourteen Modulation (EFM) and the Cross Interleave Reed-Solomon Code (CIRC) are employed as the error correction code. It is well known that the combination of the EFM and CIRC provide a high potential of error correction possibility and high recording density. The "naked" CD disc is able to provide sufficient data reliability for a long period. It shows that if the disc is enclosed in a cartridge, we can expect to increase the reliability. Hence we were strongly motivated to use the same system.
Different from the audio CD, we modified the interleave sequences slightly into a pattern suitable for keeping the interpolation between each audio sampling sequences to a continuous data sequence. We called it ACIRC (Advanced Cross Interleave Reed-Solomon Code). In case an uncorrectable error occurs during readout of the data from the MD disc, this modification helps to minimize the lack of the CD-ROM sync header. As mentioned before, MD uses compressed audio data, recorded by grouping the data into blocks. The block format is very similar to the CD-ROM mode-2 standard. In CD-ROM, we use a block address based on that of the CD's subcode address, so that we use the physical address data based on playing time. CD and CD-ROM use minutes, seconds, and frames. With MD we replaced the unit of the address from a time-base oriented one to binary-code based on frame (block) units. We call this unit (frame, block) a "sector." Because of the long interleaved ACIRC error correction code, three sectors must be used as "linking sectors." If the user changes or adds new data to the MD disc, two or three sectors for every start and end position of the new data need to be recorded.
Fig. 12. MiniDisc data configuration. |
The ATRAC encoding process starts with the 16-bit quantization and 44.1-kHz sampled stereo audio signal. The key to ATRAC's efficiency is its unique method of analysis. ATRAC employs a nonuniform slitting in both frequency and time axes, reflecting psychoacoustic principles.
Fig. 13. Block diagram of ATRAC encoder. |
The transform block size is adaptively chosen. There are two modes, long mode: 11.6 ms for all frequency bands and short mode: 1.45 ms for the high-frequency band and 2.9 ms for mid- and low-frequency bands. Normally, the long mode is chosen to provide optimum frequency resolution. However, problems may occur during attack portions of the signal. Specifically, the quantization noise is spread over the entire MDCT block time. Just before the attack of the sound, one may hear some noise. Therefore, ATRAC automatically switches to the short mode to avoid this type of noise.
The MDCT spectral coefficients are the grouped into a Block. The spectral values are quantized by using two parameters. One is word length, another is scale factor. The scale factor defines the full-scale range of he quantization and the word length defines the resolution of the scale. Each Block Floating Unit (BFU) has the same word length and scale factor, reflecting the psychoacoustic similarity of the grouped frequencies. The scale factor is chosen from a fixed table and reflects the magnitude of the spectral coefficients in each BFU. The word length is determined by the bit allocation algorithm. For each sound frame (512 data samples) the following information is stored on disc:
The data size of each sound frame is fixed as 212 bytes. On the disc, 11 stereo sound frames are recorded every 2 sectors. In case of monaural recording, twice the amount of time can be recorded on the disc.
The bit allocation algorithm divides the available data bits between the various BFU's. ATRAC does not specify a bit allocation algorithm. The word length of each BFU is stored on the MiniDisc along with the quantized spectra, so that the decoder is independent from the allocation algorithm. This allows for an evolutionary improvement of the encoder without changing the MiniDisc format. The decoding process is divided into two steps. The decoder first reconstructs the MDCT spectral coefficients from the quantized values, by using the word length and scale factor parameters. The coefficients are transformed back into the time domain by inverse MDCT using either the long mode or the short mode as specified in the parameters. Finally, the three time-domain signals are synthesized into the output signal by QMF synthesis filters.
Fig. 14. Shock resistant memory control. |
Fig. 15. Format comparison between MD and MD-data. |
Main Parameters | |
---|---|
Capacity | 140 Megabytes |
Cartridge size | 68D x 72W x 5H mm |
Normal data transfer rate | approx 150Kbyte/s (300Kbyte/sec double speed) |
Disc Parameters | |
Disc diameter | 64mm |
Disc thickness | 1.2mm |
Diameter of center hole | 11mm |
Starting diameter of program area | 32mm |
Starting diameter of lead in area | 29mm max |
Track pitch | 1.6um |
Scanning velocity | 1.2 m/sec or 2.4 m/sec |
Optical Parameters | |
Laser wave length | 780nm typ. |
Lens NA | 0.45 typ. |
Recording power | 2.5 - 5 mw |
Recording strategy | Magnetic field modulation |
Disc Type | |
Pre mastered disc | High/low reflectivety disc (premastered pits) |
Recordable disc | Magneto-optical disc |
Hybrid disc | Partial premastered pit area, partial magneto-optical area |
Physical Data Structure | |
Modulation | EFM |
Error correction system | ACIRC |
Additional layred ECC | selectable |
Sector size | 2048 or 2336 bytes |
Cluster size | 32 sectors |
Maximum cluster number | approx 2200 |
Maximum sector number | approx 70000 |
File Structure | MD data File System |
Block size | 2kbyte - 64kbyte |
Table 2. MD Data Specification.
The MD-Data volume and file structure was developed to cope with the MD's physical limitations. The data track, similar to the music track on the MD Audio, is defined in the TOC and the U-TOC. The first cluster is used for a boot cluster, and a further 16 clusters are used for the Volume Management Area (VMA), where all files and directory management information are gathered. Because the VMA is physically centralized and is usually cached in semiconductor memory, the number of accesses and rewriting operations are minimized. Even more, considering the physical cluster actually means a minimum rewriting unit, the MD-Data file system can manage a logical block size ranging from 2 to 64 kbytes. These block sizes are uniquely selected depending on the application by the user. The file system can also support a hierarchical directory structure, short- and long-file names, additional information, etc. It allows easy connection of MD Data to various host systems and allows for media exchanges between different systems (Table 2).