Plant Breeding

The nature of code: Why is it

Many find genetics, as a field of science on its own, charming. Many more are excited to learn about the science that fits seamlessly into complexity driven life of organisms, providing explanation for natural phenomena at both micro-evolutionary and macro-evolutionary scales. But only few find fascination with its deep running concepts, going down to more fundamental physical theories. This article tries to at least expose, if not laid satisfying argument, to some of fundamental questions in genetics concerning nature of code (mostly chemical behaviour). In particular, following are some of the questions I plan to touch upon (Credit goes to a student of mine who posed these questions one evening and left me pondering on details):

World production and grain composition of major cultivated species

Context

Out of crops raised for their seed/grains (listed under 35 species, by FAO; FAOSTAT, 2014), only 22 species are produced in substantial amounts. Species of graminae and leguminosae families alone account for about 85 percent of the total grain production. As presented here in Table 1.

Production

(\#tab:world-production)Global production of major cultivated crops
Crop Crop species World production^[Average of 2011 to 2014, FAOSTAT (2016)] (1000 t)
Poaceae
Maize Zea mays L. 950394
Rice Oryza sativa L. 733424
Wheat Triticum spp. 700828
Barley Hordeum vulgare L. 138252
Sorghum Sorghum bicolor (L.) Moench 58647
Millet^[May include members of other genera such as Pennisetum, Papspalm, Setoria and Echinochla] Panicum miliaceum L. 26528
Oat Avena sativa L. 22639
Rye Secale cereale L. 14906
Triticale X Triticosecale Wittm ex A. Camus 14653
Fabaceae
Soybean Glycine max (L.) Merrill 272426
Groundnut^[In the shell] Arachis hypogaea L. 41366
Bean^[Also includes other species of Phaseolus and, in some countries, Vigna species.] Phaseolus vulgaris L. 23898
Chickpea Cicer arietinum L. 12735
Pea, dry^[May include P. arvense (field pea).] Pisum sativum L. 11013
Cowpea Vigna unguiculata (L.) Walp. 6661
Lentil Lens culinaris Medikus 4831
Broad bean Vicia faba L. 4332
Pigeon pea Cajanus cajan L. Millsp. 4454
Others^[Rapeseed is in the Brassicaceae, sunflower and safflower are in the Asteraceae, and sesame is in Pedaliaceae.]
Rapeseed^[May include industrial and edible (canola) types, data from some countries includes mustard (Brassica juncea (L.) Czern, et Coss)] Brassica napus L., B campestris L. 67789
Sunflower Helianthus annuus L. 40931
Sesame Sesamum indicum L. 4738
Safflower Carthamus tinctoris L. 776

Grain composition

(\#tab:grain-comp)Global production of major cultivated crops
Crop Crop species Harvested unit Seed carbohydrate (g_per_kg) Seed oil (g_per_kg) Seed protein (g_per_kg)
Poaceae
Maize Zea mays L. Caryopsis 800 50 100
Rice Oryza sativa L. Caryopsis 880 20 80
Wheat Triticum spp. Caryopsis 750 20 120
Barley Hordeum vulgare L. Caryopsis^[Harvested grain usually includes the lemma and palea] 760 30 120
Sorghum Sorghum bicolor (L.) Moench Caryopsis 820 40 120
Millet^[May include members of other genera such as Pennisetum, Papspalm, Setoria and Echinochla] Panicum miliaceum L. Caryopsis 690 50 110
Oat Avena sativa L. Caryopsis^[Harvested grain usually includes the lemma and palea] 660 80 130
Rye Secale cereale L. Caryopsis 760 20 120
Triticale X Triticosecale Wittm ex A. Camus Caryopsis 594 18 131
Fabaceae
Soybean Glycine max (L.) Merrill Non-endospermic seed 260 170 370
Groundnut^[In the shell] Arachis hypogaea L. Non-endospermic seed 120 480 310
Bean^[Also includes other species of Phaseolus and, in some countries, Vigna species.] Phaseolus vulgaris L. Non-endospermic seed 620 20 240
Chickpea Cicer arietinum L. Non-endospermic seed 680 50 230
Pea, dry^[May include P. arvense (field pea).] Pisum sativum L. Non-endospermic seed 520 60 250
Cowpea Vigna unguiculata (L.) Walp. Non-endospermic seed 570 10 250
Lentil Lens culinaris Medikus Non-endospermic seed 670 10 280
Broad bean Vicia faba L. Non-endospermic seed 560 10 230
Pigeon pea Cajanus cajan L. Millsp. Non-endospermic seed 560 20 250
Others^[Rapeseed is in the Brassicaceae, sunflower and safflower are in the Asteraceae, and sesame is in Pedaliaceae.]
Rapeseed^[May include industrial and edible (canola) types, data from some countries includes mustard (Brassica juncea (L.) Czern, et Coss)] Brassica napus L., B campestris L. Non-endospermic seed 190 480 210
Sunflower Helianthus annuus L. Cypsela 480 290 200
Sesame Sesamum indicum L. Non-endospermic seed 190 540 200
Safflower Carthamus tinctoris L. Cypsela 500 330 140

References

Page 3 and 4, Seed Biology and Yield of Grain Crops, 2nd Edition

Seed growth rate of major cultivated crops

Context

(\#tab:smr-vegetables)Seed growth rate of common cultivated crop species
Crop Number of cultivars Seed growth rate_mean (mg per seed per day) Seed growth rate_range (mg per seed per day) Effective filling period_mean (day) Effective filling period_range (day) Maximum size_mean (mg per seed) Maximum size_range (mg per seed)
Cereals:
Wheat (Triticum aestivum L.) 26 1.4 2.1-1.0 29 45-19 41 55-23
Barley (Horedum vulgare L.) 13 1.6 2.4-0.6 25 43-18 38 50-22
Rice (Oryza sativa L.) 12 1.2 2.0-0.9 24 36-12 28 50-20
Sorghum (Sorghum bicolor (L.) Moench.) 9 0.9 1.9-0.4 23 42-20 28 37-19
Maize (Zea mays L.) (inbreds) 22 7.4 9.7-3.6 31 39-23 228 322-86
Maize (Zea mays L.) (hybrids) 10 8.8 10.4-7.0 35 41-23 302 410-229
Legumes:
Soybean (Glycine max (L.) Merr.) 21 6.8 14.7-3.6 29 46-13 202 484-84
Bean (Phaseolus vulgaris L.) 20 18.9 33.1-10.2 18 24-14 345 540-190
Pea (Pisum sativum L.) 5 10.5 14.3-5.6 22 35-12 195 224-150
Field pea (P. arvense) 2 9.5 13.0-6.0 25 32-18 211 232-190
Broad bean (Vicia faba L.) 3 36.9 55.0-20.0 31 57-16 1104 2017-414
Cowpea (Vigna unguiculata L. Walp) 3 8.4 12.2-4.4 8 9.0-7.0 73 122-32
Groundnut (Arachis hypogaea) 2 12.8 14.0-11.6 44 45-43 563 626-500
Oil seeds:
Flax (Linum usitatissimum L.) 2 0.2 0.3-0.2 31 35-27 8 8.0-7.0
Sunflower (Helianthus annuus L.) 7 1.6 2.0-1.2 34 48-30 54 75-39

References

Page 46, Seed Biology and Yield of Grain Crops, 2nd Edition

Seed Business: Process Management Guide

The problem

Seed business is a multifaceted undertake. Although most businesses suffer strong feedback effects, seed business are more markedly left with those than most others. Let us refer to a simple case description to recapitulate just how pronounced it can be, and we are talking about the success in a long term venture.

Linear mixed model formulation

Introduction

Mixed models are quite tricky, in that, while being very powerful extensions of linear models, they are somewhat difficult to conceptualize and otherwise to specify. Mixed models have, in addition to usual fixed effect combination of factors, random effects structure. These structure need to be specified in the model formula in R. While formula specification of a model is unique in it’s own respect, the formuala expression too leads to an object with differnt properties than a regular R object. Although, the complexity of formula syntax can arbitrary (constrained by classess and methods working on that), a general guideline is applicable for most of the mixed modeling utilities. These include: lme4, nlme, glmmADMB and glmmTMB.

Design and analysis of spit plot experiments

Split plot design

Design and fieldbook template

In a field experiment to test for effects of fungicide on crop, treatment of fungicides may be distinguised into multiple factors – based on chemical constituent, based on formulation, based on the mode of spray, etc. In a general case scenario where two former factors could be controlled, factor combinations may be organized in several different ways. When fully crossed implementation is not possible, split plot design comes to the rescue.

Layout and visualization of experimental design

Functional approach to creating and combing multiple plots

This approach highlights features of gridExtra package that allows combining multiple grob plots using function calls. We explicitly use lapply/split or similar class of purrr functions to really scale the graphics.

We load a Hybrid maize trial dataset, with fieldbook generated using agricolae::design.rcbd(). The dataset looks as shown in Table 1, after type conversion and cleaning.

(\#tab:rcbd-maize-fieldbook)Intermediate maturing hybrids with 50 entries each in 3 replicated blocks
Rep Block Plot Entry col row tillering moisture1 moisture2 Ear count Plant height
1 1 1 1 1 1 3.0 3.5 35 270
1 1 2 3 1 2 3.0 3.5 25 266
1 1 3 18 1 3 3.5 4.0 30 261
1 1 4 32 1 4 4.0 4.5 26 224
1 1 5 37 1 5 4.0 4.5 30 268
1 2 6 27 1 6 4.0 4.5 20 268
1 2 7 21 1 7 4.0 4.5 25 277
1 2 8 13 1 8 3.5 4.0 25 264

For the given dataset, we can draw on the information that Rep variable was used as field level blocking factor (Although separate, Block, variable exists, it was nested inside the Rep.) Therefore, to begin with, we ignore other spatial grouping variable. Now, since the grid graphics only requires two way represenation of plotting data, we have row and col information feeding for that.

Developing flowcharts: an illustration of wheat breeding scheme

Flow diagrams are jam-packed with information. They normally describe a process and actors that are involved in making that happen.

With r package diagram, which uses r’s basic plotting capabilities, constructing flowcharts is as easy as drawing any other graphics.

This post expands on creating simple flowdiagrams using example scenario of a wheat breeding program. The information for this graph was, most notably, deduced from those provided by senior wheat breeder of Nepal, Mr. Madan Raj Bhatta.

Flowering phenology of Rice genotypes in irrigated tropical condition of Nepal

Dhangadhi of Kailali, Nepal remains relatively hotter during summer season with respect to average condition of terai agro-ecology. A regime of high day temperature with bright sunshine and cool night temperature along with plenty of seasonal rain during larger part of rice growing season favors good growth of rice crop in the region.

Based on a range of cultivation sowings carried out in the summer/rainy season of 2018, flowering dates were recorded for each plot of varieties, and the duration since planting to flowering was determined. Here, flowering time is ascribed to the period when approximately 50% of the anthers in each plots could be promptly seen as extruding. I summarize the resulting days to flowering period in the bar chart shown in Figure 1.

Stability analysis: how to guide

Meaning of stability

Comparison of treatments may also imply cross comparison of their stability across multiple environments, especially when a study constitutes a series of trials that are each conducted at different locations and/or at different periods in time (henceforth referred to as MET; Multi-Environment Trial). Several situations exist where only mean based performance analysis are regarded inconclusive.

For example, in varietal release process the authorizing body seeks record of consistent trait performace of certain crop genotype. The imperative is: a variety needs to be stably exhibit it’s characters in the proposed domain of cultivation, which generally is a wide area, throughout a long duration of cultivation cycles. This pre-condition of stable character inheritance is more relevant to crops constituting a homogenous and homozygous population. Either of the location, time period or combination of both, more commonly framed as year in field researches, could be assumed to present an unique environment that treatment entries are tested in. Thus, for results to be widely applicable, performance measures across environments should be more or less stable. To the contrary, the concept of utilizing differential character expression across different environments is often explored when interaction between genotypes and environments result in more desirable character.