logo logo European Journal of Educational Research

EU-JER is is a, peer reviewed, online academic research journal.

Subscribe to

Receive Email Alerts

for special events, calls for papers, and professional development opportunities.

Subscribe

Publisher (HQ)

Eurasian Society of Educational Research
Eurasian Society of Educational Research
7321 Parkway Drive South, Hanover, MD 21076, USA
Eurasian Society of Educational Research
Headquarters
7321 Parkway Drive South, Hanover, MD 21076, USA

'polytomous' Search Results



...

Schools and teacher induction programs around the world routinely assess teaching best practice to inform accreditation, tenure/promotion, and professional development decisions. Routine assessment is also necessary to ensure that teachers entering the profession get the assistance they need to develop and succeed. We introduce the Item-Level Assessment of Teaching practice (I-LAST) as a flexible framework-based approach for quantitative evaluation of teaching best practice in the induction stages. We based the I-LAST on a novel framework for teaching best practice, and used Fuller’s scale as a framework for understanding the potential of the I-LAST in providing longitudinal measures for growth. Using the context of a year-long teacher induction program in the Midwestern United States, we collected data through an online survey from 46 teaching supervisors who were asked to evaluate their interns. We used the Rasch partial credit model as a criterion for construct validity, and measured dimensionality and reliability from both Rasch and classical frameworks. The I-LAST was found to be a unidimensional, valid, and reliable measure for teaching best practice. It demonstrated the ability to provide reliable scores for specific sub-dimensions of best practice, including those which manifest at various stages along Fuller’s scale. Potential uses of the I-LAST to advance understanding of the role of teacher induction programs in fostering productive growth in new teachers is discussed.

description Abstract
visibility View cloud_download PDF
10.12973/eu-jer.3.2.87
Pages: 87-109
cloud_download 1204
visibility 1406
5
Article Metrics
Views
1204
Download
1406
Citations
Crossref
5

...

The purposes of this research are: 1) to compare two equalizing tests conducted with Hebara and Stocking Lord method; 2) to describe the characteristics of each equalizing test method using windows’ IRTEQ program. This research employs a participatory approach as the data are collected through questionnaires based on the National Examination Administration of 2018. The samples are classified into group A and group B respectively by 449 and 502 respondents. This paper discusses how to equalize shared items using the anchor method with a set of instruments in the forms of 35 questionnaire items and 6 shared items. In addition, the researcher also uses PARSCALE to estimate each respondent’s skills and each item’s characteristics. The shared items are eventually equalized using IRTEQ program. The results show that there is a significant difference between those conducted using Haebara method (0.592) which produces bigger mean-sigma value and Stocking & Lord (0.00213). Thus, the results show that the shared testing items may improve respondents’ discrimination and increase the difficulty level (parameter b). Due to the availability of shared items, it is good and appropriate to equalize two different tests on different theta skills.

description Abstract
visibility View cloud_download PDF
10.12973/eu-jer.8.4.1071
Pages: 1071-1079
cloud_download 530
visibility 620
3
Article Metrics
Views
530
Download
620
Citations
Crossref
3

Scopus
2

...

The Computer has occupied a comprehensive coverage, especially in education scopes, including in learning-teaching processes, testing, and evaluating. This research aimed to develop computerized adaptive testing (CAT) to measure physics higher-order thinking skills (HOTS), namely PhysTHOTS-CAT. The Research Development used the 4-D developmental model carrying the four phases of define, design, development, and dissemination (4D) developed by Thiagarajan. This testing instrument can give the item test based on the student’s abilities. The research phases include (1) needs analysis and definition, (2) development design (3) development of CAT and assemble the test items into CAT, (4) validation by experts, and (5) feasibility try-out. The findings show that PhysTHOTS-CAT is valid to measure physics HOTS of the 10th-grade students of Senior High School according to 82.28% of teachers and students assessment on PhysTHOTS-CAT content and media. Therefore, it can conclude that PhysTHOTS-CAT can be used and feasible to measure physics HOTS of the 10th-grade students of the Senior High School.

description Abstract
visibility View cloud_download PDF
10.12973/eu-jer.9.1.91
Pages: 91-101
cloud_download 1148
visibility 972
24
Article Metrics
Views
1148
Download
972
Citations
Crossref
24

Scopus
30

...

Research productivity plays an important role in the prestige and reputation among higher education institutions. However, the time spent to do research among Filipino academics is the most pressing issue since they can barely meet the requirement for research productivity. Further, the lack of time for data gathering aggravated the drawbacks for research productivity. Data gathering is at the core of almost all research activity, the absence of factual and reliable data will lead to an invalid and illogical inference. In research years, there has been a massive agglomeration of data in large volumes coming from diverse sources pertaining to almost all facets of human activity which is worthy of investigation- known today as Big Data. This research has two (2) main objectives; the first is to find out the underlying reasons why Filipino academics are not enthusiastic to do research. The second is to evaluate the value of big data utilization for research productivity based on the assessment of the faculty members. This research used the Rasch model to measure the responses of Filipino academics with regards to their reasons for not doing enough research work as well as on their assessment for value creation of big data utilization using a polytomous item response selection scale.

description Abstract
visibility View cloud_download PDF
10.12973/eu-jer.9.1.423
Pages: 423-431
cloud_download 761
visibility 762
2
Article Metrics
Views
761
Download
762
Citations
Crossref
2

Scopus
2

Multiple Intelligences-based Creative Curriculum: The Best Practice

model assessment curriculum multiple intelligences kindergarten

Risky Setiawan , Djemari Mardapi , Aman , Umum Budi Karyanto


...

The purpose of this research is: 1) to develop the model and produce the assessment of creative curriculum-based learning program multiple intelligences (MI), 2) to know the characteristics and impacts of developed product models. Research using multi-years by method R & D (Research and Development) with two phases; First phase: 1) Preliminary survey stage, 2) definition stage, 3) design phase, 4) trial stage, and 5) development stage; The second phase: 1) the instrument design stage through the Forum Group Discussion, 2) the product trial phase of 100 children in Sleman Regency, 3) wide-scale implementation of 200 children in Yogyakarta Province, 4) the evaluation phase with construct analysis and achievement of research subjects' performance, 5 ) the stage of measuring the effectiveness of the product with user perception. The subject comprises 200 children of early childhood and 20 kindergarten teachers in 10 kindergartens in the Yogyakarta province in Indonesia, by the approach of Reflective Measurement Theory (RMT). The results showed that: 1) the MI-based creative curriculum assessment model was developed to meet valid, reliable and conformity criteria of an empirical data model, 2) The implementation of the assessment model had fulfilled the requirements worthy of using three criteria  aspect; 1) The results of the assessment using creative instruments based on multiple intelligences on children get "very good" results, 2) the readiness of the teacher in learning is included in the "good" category; 3) teacher performance appraisal shows the "very good" category, and 4) the benefits of the products developed are in the "very good" category. It was concluded that the developed product had tested empirically and practically so that it was useful in learning in early childhood.

description Abstract
visibility View cloud_download PDF
10.12973/eu-jer.9.2.611
Pages: 611-627
cloud_download 1629
visibility 1372
7
Article Metrics
Views
1629
Download
1372
Citations
Crossref
7

Scopus
5

...

The current study investigated Student-Teacher Relationship Measure (STRM) psychometric properties using Rasch analysis in a sample of middle school female students (N = 995). Rasch Principal Components Analysis revealed psychometric support of two subscales (i.e., Academic and Social Relations). Summary statistics showed good psychometric properties. The category structure and individual statistics (i.e., items and person infit and outfit) were not ideal. Category structure showed that the distances between adjacent thresholds were lower than optimal criteria. Even though findings indicated that items mean square statistics (MNSQ) were optimal, standardized fit statistics (i.e., ZSTD) reflected many misfit persons and items in each subscale. After eliminating the misfit persons and items, the two subscales met the Rasch optimal criteria. The updated short 22-item scale had good psychometric properties, high item and person separation, and good item and person reliability for the two subscales and can be used as a reliable and valid scale.

description Abstract
visibility View cloud_download PDF
10.12973/eu-jer.10.2.957
Pages: 957-973
cloud_download 425
visibility 473
2
Article Metrics
Views
425
Download
473
Citations
Crossref
2

Scopus
2

...

This research is a developmental research aiming at developing a good mathematical test instrument using polytomous responses based on classical and modern theories. This research design uses the Plomp model, which consists of five stages, (1) preliminary investigation, (2) design, (3) realization/construction, (4) revision, and (5) implementation (testing). The study was conducted in three vocational schools in Lampung Province, Indonesia. The study involved 413 students, consisting of 191 male and 222 female students. The data were collected through questionnaire and test. The questionnaire was used to identify the assessment instruments currently employed by teachers and to be validated by the experts of mathematics and educational evaluation. The test used an open polytomous response test numbering of 40 items. The data were analyzed using both classical and modern theories. The results show that (1) the open polytomous response test has a good category according to classical and modern theory. However, the discrimination power of test items in classical theory needs several revisions, (2) the assessment instrument using the polytomous response of open multiple choice can guarantee information on the actual competence of students. This is proven by the fact that there is a harmony between the analysis result obtained from classical and modern theory from the students' arguments when giving reasons for their choices. Therefore, the open polytomous response test can be used as an alternative to learning assessment.

description Abstract
visibility View cloud_download PDF
10.12973/eu-jer.11.3.1441
Pages: 1441-1462
cloud_download 352
visibility 512
0
Article Metrics
Views
352
Download
512
Citations
Crossref
0

Scopus
0

The Development of a Four-Tier Diagnostic Test Based on Modern Test Theory in Physics Education

developing test four-tiers diagnostic test modern test theory

Edi Istiyono , Wipsar Sunu Brams Dwandaru , Kharisma Fenditasari , Made Rai Suci Shanti Nurani Ayub , Duden Saepuzaman


...

Diagnostic tests are generally two or three-tier and based on classical test theory. In this research, the Four-Tier Diagnostic Test (FTDT) was developed based on modern test theory to determine understanding of physics levels: scientific conception (SC), lack of knowledge (LK), misconception (MSC), false negatives (FN), and false positives (FP). The goals of the FTDT are to (a) find FTDT constructs, (b) test the quality of the FTDT, and (c) describe students' conceptual understanding of physics. The development process was conducted in the planning, testing, and measurement phases. The FTDT consists of four-layer multiple-choice with 100 items tested on 700 high school students in Yogyakarta. According to the partial credit models (PCM), the student's responses are in the form of eight categories of polytomous data. The results of the study show that (a) FTDT is built on the aspects of translation, interpretation, extrapolation, and explanation, with each aspect consisting of 25 items with five anchor items; (b) FTDT is valid with an Aiken's V value in the range of 0.85-0.94, and the items fit PCM with Infit Mean Square (INFIT MNSQ) of 0.77-1.30, item difficulty index of 0.12-0.38, and the reliability coefficient of Cronbach's alpha FTDT is 0.9; (c) the percentage of conceptual understanding of physics from large to small is LK type 2 (LK2), FP, LK type 1 (LK1), FN, LK type 3 (LK3), SC, LK type 4 (LK4), and MSC. The percentage sequence of MSC based on the successive material is momentum, Newton's law, particle dynamics, harmonic motion, work, and energy. In addition, failure to understand the concept sequentially is due to Newton's law, particle dynamics, work and energy, momentum, and harmonic motion.

description Abstract
visibility View cloud_download PDF
10.12973/eu-jer.12.1.371
Pages: 371-385
cloud_download 377
visibility 406
0
Article Metrics
Views
377
Download
406
Citations
Crossref
0

Scopus
0

Course Dropout Intention Scale: Development and Validation of a New Brief Measure in Academic College Context

brief measure college student course dropout dropout intention dropout studies

Daniel E. Yupanqui-Lorenzo , Lizbeth Angela Jara-Osorio , Carlos Carbajal-León , Tomás Caycho-Rodríguez , Manuel Antonio Cardoza Sernaqué , Kerly Stefanny Duran Quispe


...

University students may encounter situations where they perform poorly in a course and contemplate dropping out. This intention to drop out of a course manifests not only in thoughts or ideas but also in a cognitive self-evaluation of their performance and skills, enabling them to reflect on the possibility of dropping out. In this sense, there is a shortage of instruments that evaluate the intention to drop out of a course, so the aim was to develop and validate the Course Dropout Intention Scale (CDIS). Data from two samples (N1 = 198; N2 = 675) were used; the first was for the EFA, and the second was for the CFA, GRM, and SEM. The one-factor model was derived from the EFA and confirmed in the second sample, exhibiting appropriate goodness-of-fit indices. Similarly, the GRM obtained adequate fit indices; all items discriminated adequately, and the difficulty parameter had a monotonic increase. The SEM model of the effect of satisfaction with studies on the CDIS showed a negative and statistically significant effect. Thus, it was demonstrated that the CDIS is a robust instrument in its psychometric properties and empirical evidence with other variables.

description Abstract
visibility View cloud_download PDF
10.12973/eu-jer.13.1.103
Pages: 103-113
cloud_download 340
visibility 381
0
Article Metrics
Views
340
Download
381
Citations
Crossref
0

Scopus
0

...